Global ETD Search

1	Difference-Based Temporal Module for Monocular Category-Level 6 DoF Object Pose Tracking Chen, Zishen 22 January 2024 (has links) Monocular 6DoF pose tracking has many applications in augmented reality, robotics, and other areas, and with the rise of deep learning, new approaches such as category-level models have proven successful. Temporal information in sequential data is essential for both online and offline tasks: it can improve prediction quality under unexpected disturbances such as occlusion and vibration. In 2D object detection and tracking, substantial research has gone into leveraging temporal information to improve model performance. Lifting temporal processing to 3D space is nevertheless challenging because of the ambiguity of visual data. In this thesis, we propose a method that calculates the temporal difference of points and pixels under the assumption that the K nearest points share similar features. Features extracted from this difference are used to learn weights for the relevant points in the temporal sequence, which are then aggregated to support the prediction for the current frame. We propose a novel difference-based temporal module that incorporates both RGB and 3D point data in a temporal sequence. The module can be integrated with any category-level 6DoF pose tracking model that takes RGB and 3D points as input. We evaluate the module on two state-of-the-art category-level 6D pose tracking models, and the results show that it increases the models' accuracy and robustness in complex scenarios. 6DoF Pose Tracking Temporal Module RGBD
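The idea in entry 1 (K-nearest-neighbour temporal differences that weight and aggregate points from an earlier frame) can be illustrated with a rough sketch. This is not the thesis's module: the function names are hypothetical, the learned weighting is replaced by a plain softmax over the negative difference magnitude, and the final averaging step is an assumption made only to keep the example small.

```python
import math
from typing import List, Sequence


def k_nearest(query: Sequence[float], points: List[Sequence[float]], k: int) -> List[int]:
    """Return indices of the k points closest to `query` (Euclidean distance)."""
    order = sorted(range(len(points)), key=lambda i: math.dist(query, points[i]))
    return order[:k]


def aggregate_temporal(cur_pts, cur_feats, prev_pts, prev_feats, k=2):
    """Fuse each current point's feature with support from the previous frame.

    Assumption: weights come from a softmax over the negative magnitude of the
    temporal feature difference, standing in for a learned weighting network.
    """
    if not prev_pts:
        raise ValueError("previous frame must contain at least one point")
    fused = []
    for point, feat in zip(cur_pts, cur_feats):
        idx = k_nearest(point, prev_pts, k)
        # Temporal difference between each neighbour's feature and the current one.
        diffs = [[pf - cf for pf, cf in zip(prev_feats[i], feat)] for i in idx]
        scores = [-math.sqrt(sum(d * d for d in diff)) for diff in diffs]
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Weighted sum of neighbour features from the previous frame.
        support = [
            sum(w * prev_feats[i][j] for w, i in zip(weights, idx))
            for j in range(len(feat))
        ]
        # Assumed fusion rule: average the current feature with its support.
        fused.append([(cf + s) / 2.0 for cf, s in zip(feat, support)])
    return fused


if __name__ == "__main__":
    cur_pts = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    cur_feats = [[1.0, 2.0], [3.0, 4.0]]
    prev_pts = [(0.1, 0.0, 0.0), (0.9, 0.0, 0.0), (5.0, 5.0, 5.0)]
    prev_feats = [[1.1, 2.1], [2.9, 4.2], [9.0, 9.0]]
    print(aggregate_temporal(cur_pts, cur_feats, prev_pts, prev_feats, k=2))
```

Neighbours whose features differ little from the current point receive larger weights, so an outlier in the previous frame contributes less to the fused feature.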
2	Learned structural and temporal context for dynamic 3D pose optimization and tracking Patel, Mahir 30 September 2022 (has links) Accurate 3D tracking of animals from video recordings is critical for many behavioral studies. However, other than for humans, there is a lack of publicly available datasets of videos of animals that the computer vision community could use for model development. Furthermore, due to occlusion and the uncontrollable nature of the animals, existing pose estimation models suffer from inadequate precision. People rely on biomechanical expertise to design mathematical models to optimize poses to mitigate this issue at the cost of generalization. We propose OptiPose, a generalizable attention-based deep learning pose optimization model, as a part of a post-processing pipeline for refining 3D poses estimated by pre-existing systems. Our experiments show how OptiPose is highly robust to noise and occlusion and can be used to optimize pose sequences provided by state-of-the-art models for animal pose estimation. Furthermore, we will make Rodent3D, a multimodal (RGB, Thermal, and Depth) dataset for rats, publicly available. Artificial intelligence Computer vision Deep learning Pose estimation Pose optimization Pose tracking
3	Triangulation Based Fusion of Sonar Data with Application in Mobile Robot Mapping and Localization Wijk, Olle January 2001 (has links) No description available. mobile robots sensor fusion sonars odometry triangulation based fusion mapping occupancy grids pose tracking localization Kalman filter condensation navigation
4	Triangulation Based Fusion of Sonar Data with Application in Mobile Robot Mapping and Localization Wijk, Olle January 2001 (has links) No description available. mobile robots sensor fusion sonars odometry triangulation based fusion mapping occupancy grids pose tracking localization Kalman filter condensation navigation
5	Facial Feature Tracking and Head Pose Tracking as Input for Platform Games Andersson, Anders Tobias January 2016 (has links) Modern facial feature tracking techniques can automatically extract and accurately track multiple facial landmark points from faces in video streams in real time. Facial landmark points are defined as points distributed on a face with regard to certain facial features, such as eye corners and the face contour. This makes it possible to use facial feature movements as a hands-free human-computer interaction technique. Such alternatives to traditional input devices can give a more interesting gaming experience. They also allow for more intuitive controls and may give greater access to computers and video game consoles for certain disabled users who have difficulty using their arms and/or fingers. This research explores using facial feature tracking to control a character's movements in a platform game. The aim is to interpret facial feature tracker data and convert facial feature movements into game input controls. The facial feature input is compared with other hands-free input methods, as well as with traditional keyboard input. The other hands-free input methods explored are head pose estimation and a hybrid of facial feature and head pose estimation input. Head pose estimation is a method in which the application extracts the angles at which the user's head is tilted. The hybrid input method uses both head pose estimation and facial feature tracking. The input methods are evaluated by user performance and by subjective ratings from voluntary participants playing a platform game with each input method. Performance is measured by the time, the number of jumps, and the number of turns it takes a user to complete a platform level. Jumping is an essential part of platform games. To reach the goal, the player has to jump between platforms. An inefficient input method might make this a difficult task.
Turning is the action of changing the direction of the player character from facing left to facing right or vice versa. This measurement is intended to pick up difficulties in controlling the character's movements. If the player makes many turns, it is an indication that it is difficult to use the input method to control the character's movements efficiently. The results suggest that keyboard input is the most effective input method, while it is also the least entertaining of the input methods. There is no significant difference in performance between facial feature input and head pose input. The hybrid input version has the best results overall of the alternative input methods. The hybrid input method got significantly better performance results than the head pose input and facial feature input methods, while its results showed no statistically significant difference from the keyboard input method. Keywords: Computer Vision, Facial Feature Tracking, Head Pose Tracking, Game Control facial feature tracking head pose tracking alternative interface real-time hci human computer interface Interaction Technologies
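The turn metric in entry 5 (a turn is a change of the character's facing direction between left and right) can be sketched as follows. The per-frame log of facing directions and the function name are assumptions made for illustration; the thesis does not say how its game recorded this data.

```python
from typing import Sequence


def count_turns(facing: Sequence[str]) -> int:
    """Count direction changes in a log of 'left' / 'right' facing values."""
    turns = 0
    previous = None
    for direction in facing:
        if direction not in ("left", "right"):
            raise ValueError(f"unexpected facing value: {direction!r}")
        # A turn is any switch from the previously recorded direction.
        if previous is not None and direction != previous:
            turns += 1
        previous = direction
    return turns


if __name__ == "__main__":
    # Hypothetical log: the character faces right, turns left, then right again.
    log = ["right", "right", "left", "left", "right"]
    print(count_turns(log))
```

A higher count for the same level would, following the abstract's reasoning, suggest that the input method makes the character harder to steer.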
6	Hybrid marker-less camera pose tracking with integrated sensor fusion Moemeni, Armaghan January 2014 (has links) This thesis presents a framework for a hybrid model-free marker-less inertial-visual camera pose tracking with an integrated sensor fusion mechanism. The proposed solution addresses the fundamental problem of pose recovery in computer vision and robotics and provides an improved solution for wide-area pose tracking that can be used on mobile platforms and in real-time applications. In order to arrive at a suitable pose tracking algorithm, an in-depth investigation was conducted into current methods and sensors used for pose tracking. Preliminary experiments were then carried out on hybrid GPS-Visual as well as wireless micro-location tracking in order to evaluate their suitability for camera tracking in wide-area or GPS-denied environments. As a result of this investigation a combination of an inertial measurement unit and a camera was chosen as the primary sensory inputs for a hybrid camera tracking system. After following a thorough modelling and mathematical formulation process, a novel and improved hybrid tracking framework was designed, developed and evaluated. The resulting system incorporates an inertial system, a vision-based system and a recursive particle filtering-based stochastic data fusion and state estimation algorithm. The core of the algorithm is a state-space model for motion kinematics which, combined with the principles of multi-view camera geometry and the properties of optical flow and focus of expansion, form the main components of the proposed framework. The proposed solution incorporates a monitoring system, which decides on the best method of tracking at any given time based on the reliability of the fresh vision data provided by the vision-based system, and automatically switches between visual and inertial tracking as and when necessary. 
The system also includes a novel and effective self-adjusting mechanism, which detects when the newly captured sensory data can be reliably used to correct the past pose estimates. The corrected state is then propagated through to the current time in order to prevent sudden pose estimation errors manifesting as a permanent drift in the tracking output. Following the design stage, the complete system was fully developed and then evaluated using both synthetic and real data. The outcome shows an improved performance compared to existing techniques, such as PTAM and SLAM. The low computational cost of the algorithm enables its application on mobile devices, while the integrated self-monitoring, self-adjusting mechanisms allow for its potential use in wide-area tracking applications. 006.3
7	Approaches to Mobile Robot Localization in Indoor Environments Jensfelt, Patric January 2001 (has links) QC 20100621 mobile robot laser scanner sonar odometric model sensor fusion pose tracking global localization SLAM Kalman filter particle filter Multiple Hypothesis Localization Monte Carlo Localization TECHNOLOGY TEKNIKVETENSKAP
8	Detekce a sledování polohy hlavy v obraze / Head Pose Estimation and Tracking Pospíšil, Aleš January 2011 (has links) This master's thesis addresses head pose detection and tracking in images as one way of improving human-computer interaction. Its main contribution is the use of innovative hardware and software technologies such as Microsoft Kinect, the Point Cloud Library, and the CImg Library. The thesis opens with a summary of previous work on similar topics, followed by a characterization and description of the database created for the purposes of the thesis. The developed head pose detection and tracking system is based on the acquisition of 3D image data and the Iterative Closest Point registration algorithm. The thesis concludes with an evaluation of the resulting system and proposals for its future improvement.
9	Integrating Machine Learning for Intelligent Fitness Exercise Monitoring : master's thesis Эль Хамзауи, У., El Hamzaoui, O. January 2024 (has links) Fitness is important in people's lives. Good fitness habits can improve cardiopulmonary capacity, increase concentration, prevent obesity, and effectively reduce the risk of death. People obtain their fitness knowledge mostly from social media. Research indicates that maintaining fitness is crucial for promoting a healthy way of living and is used to assess one's health-related quality of life. While engaging a fitness trainer can be an effective approach to encouraging regular exercise and overall well-being, it may not always be feasible or affordable. Exercise has numerous health benefits, but if performed incorrectly, it can be both ineffective and potentially hazardous. Individuals who work out without proper supervision often make mistakes such as using improper form, which can lead to severe consequences, such as hamstring injuries or falls. but learning ability is limited.
Incomplete fitness is likely to lead to injury, and a cheap, timely, and accurate fitness detection system can reduce the risk of fitness injuries and effectively improve people's fitness awareness. Many past studies have addressed the detection of fitness movements; among them, detection based on wearable devices, body nodes, and image deep learning has achieved better performance. However, a wearable device cannot detect a variety of fitness movements, may hinder the user's exercise, and has a high cost. Both body-node-based and image-deep-learning-based methods have lower costs, but each has some drawbacks. Therefore, this work used human pose estimation algorithms such as YOLOv7, OpenPose and, in particular, MediaPipe to optimize squat performance across various skill levels; the system provides real-time analysis of squat technique. Customized modes tailored for beginners and professionals deliver personalized feedback, empowering users to refine their form effectively. By employing techniques from computer vision and machine learning, including MediaPipe, OpenCV, and Python, the system tracks users' movements, providing on-screen guidance and auditory cues for posture correction and workout progression. AI-Fit offers a solution for individuals to exercise safely with expert guidance and addresses the need for personalized fitness training, injury prevention, and motivation, ultimately enhancing users' overall physical fitness and well-being. MASTER'S THESIS COMPUTER VISION OPENCV YOLOV7 MEDIAPIPE BLAZEPOSE COCO VGG MPII IMAGE PROCESSING HUMAN POSE ESTIMATION POSE TRACKING ALGORITHMS REAL-TIME MOVEMENT ANALYSIS POSTURE CORRECTION
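The squat analysis in entry 9 can be illustrated with a knee-angle check on three 2D keypoints. This is only a sketch: the keypoints are assumed to come from some pose estimator, the thresholds `down_max` and `up_min` are hypothetical values chosen for the example, and none of it is taken from the thesis's implementation.

```python
import math
from typing import Tuple

Point = Tuple[float, float]


def joint_angle(a: Point, b: Point, c: Point) -> float:
    """Return the angle at b, in degrees, formed by the segments b-a and b-c."""
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    n1 = math.hypot(*v1)
    n2 = math.hypot(*v2)
    if n1 == 0.0 or n2 == 0.0:
        raise ValueError("keypoints must not coincide with the joint")
    cosine = (v1[0] * v2[0] + v1[1] * v2[1]) / (n1 * n2)
    # Clamp to guard against floating-point drift outside [-1, 1].
    cosine = max(-1.0, min(1.0, cosine))
    return math.degrees(math.acos(cosine))


def squat_phase(knee_angle_deg: float, down_max: float = 100.0, up_min: float = 160.0) -> str:
    """Classify a knee angle; both thresholds are illustrative assumptions."""
    if knee_angle_deg <= down_max:
        return "down"
    if knee_angle_deg >= up_min:
        return "up"
    return "transition"


if __name__ == "__main__":
    # Hypothetical hip, knee, and ankle keypoints in image coordinates.
    hip, knee, ankle = (0.0, 1.0), (0.0, 0.0), (1.0, 0.0)
    angle = joint_angle(hip, knee, ankle)
    print(round(angle), squat_phase(angle))
```

Separate threshold pairs could be used for beginner and professional modes, in the spirit of the customized feedback that the abstract describes.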
