161

What, When, and Where Exactly? Human Activity Detection in Untrimmed Videos Using Deep Learning

Rahman, Md Atiqur 06 December 2023 (has links)
Over the past decade, there has been an explosion in the volume of video data, including internet videos and surveillance camera footage. These videos often feature extended durations with unedited content, predominantly filled with background clutter, while the relevant activities of interest occupy only a small portion of the footage. Consequently, there is a compelling need for advanced processing techniques to automatically analyze this vast reservoir of video data, specifically with the goal of identifying the segments that contain the events of interest. Given that humans are the primary subjects in these videos, comprehending human activities plays a pivotal role in automated video analysis. This thesis seeks to tackle the challenge of detecting human activities from untrimmed videos, aiming to classify and pinpoint these activities both in their spatial and temporal dimensions. To achieve this, we propose a modular approach. We begin by developing a temporal activity detection framework, and then progressively extend the framework to support activity detection in the spatio-temporal dimension. To perform temporal activity detection, we introduce an end-to-end trainable deep learning model leveraging 3D convolutions. Additionally, we propose a novel and adaptable fusion strategy to combine both the appearance and motion information extracted from a video, using RGB and optical flow frames. Importantly, we incorporate the learning of this fusion strategy into the activity detection framework. Building upon the temporal activity detection framework, we extend it by incorporating a spatial localization module to enable activity detection both in space and time in a holistic end-to-end manner. To accomplish this, we leverage shared spatio-temporal feature maps to jointly optimize both spatial and temporal localization of activities, thus making the entire pipeline more effective and efficient. 
Finally, we introduce several novel techniques for modeling actor motion, specifically designed for efficient activity recognition. This is achieved by harnessing 2D pose information extracted from video frames and then representing human motion through bone movement, bone orientation, and body joint positions. Our experimental evaluations, conducted on benchmark datasets, demonstrate the effectiveness of the proposed temporal and spatio-temporal activity detection methods compared to the current state of the art. Moreover, the proposed motion representations excel in both performance and computational efficiency. Ultimately, this research paves the way toward imbuing computers with social visual intelligence, enabling them to comprehend human activities at any time and place, and opening up exciting possibilities for the future.
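The learned two-stream fusion idea described above can be sketched in a deliberately minimal form. This is an illustration only, not the thesis's actual fusion module: it assumes a single trainable logit per stream (here called `fusion_logits`), whereas the thesis learns its fusion strategy jointly with the full detection network.

```python
import math

def softmax(logits):
    exps = [math.exp(v) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def fuse_scores(rgb_scores, flow_scores, fusion_logits):
    # Learned scalar weights for the appearance (RGB) and motion (optical
    # flow) streams; softmax keeps them positive and summing to one.
    w_rgb, w_flow = softmax(fusion_logits)
    return [w_rgb * r + w_flow * f
            for r, f in zip(rgb_scores, flow_scores)]

# With equal logits the fusion reduces to a plain average of the streams.
fused = fuse_scores([0.8, 0.2], [0.4, 0.6], [0.0, 0.0])
```

Because the logits sit inside the network, gradient descent can shift the balance between appearance and motion per dataset, rather than fixing it by hand.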
162

Controllable Visual Synthesis

AlBahar, Badour A. Sh A. 08 June 2023 (has links)
Computer graphics has become an integral part of various industries such as entertainment (i.e., films and content creation), fashion (i.e., virtual try-on), and video games. Computer graphics has evolved tremendously over the past years. It has shown remarkable image generation improvement from low-quality, pixelated images with limited details to highly realistic images with fine details that can often be mistaken for real images. However, the traditional pipeline of rendering an image in computer graphics is complex and time-consuming. The whole process of creating the geometry, material, and textures requires not only time but also significant expertise. In this work, we aim to replace this complex traditional computer graphics pipeline with a simple machine learning model. This machine learning model can synthesize realistic images without requiring expertise or significant time and effort. Specifically, we address the problem of controllable image synthesis. We propose several approaches that allow the user to synthesize realistic content and manipulate images to achieve their desired goals with ease and flexibility. / Doctor of Philosophy / Computer graphics has become an integral part of various industries such as entertainment (i.e., films and content creation), fashion (i.e., virtual try-on), and video games. Computer graphics has evolved tremendously over the past years. It has shown remarkable image generation improvement from low-quality, pixelated images with limited details to highly realistic images with fine details that can often be mistaken for real images. However, the traditional process of generating an image in computer graphics is complex and time-consuming. You need to set up a camera and light, and create objects with all sorts of details. This requires not only time but also significant expertise. In this work, we aim to replace this complex traditional computer graphics pipeline with a simple machine learning model.
This machine learning model can generate realistic images without requiring expertise or significant time and effort. Specifically, we address the problem of controllable image synthesis. We propose several approaches that allow the user to synthesize realistic content and manipulate images to achieve their desired goals with ease and flexibility.
163

TO KILL AND TO BE KILLED: THE TRANSFERENCE, TRANSFORMATION AND USE OF THE SMITING POSE IN EGYPT AND THE AEGEAN DURING THE BRONZE AGE

Kellenbarger, Tenninger 08 1900 (has links)
The smiting pose is a motif used by the Egyptians, Minoans, and Mycenaeans during the Bronze Age (ca. 3000–1200 BCE). Although the smiting pose has been identified as an emblem of the pharaonic office, the pose has never been investigated in the field of Aegean prehistory. Instead, the motif is incorporated as evidence in larger discussions, such as those of warriors and warfare in the Aegean during the Late Bronze Age; in these arguments, iconography bearing the pose is cited as evidence for the presence of martial Minoans and is rarely examined beyond that role. This dissertation investigates the smiting scenes from Egypt, Crete, and the Greek Mainland and examines them to answer two questions: how did people create and express power in the Eastern Mediterranean, and how did trade networks influence this? The first part of this approach considers the different trade routes explored by Crete and the Mainland, as well as the role the Aegean peoples played in international trade networks. The second part focuses on the smiting motif in its regional contexts to explore how each culture constructed and represented power through violence to fit its concepts of ruling and kingship. / Art History
164

Automated Implementation of the Edinburgh Visual Gait Score (EVGS)

Ramesh, Shri Harini 14 July 2023 (has links)
Analyzing a person's gait is important in determining their physical and neurological health. However, typical motion analysis laboratories exist only in urban specialty care facilities and can be expensive due to the specialized personnel and technology these examinations require. Many patients, especially those who reside in underdeveloped or isolated locations, find it impractical to travel to such facilities. With the help of recent developments in high-performance computing and artificial intelligence models, it is now feasible to evaluate human movement using digital video. Over the past 20 years, various visual gait analysis tools and scales have been developed. A study of the literature and discussions with physicians who are domain experts revealed that the Edinburgh Visual Gait Score (EVGS) is one of the most effective scales currently available. Clinical implementations of EVGS currently rely on human scoring of videos. In this thesis, an algorithmic implementation of EVGS scoring based on hand-held smartphone video was developed. Walking gait was recorded using a handheld smartphone at 60 Hz as participants walked along a hallway. Body keypoints representing joints and limb segments were then identified using the OpenPose Body-25 pose estimation model. A new algorithm was developed to identify foot events and strides from the keypoints and to determine EVGS parameters at the relevant strides. The stride identification results were compared with ground-truth foot events manually labeled through direct observation, and the EVGS results were compared with evaluations by human scorers. Stride detection was accurate to within 2 to 5 frames. The level of agreement between the scorers and the algorithmic EVGS score was strong for 14 of the 17 parameters, and the algorithmic EVGS results were highly correlated with the scorers' scores (r > 0.80) for eight of the 17 parameters.
Smartphone-based remote motion analysis with automated implementation of the EVGS could be employed in a patient's own community, eliminating the need to travel. These results demonstrate the viability of automated EVGS for remote human motion analysis.
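To make the foot-event step concrete, here is a hedged sketch, not the thesis's actual algorithm: one simple way to flag foot contacts from a pose-estimation output is to find well-separated local extrema of the ankle keypoint's vertical coordinate over the frame sequence, then treat the intervals between consecutive contacts as strides.

```python
import math

def detect_foot_contacts(ankle_y, min_separation=20):
    """Flag frames where the ankle's vertical image coordinate is a local
    maximum (image y grows downward, so this is the ankle's lowest point),
    a simple proxy for a foot-contact event. min_separation (in frames)
    suppresses duplicate detections within a single stride."""
    contacts = []
    for i in range(1, len(ankle_y) - 1):
        if ankle_y[i] >= ankle_y[i - 1] and ankle_y[i] > ankle_y[i + 1]:
            if not contacts or i - contacts[-1] >= min_separation:
                contacts.append(i)
    return contacts

# Synthetic 60 Hz ankle trajectory: one stride per second for 3 seconds.
trajectory = [math.sin(2 * math.pi * t / 60) for t in range(180)]
contacts = detect_foot_contacts(trajectory)
strides = list(zip(contacts, contacts[1:]))  # (start, end) frame pairs
```

Real keypoint streams are noisier than this sinusoid, so a practical version would first smooth the trajectory and handle missed detections; the thesis's algorithm operates on full Body-25 keypoint sets rather than a single coordinate.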
165

Reconstructing 3D Humans From Visual Data

Zheng, Ce 01 January 2023 (has links) (PDF)
Understanding humans in visual content is fundamental for numerous computer vision applications. Extensive research has been conducted in the field of human pose estimation (HPE) to accurately locate joints and construct body representations from images and videos. Expanding on HPE, human mesh recovery (HMR) addresses the more complex task of estimating the 3D pose and shape of the entire human body. HPE and HMR have gained significant attention due to their applications in areas such as digital human avatar modeling, AI coaching, and virtual reality [135]. However, HPE and HMR come with notable challenges, including intricate body articulation, occlusion, depth ambiguity, and the limited availability of annotated 3D data. Despite the progress made so far, the research community continues to strive for robust, accurate, and efficient solutions in HPE and HMR, advancing us closer to the ultimate goals in the field. This dissertation tackles several of these challenges. The initial focus is on video-based HPE, where we propose a transformer architecture named PoseFormer [136] that captures the spatial relationships between body joints and the temporal correlations across frames. This approach effectively harnesses the comprehensive connectivity and expressive power of transformers, leading to improved pose estimation accuracy in video sequences. Building upon this, the dissertation addresses the heavy computational and memory burden associated with image-based HMR. Our proposed Feature Map-based Transformer method (FeatER [133]) and Pooling Attention Transformer method (POTTER [130]) demonstrate superior performance while significantly reducing computational and memory requirements compared to existing state-of-the-art techniques. Furthermore, a diffusion-based framework (DiffMesh [134]) is proposed for reconstructing high-quality human mesh outputs from input video sequences.
These achievements provide practical and efficient solutions that cater to the demands of real-world applications in HPE and HMR. In this dissertation, our contributions advance the fields of HPE and HMR, bringing us closer to accurate and efficient solutions for understanding humans in visual content.
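The spatial-then-temporal factorization mentioned for the video transformer can be illustrated with a toy sketch. This is not PoseFormer itself: it uses single-head attention with identity query/key/value projections and no learned weights, purely to show how attention is applied first across joints within a frame and then across frames for each joint.

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def self_attention(tokens):
    # Single-head dot-product self-attention with identity projections:
    # each token becomes an attention-weighted mix of all tokens.
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in tokens]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, tokens))
                    for j in range(d)])
    return out

def spatial_then_temporal(clip):
    # clip: frames x joints x features. First attend across joints within
    # each frame (spatial), then across frames for each joint (temporal).
    spatial = [self_attention(frame) for frame in clip]
    n_joints = len(clip[0])
    temporal = [self_attention([frame[j] for frame in spatial])
                for j in range(n_joints)]
    # Re-assemble to frames x joints x features.
    return [[temporal[j][t] for j in range(n_joints)]
            for t in range(len(clip))]

# Two frames, three joints, two feature channels per joint.
clip = [[[0.1, 0.2], [0.3, 0.1], [0.2, 0.4]],
        [[0.2, 0.2], [0.4, 0.1], [0.1, 0.5]]]
refined = spatial_then_temporal(clip)
```

Factorizing attention this way keeps the cost at (joints² + frames²) per block instead of (joints × frames)², which is one reason such designs scale to long video sequences.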
166

STUDENT ATTENTIVENESS CLASSIFICATION USING GEOMETRIC MOMENTS AIDED POSTURE ESTIMATION

Gowri Kurthkoti Sridhara Rao (14191886) 30 November 2022 (has links)
Body posture provides rich information about a person's current state of mind. This idea is used to implement a system that provides feedback to lecturers on how engaging a class has been by identifying students' attentiveness levels. Posture information is extracted with the help of MediaPipe, and a novel method of extracting features from the keypoints returned by MediaPipe is proposed. Classification using geometric-moment-aided features performs better than classification using general distance and angle features. To extend single-person pose classification to multi-person pose classification, object detection is implemented. Feedback covering the entire lecture is generated as the output of the system.
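As a hedged illustration of what geometric-moment features over pose keypoints can look like (the abstract does not specify which moments are used, so the second-order set below is an assumption), central moments of the 2D keypoint cloud give translation-invariant descriptors of a posture's spread and tilt:

```python
def central_moment(points, p, q):
    # Central geometric moment mu_pq of a set of 2D keypoints; subtracting
    # the centroid makes the feature invariant to where the person sits
    # in the frame.
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    return sum((x - cx) ** p * (y - cy) ** q for x, y in points)

def posture_features(points):
    # Second-order moments describe the spread and tilt of the posture.
    return [central_moment(points, 2, 0),   # horizontal spread
            central_moment(points, 0, 2),   # vertical spread
            central_moment(points, 1, 1)]   # covariance / tilt

# Four keypoints at the corners of a square: symmetric, so zero tilt.
features = posture_features([(0, 0), (2, 0), (0, 2), (2, 2)])
```

Unlike raw pairwise distances and angles, such moments summarize the whole keypoint configuration in a few numbers, which may explain their advantage as classifier inputs.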
167

Analysis and control of an eight degree-of-freedom manipulator

Nyzen, Robert J. January 1999 (has links)
No description available.
168

Rigorous Model of Panoramic Cameras

Shin, Sung Woong 31 March 2003 (has links)
No description available.
169

Using a Leadership and Civic Engagement Course to Address the Retention of African American Males

Cunningham, Patricia Frances Rene 20 October 2011 (has links)
No description available.
170

Deep Learning for estimation of fingertip location in 3-dimensional point clouds: An investigation of deep learning models for estimating fingertips in a 3D point cloud and its predictive uncertainty

Hölscher, Phillip January 2021 (has links)
Sensor technology is rapidly developing and, consequently, the generation of point cloud data is constantly increasing. Since the recent release of PointNet, it is possible to process this unordered 3-dimensional data directly in a neural network. The company TLT Screen AB, which develops cutting-edge tracking technology, seeks to optimize the localization of the fingertips of a hand in a point cloud. To do so, the identification of relevant 3D neural network models for modeling hands and detecting fingertips in various hand orientations is essential. Hand PointNet processes point clouds of hands directly and estimates fixed points (joints) of the hand, including the fingertips. This model was therefore selected to optimize the localization of fingertips for TLT Screen AB and forms the subject of this research. The model has advantages over conventional convolutional neural networks (CNNs). First, in contrast to a 2D CNN, Hand PointNet can use the full 3-dimensional spatial information. Compared to a 3D CNN, moreover, it avoids unnecessarily voluminous data and enables more efficient learning. The model was trained and evaluated on the public MSRA Hand dataset. In contrast to previously published work, the main object of this investigation is the estimation of only 5 joints, one per fingertip. The behavior of the model under a reduction from the usual 21 joints to 11 and then to only 5 joints is examined. It is found that reducing the number of joints increases the mean error of the estimated joints, and that the distribution of the residuals of the fingertip estimates becomes less dense. MC dropout, used to study the prediction uncertainty for the fingertips, shows that the uncertainty increases when the number of joints is decreased. Finally, the results show that the uncertainty is greatest for the prediction of the thumb tip; starting from the tip of the thumb, the uncertainty of the estimates decreases with each additional fingertip.
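The MC dropout procedure used above for uncertainty estimation can be sketched on a toy model. This is a minimal illustration of the general technique, not the thesis's Hand PointNet setup: dropout is kept active at test time, several stochastic forward passes are run, and the spread of the outputs serves as the predictive-uncertainty estimate.

```python
import random
import statistics

def mc_dropout_predict(weights, x, p_drop=0.2, n_samples=200, seed=0):
    """Monte Carlo dropout on a toy linear regressor: each forward pass
    randomly zeroes weights with probability p_drop (rescaling survivors
    to preserve the expectation), and the mean/std over passes give the
    prediction and its uncertainty."""
    rng = random.Random(seed)
    outputs = []
    for _ in range(n_samples):
        y = sum(0.0 if rng.random() < p_drop else (w / (1 - p_drop)) * xi
                for w, xi in zip(weights, x))
        outputs.append(y)
    return statistics.mean(outputs), statistics.stdev(outputs)

# Deterministic prediction would be 0.5*1 - 0.3*2 + 0.8*3 = 2.3.
mean, std = mc_dropout_predict([0.5, -0.3, 0.8], [1.0, 2.0, 3.0])
```

In the thesis's setting the same idea applies per predicted joint, which is what allows the per-fingertip uncertainties (largest at the thumb tip) to be compared.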
