• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 196
  • 24
  • 17
  • 10
  • 9
  • 6
  • 6
  • 3
  • 2
  • 2
  • 2
  • 1
  • 1
  • 1
  • 1
  • Tagged with
  • 339
  • 214
  • 143
  • 105
  • 70
  • 61
  • 56
  • 48
  • 44
  • 43
  • 43
  • 43
  • 39
  • 38
  • 36
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
141

Robust Classification of Head Pose from Low Resolution Images Under Various Lighting Condition

Khaki, Mohammad January 2017 (has links)
Companies have long been interested in gauging the customer’s level of interest in their advertisements. By analyzing the gaze direction of individuals viewing a public advertisement, we can infer their level of engagement. Head pose detection allows us to deduce pertinent information about gaze direction. Using video sensors, machine learning methods, and image processing techniques, information pertaining to the head pose of people viewing advertisements can be automatically collected and mined. We propose a method for the coarse classification of head pose from low-resolution images in crowded scenes captured through a single camera and under different lighting conditions. Our method improves on the technique described in [1]; we introduce several modifications to the latter scheme to improve classification accuracy. First, we devise a mechanism that uses a cascade of three binary Support Vector Machines (SVM) classifiers instead of a single multi-class classifier. Second, we employ a bigger dataset for training by combining eight publicly available databases. Third, we use two sets of appearance features, Similarity Distance Map (SDM) and Gabor Wavelet (GW), to train the SVM classifiers. The scheme is tested with cross validation using the dataset and on videos we collected in a lab experiment. We found a significant improvement in the results achieved by the proposed method over existing schemes, especially for video pose classification. The results show that the proposed method is more robust under varying light conditions and facial expressions and in the presences of facial accessories compared to [1].
142

Human pose augmentation for facilitating Violence Detection in videos: a combination of the deep learning methods DensePose and VioNetHuman pose augmentation for facilitating Violence Detection in videos: a combination of the deep learning methods DensePose and VioNet

Calzavara, Ivan January 2020 (has links)
In recent years, deep learning, a critical technology in computer vision, has achieved remarkable milestones in many fields, such as image classification and object detection. In particular, it has also been introduced to address the problem of violence detection, which is a big challenge considering the complexity to establish an exact definition for the phenomenon of violence. Thanks to the ever increasing development of new technologies for surveillance, we have nowadays access to an enormous database of videos that can be analyzed to find any abnormal behavior. However, by dealing with such huge amount of data it is unrealistic to manually examine all of them. Deep learning techniques, instead, can automatically study, learn and perform classification operations. In the context of violence detection, with the extraction of visual harmful patterns, it is possible to design various descriptors to represent features that can identify them. In this research we tackle the task of generating new augmented datasets in order to try to simplify the identification step performed by a violence detection technique in the field of Deep Learning. The novelty of this work is to introduce the usage of DensePose model to enrich the images in a dataset by highlighting (i.e. by identifying and segmenting) all the human beings present in them. With this approach we gained knowledge of how this algorithm performs on videos with a violent context and how the violent detection network benefit from this procedure. Performances have been evaluated from the point of view of segmentation accuracy and efficiency of the violence detection network, as well from the computational point of view. Results shows how the context of the scene is the major indicator that brings the DensePose model to correct segment human beings and how the context of violence does not seem to be the most suitable field for the application of this model since the common overlap of bodies (distinctive aspect of violence) acts as disadvantage for the segmentation. For this reason, the violence detection network does not exploit its full potential. Finally, we understood how such augmented datasets can boost up the training speed by reducing the time needed for the weights-update phase, making this procedure a helpful adds-on for implementations in different contexts where the identification of human beings still plays the major role.
143

Towards a framework for multi class statistical modelling of shape, intensity, and kinematics in medical images

Fouefack, Jean-Rassaire 10 August 2021 (has links)
Statistical modelling has become a ubiquitous tool for analysing of morphological variation of bone structures in medical images. For radiological images, the shape, relative pose between the bone structures and the intensity distribution are key features often modelled separately. A wide range of research has reported methods that incorporate these features as priors for machine learning purposes. Statistical shape, appearance (intensity profile in images) and pose models are popular priors to explain variability across a sample population of rigid structures. However, a principled and robust way to combine shape, pose and intensity features has been elusive for four main reasons: 1) heterogeneity of the data (data with linear and non-linear natural variation across features); 2) sub-optimal representation of three-dimensional Euclidean motion; 3) artificial discretization of the models; and 4) lack of an efficient transfer learning process to project observations into the latent space. This work proposes a novel statistical modelling framework for multiple bone structures. The framework provides a latent space embedding shape, pose and intensity in a continuous domain allowing for new approaches to skeletal joint analysis from medical images. First, a robust registration method for multi-volumetric shapes is described. Both sampling and parametric based registration algorithms are proposed, which allow the establishment of dense correspondence across volumetric shapes (such as tetrahedral meshes) while preserving the spatial relationship between them. Next, the framework for developing statistical shape-kinematics models from in-correspondence multi-volumetric shapes embedding image intensity distribution, is presented. The framework incorporates principal geodesic analysis and a non-linear metric for modelling the spatial orientation of the structures. More importantly, as all the features are in a joint statistical space and in a continuous domain; this permits on-demand marginalisation to a region or feature of interest without training separate models. Thereafter, an automated prediction of the structures in images is facilitated by a model-fitting method leveraging the models as priors in a Markov chain Monte Carlo approach. The framework is validated using controlled experimental data and the results demonstrate superior performance in comparison with state-of-the-art methods. Finally, the application of the framework for analysing computed tomography images is presented. The analyses include estimation of shape, kinematic and intensity profiles of bone structures in the shoulder and hip joints. For both these datasets, the framework is demonstrated for segmentation, registration and reconstruction, including the recovery of patient-specific intensity profile. The presented framework realises a new paradigm in modelling multi-object shape structures, allowing for probabilistic modelling of not only shape, but also relative pose and intensity as well as the correlations that exist between them. Future work will aim to optimise the framework for clinical use in medical image analysis.
144

Palm Programmierung unter Linux

Jahre, Daniel 12 March 2002 (has links)
Die PDAs von Palm Inc. und seinen Lizenznehmern werden gerne zur Adress- und Terminverwaltung eingesetzt. Damit ist ihr Leistungspotential jedoch nicht erschöpft. Wer gerne selbst Applikationen für Palm PDAs entwickeln möchte, ist dabei nicht zwingend auf eine windowsbasierte Entwicklungsumgebung angewiesen. Unter Linux gibt es Compiler, Ressourceeditoren und Emulatoren für PalmOS. Ich werde in meinem Vortrag diese Werkzeuge vorstellen, demonstrieren und ein Beispielprogramm zeigen.
145

Positional calibration methods for linear pipetting robot

Uudelepp, Oscar January 2020 (has links)
This thesis aims to investigate and develop two positional calibration methods that can be applied to a linear pipetting robot. The goal of the calibration is to detect displacements that have been made to objects that are located in the the robot’s reference system and try to estimate their new position. One of the methods utilizes the pressure system that is mounted on the robot’s arm. The pressure system was able to detect surfaces by blowing air through a pipette against desired surfaces. Positional information of targeted objects are acquired by using the surface detection feature against an extruded square landmark that acts as a reference for estimating displacements.  The other method uses a barcode scanning camera by using its images to detect and retrieve positional information on Aruco markers. Estimation of the targeted object is done by tracking the movement of the Arucos position and orientation. Tests were made in order to analyse the performance of both methods and to verify that the requirement of 0.1 mm accuracy and precision could be obtained. The tests were limited to analysing the methods performance on stationary targets to guarantee that the methods did not detect incorrect displacements. It was found that the camera method could fulfill the requirement when it came to estimating XY-coordinates  by using multiple images and placing the Aruco marker within a reasonable distance to the targeted object. However, the camera method was not accurate when it came to estimating the Z-coordinates of objects. As for the pressure method, it was able to fulfill the requirement when it came to estimating Z-coordinates, but its ability to estimate the XY-coordinates of an object was not sufficient. A recommendation would be combine both methods so that they can compensate each other by using the camera method for estimating the XY-coordinates and the pressure method for estimating the Z-coordinates.
146

Pose Estimation using Genetic Algorithm with Line Extraction using Sequential RANSAC for a 2-D LiDAR

Kumat, Ashwin Dharmesh January 2021 (has links)
No description available.
147

Identification, classification and modelling of Traditional African dances using deep learning techniques

Adebunmi Elizabeth Odefunso (10711203) 06 May 2021 (has links)
<p>Human action recognition continues to evolve and is examined better using deep learning techniques. Several successes have been recorded in the field of action recognition but only very few has focused on dance. This is because dance actions and, especially Traditional African dance, are long and involve fast movement of body parts. This research proposes a novel framework that applies data science algorithms to the field of cultural preservation by applying various deep learning techniques to identify, classify and model Traditional African dances from videos. Traditional African dances are important part of the African culture and heritage. Digital preservation of these dances in their myriad forms is a problem. The dance dataset was constituted using freely available YouTube videos. Three Traditional African dances – Adowa, Bata and Swange – were used for the dance classification process. Two Convolutional Neural Network (CNN) models were used for the classification and they achieved an accuracy of 97% and 98% respectively. Sound classification of Adowa, Bata and Swange drum ensembles were also carried out; an accuracy of 96% was achieved. Human Pose Estimation Algorithms were applied to the Sinte dance. A model of Sinte dance, which can be exported to other environments, was obtained.</p>
148

Classification of the different movements (walk/trot/canter) anddata collection of pose estimation

Sjöström, Moa January 2020 (has links)
Pose estimation uses computer vision to predict how a body moves. The likeliness off different movements is predicted with a neural network and the most likely pose is predicted. With DeepLabCut, an open source software package for 3D animal pose estimation, information about animals behaviour and movement can be extracted. In this report the pose estimation of horses four hooves is used. By looking at the position of the hooves different gaits can be identified. Horses used for riding in the major disciplines in Sweden have three different gaits, walk, trot and canter. Walk is a four-stoke gait, trot is two-stoke and canter is three-stoke. This can be used to classify the different gaits. By looking at the hooves movement in vertical position over time and fitting a sinewave to the data it is possible to see the phase difference in the hooves movement. For walk and trot there was a significant pattern which was easy to identify and corresponded well to the theory of horses movement. For canter our pre-trained model lacked in accuracy, so the output data were insufficient. Therefore it was not possible to find a significant pattern for canter which corresponds to the theory of horses movements. The Fourier Transform were also tested to classify the gaits and when plotted it was possible to detect the different gaits, but not significant enough to be reliable for different horses in different sizes running in different paces. It was also possible to add the data for all four hooves together and fit a sinewave to the added data, and then compare it with the sinewaves for each hoof separately. Depending on the gait the frequency of the sinewaves differed between the hooves separately and added together and the gaits could be identified.
149

Facial Identity Embeddings for Deepfake Detection in Videos

Emir, Alkazhami January 2020 (has links)
Forged videos of swapped faces, so-called deepfakes, have gained a  lot  of  attention in recent years. Methods for automated detection of this type of manipulation are also seeing rapid progress in their development. The purpose of this thesis work is to evaluate the possibility and effectiveness of using deep embeddings from facial recognition networks as base for detection of such deepfakes. In addition, the thesis aims to answer whether or not the identity embeddings contain information that can be used for detection while analyzed over time and if it is suitable to include information about the person's head pose in this analysis. To answer these questions, three classifiers are created with the intent to answer one question each. Their performances are compared with each other and it is shown that identity embeddings are suitable as a basis for deepfake detection. Temporal analysis of the embeddings also seem effective, at least for deepfake methods that only work on a frame-by-frame basis. Including information about head poses in the videos is shown to not improve a classifier like this.
150

Angles-Only Navigation for Autonomous Orbital Rendezvous

Woffinden, David Charles 01 December 2008 (has links)
The proposed thesis of this dissertation has both a practical element and theoretical component which aim to answer key questions related to the use of angles-only navigation for autonomous orbital rendezvous. The first and fundamental principle to this work argues that an angles-only navigation filter can determine the relative position and orientation (pose) between two spacecraft to perform the necessary maneuvers and close proximity operations for autonomous orbital rendezvous. Second, the implementation of angles-only navigation for on-orbit applications is looked upon with skeptical eyes because of its perceived limitation of determining the relative range between two vehicles. This assumed, yet little understood subtlety can be formally characterized with a closed-form analytical observability criteria which specifies the necessary and sufficient conditions for determining the relative position and velocity with only angular measurements. With a mathematical expression of the observability criteria, it can be used to 1) identify the orbital rendezvous trajectories and maneuvers that ensure the relative position and velocity are observable for angles-only navigation, 2) quantify the degree or level of observability and 3) compute optimal maneuvers that maximize observability. In summary, the objective of this dissertation is to provide both a practical and theoretical foundation for the advancement of autonomous orbital rendezvous through the use of angles-only navigation.

Page generated in 0.0581 seconds