<p>This thesis aims to design and develop the spatial adaptation approach through spatial transformers to improve the accuracy of human keypoint recognition models. We have studied different model types and design choices to gain an accuracy increase over models without spatial transformers and analyzed how spatial transformers increase the accuracy of predictions. A neural network called Widenet has been leveraged as a specialized network for providing the parameters for the spatial transformer. Further, we have evaluated methods to reduce the model parameters, as well as the strategy to enhance the learning supervision for further improving the performance of the model. Our experiments and results have shown that the proposed deep learning framework can effectively detect the human key points, compared with the baseline methods. Also, we have reduced the model size without significantly impacting the performance, and the enhanced supervision has improved the performance. This study is expected to greatly advance the deep learning of human key points and movement dynamics. </p>
Identifer | oai:union.ndltd.org:purdue.edu/oai:figshare.com:article/22203679 |
Date | 31 May 2023 |
Creators | Chao Yang Dai (14709547) |
Source Sets | Purdue University |
Detected Language | English |
Type | Text, Thesis |
Rights | CC BY 4.0 |
Relation | https://figshare.com/articles/thesis/Deep_Image_Processing_with_Spatial_Adaptation_and_Boosted_Efficiency_Supervision_for_Accurate_Human_Keypoint_Detection_and_Movement_Dynamics_Tracking/22203679 |
Page generated in 0.002 seconds