Global ETD Search

Return to search

Multi-modal response generation.

Wong Ka Ho. / Thesis submitted in: October 2005. / Thesis (M.Phil.)--Chinese University of Hong Kong, 2006. / Includes bibliographical references (leaves 163-170). / Abstracts in English and Chinese. / Abstract --- p.2 / Acknowledgements --- p.5 / Chapter 1 --- Introduction --- p.10 / Chapter 1.1 --- Multi-modal and Multi-media --- p.10 / Chapter 1.2 --- Overview --- p.11 / Chapter 1.3 --- Thesis Goal --- p.13 / Chapter 1.4 --- Thesis Outline --- p.15 / Chapter 2 --- Background --- p.16 / Chapter 2.1 --- Multi-modal Fission --- p.17 / Chapter 2.2 --- Multi-modal Data collection --- p.21 / Chapter 2.2.1 --- Collection Time --- p.21 / Chapter 2.2.2 --- Annotation and Tools --- p.21 / Chapter 2.2.3 --- Knowledge of Multi-modal Using --- p.21 / Chapter 2.3 --- Text-to-audiovisual Speech System --- p.22 / Chapter 2.3.1 --- Different. Approaches to Generate a Talking Heading --- p.23 / Chapter 2.3.2 --- Sub-tasks in Animating a Talking Head --- p.25 / Chapter 2.4 --- Modality Selection --- p.27 / Chapter 2.4.1 --- Rules-based approach --- p.27 / Chapter 2.4.2 --- Plan-based approach --- p.28 / Chapter 2.4.3 --- Feature-based approach --- p.29 / Chapter 2.4.4 --- Corpus-based approach --- p.30 / Chapter 2.5 --- Summary --- p.30 / Chapter 3 --- Information Domain --- p.31 / Chapter 3.1 --- Multi-media Information --- p.31 / Chapter 3.2 --- "Task Goals, Dialog Acts, Concepts and Information Type" --- p.32 / Chapter 3.2.1 --- Task Goals and Dialog Acts --- p.32 / Chapter 3.2.2 --- Concepts and Information Type --- p.36 / Chapter 3.3 --- User's Task and Scenario --- p.37 / Chapter 3.4 --- Chapter Summary --- p.38 / Chapter 4 --- Multi-modal Response Data Collection --- p.41 / Chapter 4.1 --- Data Collection Setup --- p.42 / Chapter 4.1.1 --- Multi-modal Input Setup --- p.43 / Chapter 4.1.2 --- Multi-modal Output Setup --- p.43 / Chapter 4.2 --- Procedure --- p.45 / Chapter 4.2.1 --- Precaution --- p.45 / Chapter 4.2.2 --- Recording --- p.50 / Chapter 4.2.3 --- Data Size and Type --- p.50 / Chapter 4.3 --- Annotation --- p.52 / Chapter 4.3.1 --- Extensible Multi-Modal Markup Language --- p.52 / Chapter 4.3.2 --- "Mobile, Multi-biometric and Multi-modal Annotation" --- p.53 / Chapter 4.4 --- Problems in the Wizard-of-Oz Setup --- p.56 / Chapter 4.4.1 --- Lack of Knowledge --- p.57 / Chapter 4.4.2 --- Time Deficiency --- p.57 / Chapter 4.4.3 --- Information Availability --- p.58 / Chapter 4.4.4 --- Operation Delay --- p.59 / Chapter 4.4.5 --- Lack of Modalities --- p.59 / Chapter 4.5 --- Data Optimization --- p.61 / Chapter 4.5.1 --- Precaution --- p.61 / Chapter 4.5.2 --- Procedures --- p.61 / Chapter 4.5.3 --- Data Size in Expert Design Responses --- p.63 / Chapter 4.6 --- Analysis and Discussion --- p.65 / Chapter 4.6.1 --- Multi-modal Usage --- p.67 / Chapter 4.6.2 --- Modality Combination --- p.67 / Chapter 4.6.3 --- Deictic term --- p.68 / Chapter 4.6.4 --- Task Goal and Dialog Acts --- p.71 / Chapter 4.6.5 --- Information Type --- p.72 / Chapter 4.7 --- Chapter Summary --- p.74 / Chapter 5 --- Text-to-Audiovisual Speech System --- p.76 / Chapter 5.1 --- Phonemes and Visemes --- p.77 / Chapter 5.2 --- Three-dimensional Facial Animation --- p.82 / Chapter 5.2.1 --- Three-dimensional (3D) Face Model --- p.82 / Chapter 5.2.2 --- The Blending Process for Animation --- p.84 / Chapter 5.2.3 --- Connectivity between Visemes --- p.85 / Chapter 5.3 --- User Perception Experiments --- p.87 / Chapter 5.4 --- Applications and Extension --- p.89 / Chapter 5.4.1 --- Multilingual Extension and Potential Applications --- p.89 / Chapter 5.5 --- Talking Head in Multi-modal Dialogue System --- p.90 / Chapter 5.5.1 --- Prosody --- p.93 / Chapter 5.5.2 --- Body Gesture --- p.94 / Chapter 5.6 --- Chapter Summary --- p.94 / Chapter 6 --- Modality Selection and Implementation --- p.98 / Chapter 6.1 --- Multi-modal Response Examples --- p.98 / Chapter 6.1.1 --- Single Concept-value Example --- p.99 / Chapter 6.1.2 --- Two Concept-values with Different Information Types --- p.102 / Chapter 6.1.3 --- Multiple Concept-values with Same Information Types Example --- p.103 / Chapter 6.2 --- Heuristic Rules for Modality Selection --- p.105 / Chapter 6.2.1 --- General Principles --- p.106 / Chapter 6.2.2 --- Heuristic rules --- p.107 / Chapter 6.2.3 --- Temporal Coordination for Synchronization --- p.109 / Chapter 6.2.4 --- Physical Layout --- p.110 / Chapter 6.2.5 --- Deictic Term --- p.111 / Chapter 6.2.6 --- Example --- p.111 / Chapter 6.3 --- Spoken Content Generation --- p.113 / Chapter 6.4 --- Chapter Summary --- p.115 / Chapter 7 --- Conclusions and Future Work --- p.117 / Chapter 7.1 --- Summary --- p.117 / Chapter 7.2 --- Contributions --- p.118 / Chapter 7.3 --- Future work --- p.119 / Chapter A --- XML Schema for M3 Markup Language --- p.123 / Chapter B --- M3ML Examples --- p.128 / Chapter C --- Domain-Specific Task Goals in the Hong Kong Tourism Do- main --- p.131 / Chapter D --- Dialog Acts for User Request in the Hong Kong Tourism Do- main --- p.133 / Chapter E --- Dialog Acts for System Response in the Hong Kong Tourism Domain --- p.137 / Chapter F --- Information Type and Concepts --- p.141 / Chapter G --- Concepts --- p.143 / Bibliography --- p.149

Interactive computer systems

Human-computer interaction

Interactive multimedia

Tourism--Information services

Identifer	oai:union.ndltd.org:cuhk.edu.hk/oai:cuhk-dr:cuhk_325519
Date	January 2006
Contributors	Wong, Ka Ho., Chinese University of Hong Kong Graduate School. Division of Systems Engineering and Engineering Management.
Source Sets	The Chinese University of Hong Kong
Language	English, Chinese
Detected Language	English
Type	Text, bibliography
Format	print, 170 leaves : ill. (some col.) ; 30 cm.
Coverage	China, Hong Kong
Rights	Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Page generated in 0.0017 seconds

Multi-modal response generation.

Description

Links & Downloads

Tags

Additional Fields