The need for interacting with machines using spoken natural language is growing,
along with the expectation that synthetic speech in this context sound
natural. Such interaction includes answering questions, where prosody plays an
important role in producing natural English synthetic speech by communicating
the information structure of utterances.
Combinatory Categorial Grammar (CCG) is a theoretical framework that exploits
the notion that, in English, information structure, prosodic structure and
syntactic structure are isomorphic. This isomorphism provides a way to convert
a semantic representation of an utterance into a prosodically natural spoken
utterance. Grammatical Framework (GF) is a framework for writing grammars,
where abstract tree structures capture the semantic structure and concrete
grammars render these structures as linearised strings. This research combines
these frameworks to develop a system that converts semantic representations
of utterances into linearised strings of natural language that are marked up to
inform the prosody-generating component of a speech synthesis system.
Computing / M. Sc. (Computing)
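
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the dissertation's grammar and it does not use GF itself: the class names, the bracketed markup format and the example sentence are all hypothetical, and the tone labels (`L+H*`, `H*`, `LH%`, `LL%`) are ToBI-style labels of the kind commonly used in CCG accounts of intonation, chosen here only as an assumption. The sketch shows one abstract structure, annotated with theme and rheme, rendered by two "concrete" linearisations: plain text and text marked up for a prosody component.

```python
from dataclasses import dataclass

# Abstract structure (hypothetical): an utterance is a sequence of phrases,
# each labelled with its information-structure role.
@dataclass(frozen=True)
class Word:
    text: str
    focus: bool = False  # True if the word should carry a pitch accent

@dataclass(frozen=True)
class Phrase:
    role: str            # "theme" or "rheme"
    words: tuple

@dataclass(frozen=True)
class Utterance:
    phrases: tuple

# Assumed ToBI-style labels; not taken from the source document.
ACCENT = {"theme": "L+H*", "rheme": "H*"}
BOUNDARY = {"theme": "LH%", "rheme": "LL%"}

def linearise_plain(utterance):
    """Concrete rendering 1: the bare word string."""
    return " ".join(w.text for p in utterance.phrases for w in p.words)

def linearise_marked(utterance):
    """Concrete rendering 2: words plus accent and boundary-tone markup."""
    parts = []
    for phrase in utterance.phrases:
        for word in phrase.words:
            if word.focus:
                parts.append(f"{word.text}[{ACCENT[phrase.role]}]")
            else:
                parts.append(word.text)
        # Close each information-structure unit with its boundary tone.
        parts.append(BOUNDARY[phrase.role])
    return " ".join(parts)

# Hypothetical answer to a question such as "Who did Anna marry?"
answer = Utterance((
    Phrase("theme", (Word("Anna"), Word("married", focus=True))),
    Phrase("rheme", (Word("Manny", focus=True),)),
))

print(linearise_plain(answer))
print(linearise_marked(answer))
```

Keeping the information structure in the abstract tree, rather than in the output string, is what lets the same tree feed both a plain-text rendering and a prosodically annotated one.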
Identifier | oai:union.ndltd.org:netd.ac.za/oai:union.ndltd.org:unisa/oai:uir.unisa.ac.za:10500/14510 |
Date | 04 1900 |
Creators | Marais, Laurette |
Contributors | Ranta, Aarne, Horne, Tertia |
Source Sets | South African National ETD Portal |
Language | English |
Detected Language | English |
Type | Dissertation |
Format | 1 online resource (119 leaves) : illustrations |