Global ETD Search

1	Towards the Development of an Automatic Diacritizer for the Persian Orthography based on the Xerox Finite State Transducer Nojoumian, Peyman 12 August 2011 (has links) Due to the lack of short vowels or diacritics in Persian orthography, many Natural Language Processing applications for this language, including information retrieval, machine translation, text-to-speech, and automatic speech recognition systems need to disambiguate the input first, in order to be able to do further processing. In machine translation, for example, the whole text should be correctly diacritized first so that the correct words, parts of speech and meanings are matched and retrieved from the lexicon. This is primarily because of Persian’s ambiguous orthography. In fact, the core engine of any Persian language processor should utilize a diacritizer and a lexical disambiguator. This dissertation describes the design and implementation of an automatic diacritizer for Persian based on the state-of-the-art Finite State Transducer technology developed at Xerox by Beesley & Karttunen (2003). The result of morphological analysis and generation on a test corpus is shown, including the insertion of diacritics. This study will also look at issues that are raised by phonological and semantic ambiguities as a result of short vowels in Persian being absent in the writing system. It suggests a hybrid model (rule-based & inductive) that is inspired by psycholinguistic experiments on the human mental lexicon for the disambiguation of heterophonic homographs in Persian using frequency and collocation information. A syntactic parser can be developed based on the proposed model to discover Ezafe (the linking short vowel /e/ within a noun phrase) or disambiguate homographs, but its implementation is left for future work. Persian Persian computational linguistics diacritizer morphological analyzer heterophonic homograph disambiguation
2	Towards the Development of an Automatic Diacritizer for the Persian Orthography based on the Xerox Finite State Transducer Nojoumian, Peyman 12 August 2011 (has links) Due to the lack of short vowels or diacritics in Persian orthography, many Natural Language Processing applications for this language, including information retrieval, machine translation, text-to-speech, and automatic speech recognition systems need to disambiguate the input first, in order to be able to do further processing. In machine translation, for example, the whole text should be correctly diacritized first so that the correct words, parts of speech and meanings are matched and retrieved from the lexicon. This is primarily because of Persian’s ambiguous orthography. In fact, the core engine of any Persian language processor should utilize a diacritizer and a lexical disambiguator. This dissertation describes the design and implementation of an automatic diacritizer for Persian based on the state-of-the-art Finite State Transducer technology developed at Xerox by Beesley & Karttunen (2003). The result of morphological analysis and generation on a test corpus is shown, including the insertion of diacritics. This study will also look at issues that are raised by phonological and semantic ambiguities as a result of short vowels in Persian being absent in the writing system. It suggests a hybrid model (rule-based & inductive) that is inspired by psycholinguistic experiments on the human mental lexicon for the disambiguation of heterophonic homographs in Persian using frequency and collocation information. A syntactic parser can be developed based on the proposed model to discover Ezafe (the linking short vowel /e/ within a noun phrase) or disambiguate homographs, but its implementation is left for future work. Persian Persian computational linguistics diacritizer morphological analyzer heterophonic homograph disambiguation
3	Towards the Development of an Automatic Diacritizer for the Persian Orthography based on the Xerox Finite State Transducer Nojoumian, Peyman 12 August 2011 (has links) Due to the lack of short vowels or diacritics in Persian orthography, many Natural Language Processing applications for this language, including information retrieval, machine translation, text-to-speech, and automatic speech recognition systems need to disambiguate the input first, in order to be able to do further processing. In machine translation, for example, the whole text should be correctly diacritized first so that the correct words, parts of speech and meanings are matched and retrieved from the lexicon. This is primarily because of Persian’s ambiguous orthography. In fact, the core engine of any Persian language processor should utilize a diacritizer and a lexical disambiguator. This dissertation describes the design and implementation of an automatic diacritizer for Persian based on the state-of-the-art Finite State Transducer technology developed at Xerox by Beesley & Karttunen (2003). The result of morphological analysis and generation on a test corpus is shown, including the insertion of diacritics. This study will also look at issues that are raised by phonological and semantic ambiguities as a result of short vowels in Persian being absent in the writing system. It suggests a hybrid model (rule-based & inductive) that is inspired by psycholinguistic experiments on the human mental lexicon for the disambiguation of heterophonic homographs in Persian using frequency and collocation information. A syntactic parser can be developed based on the proposed model to discover Ezafe (the linking short vowel /e/ within a noun phrase) or disambiguate homographs, but its implementation is left for future work. Persian Persian computational linguistics diacritizer morphological analyzer heterophonic homograph disambiguation
4	Towards the Development of an Automatic Diacritizer for the Persian Orthography based on the Xerox Finite State Transducer Nojoumian, Peyman January 2011 (has links) Due to the lack of short vowels or diacritics in Persian orthography, many Natural Language Processing applications for this language, including information retrieval, machine translation, text-to-speech, and automatic speech recognition systems need to disambiguate the input first, in order to be able to do further processing. In machine translation, for example, the whole text should be correctly diacritized first so that the correct words, parts of speech and meanings are matched and retrieved from the lexicon. This is primarily because of Persian’s ambiguous orthography. In fact, the core engine of any Persian language processor should utilize a diacritizer and a lexical disambiguator. This dissertation describes the design and implementation of an automatic diacritizer for Persian based on the state-of-the-art Finite State Transducer technology developed at Xerox by Beesley & Karttunen (2003). The result of morphological analysis and generation on a test corpus is shown, including the insertion of diacritics. This study will also look at issues that are raised by phonological and semantic ambiguities as a result of short vowels in Persian being absent in the writing system. It suggests a hybrid model (rule-based & inductive) that is inspired by psycholinguistic experiments on the human mental lexicon for the disambiguation of heterophonic homographs in Persian using frequency and collocation information. A syntactic parser can be developed based on the proposed model to discover Ezafe (the linking short vowel /e/ within a noun phrase) or disambiguate homographs, but its implementation is left for future work. Persian Persian computational linguistics diacritizer morphological analyzer heterophonic homograph disambiguation
5	Cross-culture Choral Music Education: Issues for Western Choral Conductors Related to the Performance of Arabic Choral Music Earnhart, Cari L. 08 1900 (has links) The concept of choral music as defined by the Western world was foreign to Arab cultures until the colonization of the Arab world began in the seventeenth century when we began to see the Western choral style emerging in the churches of the Arab world. Group singing of traditional music was done in unison or heterophonic textures. Notated part-singing is a product of colonization, Westernization, Christianization, and now globalization. In recent years, singing music in mixed or multiple voicings not of a heterophonic nature has spread beyond the churches to the secular Arab world. As choral singing has increased in the Arab world, a new genre of Arabic choral music has emerged. In order for Western conductors to effectively teach, conduct, or perform these new works, it is important for them to develop a basic understanding of traditional Arabic musical styles and pronunciation of the language, thereby making Arabic choral music more accessible and enabling it to become a part of the larger world’s musical vocabulary. This study serves as an introductory resource for non-Arab choral conductors concerning key elements related to performing Arabic choral music and provides a context for how these elements relate to this evolving choral genre. In addition, through interviews with composers and conductors of Arabic choral music, this project will further inform the reader regarding the performance of this genre. Arabic choral music cross-culture choral music choral conductors of Arabic music globalization heterophonic textures

1

Page generated in 0.0495 seconds