Spelling suggestions: "subject:"arabic dialect"" "subject:"krabic dialect""
1 |
Aspects of the syntax of the dialect of Abha (south west Saudi Arabia)Al-Azraqi, Munira Ali January 1998 (has links)
The present study deals with the syntax of the Arabic dialect of Abha in south-west of Saudi Arabia It is a synchronic study which deals with the everyday usage of the dialect. Diachronic changes are sometimes indicated where relevant. The phonology and morphology of the dialect are discussed in brief where necessary. This dialect has many distinctive features some of which do not occur in other dialects. The dialect is going through remarkable change due to people's tendency to change affected by the spread of education, mass media and communication. Thus the study has been conducted to examine some syntactic features of the dialect and record them before the dialect loses those features, and to make this dialect accessible for further research in sociolinguistic or diachronic studies. This study comes in two parts. The first part deals with the classification of the main parts of speech and their function in context. This part comprises four chapters: the first chapter deals with the noun and its sub-classes; the second chapter deals with the verb and its relation with the pronouns; the third chapter deals with particles and their functions in the sentence; the fourth chapter deals with functionals and their functions in the sentence. The second part examines the relationships between parts of speech. This part also comprises four chapters which deal respectively with: predication, annexation, complementation and attnbution.
|
2 |
The development the use of the negation particles miš and mā…š in Egyptian colloquial ArabicTown, Rosalie Melissa 09 November 2010 (has links)
The negation system in Modern Egyptian Colloquial Arabic does not follow an obvious set of rules. The particle that negates most verbal predicates also negates nominal predicates, and the particle that negates most nominal predicates also negates verbal predicates. By examining the behavior of these particles over time and comparing them to negation systems in other languages, it is possible to see the reasons for this complicated negation system. / text
|
3 |
Multi-dialect Arabic broadcast speech recognitionAli, Ahmed Mohamed Abdel Maksoud January 2018 (has links)
Dialectal Arabic speech research suffers from the lack of labelled resources and standardised orthography. There are three main challenges in dialectal Arabic speech recognition: (i) finding labelled dialectal Arabic speech data, (ii) training robust dialectal speech recognition models from limited labelled data and (iii) evaluating speech recognition for dialects with no orthographic rules. This thesis is concerned with the following three contributions: Arabic Dialect Identification: We are mainly dealing with Arabic speech without prior knowledge of the spoken dialect. Arabic dialects could be sufficiently diverse to the extent that one can argue that they are different languages rather than dialects of the same language. We have two contributions: First, we use crowdsourcing to annotate a multi-dialectal speech corpus collected from Al Jazeera TV channel. We obtained utterance level dialect labels for 57 hours of high-quality consisting of four major varieties of dialectal Arabic (DA), comprised of Egyptian, Levantine, Gulf or Arabic peninsula, North African or Moroccan from almost 1,000 hours. Second, we build an Arabic dialect identification (ADI) system. We explored two main groups of features, namely acoustic features and linguistic features. For the linguistic features, we look at a wide range of features, addressing words, characters and phonemes. With respect to acoustic features, we look at raw features such as mel-frequency cepstral coefficients combined with shifted delta cepstra (MFCC-SDC), bottleneck features and the i-vector as a latent variable. We studied both generative and discriminative classifiers, in addition to deep learning approaches, namely deep neural network (DNN) and convolutional neural network (CNN). In our work, we propose Arabic as a five class dialect challenge comprising of the previously mentioned four dialects as well as modern standard Arabic. Arabic Speech Recognition: We introduce our effort in building Arabic automatic speech recognition (ASR) and we create an open research community to advance it. This section has two main goals: First, creating a framework for Arabic ASR that is publicly available for research. We address our effort in building two multi-genre broadcast (MGB) challenges. MGB-2 focuses on broadcast news using more than 1,200 hours of speech and 130M words of text collected from the broadcast domain. MGB-3, however, focuses on dialectal multi-genre data with limited non-orthographic speech collected from YouTube, with special attention paid to transfer learning. Second, building a robust Arabic ASR system and reporting a competitive word error rate (WER) to use it as a potential benchmark to advance the state of the art in Arabic ASR. Our overall system is a combination of five acoustic models (AM): unidirectional long short term memory (LSTM), bidirectional LSTM (BLSTM), time delay neural network (TDNN), TDNN layers along with LSTM layers (TDNN-LSTM) and finally TDNN layers followed by BLSTM layers (TDNN-BLSTM). The AM is trained using purely sequence trained neural networks lattice-free maximum mutual information (LFMMI). The generated lattices are rescored using a four-gram language model (LM) and a recurrent neural network with maximum entropy (RNNME) LM. Our official WER is 13%, which has the lowest WER reported on this task. Evaluation: The third part of the thesis addresses our effort in evaluating dialectal speech with no orthographic rules. Our methods learn from multiple transcribers and align the speech hypothesis to overcome the non-orthographic aspects. Our multi-reference WER (MR-WER) approach is similar to the BLEU score used in machine translation (MT). We have also automated this process by learning different spelling variants from Twitter data. We mine automatically from a huge collection of tweets in an unsupervised fashion to build more than 11M n-to-m lexical pairs, and we propose a new evaluation metric: dialectal WER (WERd). Finally, we tried to estimate the word error rate (e-WER) with no reference transcription using decoding and language features. We show that our word error rate estimation is robust for many scenarios with and without the decoding features.
|
4 |
Language Tension, Terminology Variation and Terminology Policy in the Arabic-Speaking North African Countries: An Alternative Approach to Terminology PracticeHamed, Fawzi Younis 02 December 2014 (has links)
No description available.
|
Page generated in 0.0476 seconds