Arabic Levantine (ar-XL)
In this article you will find graphemes and phonemes for Levantine Arabic Speech to Text and Keyword Spotting.
Each language has a specific set of graphemes and phonemes that were used to train the Speech Recognition systems. Only these sets can be used for spelling keywords or preferred phrases and defining their pronunciations.
Graphemes
These are the valid graphemes to be used for
spelling.ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع غ ف ق ك ل م ن ه و ى ي
Phonemes
These are the valid phonemes to be used for defining pronunciations.
Phoneme1 | Grapheme | Example words | Phonemic representation |
---|---|---|---|
a | ا | اكتب التانيه | a k t b a l t a n j h |
A | ى | أبدي | ?a b d A |
?a | أ | أمريكي بأبطئوا | ?a m r j k j b ?a b t` ?i w a |
?? | إ | إجاني لإحنا | ?? Z a n j l ?? X\ n a |
a: | آ | آلي القرآن | a: l j a l q r a: n |
b | ب | بعثلون عباتين | b ?\ T t l w n ?\ b a t j n |
t | ت | بتأذاني قيمتلي | b t ?a D a n j q j m t l j |
T | ث | ثقلي شغثة | T q l j S G T at |
Z | ج | جوزها دجاج | Z w h a d Z a Z |
X\ | ح | حلتها عالحيط | X\ l t h a ?\ a l X\ j t` |
x | خ | خربانه متخربت | x r b a n h m t x r b t |
d | د | دارسه صادق | d a r s h s` a d q |
D | ذ | ذكرياتي عالهذا | D k r j a t j ?\ a l h D a |
r | ر | راسكم زيرو | r a s k m z j r w |
z | ز | زيرو كالكنوز | z j r w k a l k n w z |
s | س | سنتان يدرسو | s n t a n j d r s w |
S | ش | شركة فيهاش | S r k at f j h a S |
s` | ص | صاحبي فاحص | s` a X\ b t j f a X\ s` |
d` | ض | ضرورية احمضت | d` r w r j at a X\ m d` t |
t` | ط | طافيه بأبطئوا | t` a f j h b ?a b t` ?i w a |
D` | ظ | ظافر احظه | D` a f r a X\ D` h |
?\ | ع | عارفين مبعطي | ?\ a r f j n m b ?\ t` j |
G | غ | غيرها شغثة | G j r h a S G T at |
f | ف | فاليوم طافيه | f a l j w m t` a f j h |
q | ق | قلبون صادق | q l b w n s` a d q |
k | ك | كالسيوم كالكنوز | k a l s j w m k a l k n w z |
l | ل | لأنتصار كالسيوم | l ?a n t s` a r k a l s j w m |
m | م | مبعطي قيمتلي | m b ?\ t` j q j m t l j |
n | ن | نوصي سنتان | n w s` j s n t a n |
h | ه | هوني بالهواء | h w n j b a l h w a ? |
at | ة | اخوة ضرورية | a x w at d` r w r j at |
w | و | وحليب بالهواء | w X\ l j b b a l h w a ? |
?u | ؤ | مسؤول أتؤلم | m s ?u w l ?a t ?u l m |
j | ي | يأخذنا زيرو | j ?a x D n a z j r w |
?i | ئ | كئيب لئيمة | k ?i j b l ?i j m at |
? | ء | بالهواء سمراء | b a l h w a ? s m r a ? |
Footnotes
-
Each phoneme corresponds to the grapheme (letter). Arabic texts and dictionary as well as phonemes used in training of STT and KWS do not use diacritics (fathah, kasrah, dammah, waslah, sukun, tanwin), while all representations of hamza ئ ,ؤ ,إ ,أ ,ء, and alif with maddah آ are present. Given the one-to-one correspondence between phonemes and graphemes, some phonemes had been modified and, therefore, phonemic system of AR_XL_6 present a few differences from the typical SAMPA representation of MSA phonemes. ↩