The Child Language Corpus of Jordanian Arabic (JA) is compiled through the collaboration of three main scholars: Dr. Marwan Jarrah (Principal Investigator), Dr. Ekab Al‑Shawashreh, and Dr. Mohammad A. M. Abushariah. It is the first comprehensive, systematically compiled linguistic resource documenting the spoken language of typically developing children in Jordan. The corpus marks a significant milestone in Arabic language acquisition research, providing an extensive collection of natural child speech data across regions, age groups, and genders.
Comprising around 500,000 words, the corpus is drawn from over 500 recorded interviews with children aged between 2.6 and 12 years. These recordings capture a wide range of everyday, spontaneous speech, offering authentic insights into how Jordanian children communicate in real-life situations. Reflecting voices from urban, rural, and Bedouin communities, the corpus offers a rich and inclusive portrayal of vernacular JA as it is naturally spoken.