OPTIMIZATION OF CORPUS LINGUISTICS IN THE PREPARATION OF ARABIC IDIOM DICTIONARY

Nafisatul Izza R.U., Mohammad Ahsanuddin, Hanik Mahliatussikah

Abstract


Abstract: Arabic, as one of the world’s international languages, is rich in vocabulary and idiomatic expressions, which often pose challenges for foreign learners in grasping accurate meanings. Observations and interviews with students of the Arabic Language Education Study Program at the State University of Malang revealed a key issue in maharah kalam and kitabah courses: the lack of learning resources that systematically present idioms with contextual meanings and authentic usage. Existing dictionaries remain largely conventional, focusing solely on lexical translation, thereby creating a gap between vocabulary mastery and the ability to apply idiomatic expressions in a communicative context. To address this, the present study develops an Arabic–Indonesian idiom dictionary by integrating corpus linguistics as the primary approach in identifying, analysing, and systematically presenting idioms. Employing a qualitative-descriptive method, authentic corpus data were processed using Sketch Engine, which features include Wordlist, N-grams, Concordance, and Word Sketch. This analysis revealed the frequency of occurrence, structural variations, and both literal and idiomatic meanings. The study successfully identified 2,100 Arabic idioms considered relevant for learning purposes. This research not only provides a systematic framework for dictionary compilation but also produces an interactive digital product that is practical and learner-oriented. The novelty lies in combining corpus linguistics with digital lexicography, offering significant contributions to Arabic lexicographic studies while advancing technology-based, digital Arabic learning practices responsive to the demands of the digital era.

Keywords: Corpus linguistics, Arabic idiom dictionary, State University of Malang

Full Text:

PDF

References


Abdumanapovna, S. A. (2019). The Role of Sketch Engine in Multiple Types of Corpora. International Journal of Innovative Technology and Exploring Engineering, 8(11), 250–254. https://doi.org/10.35940/ijitee.K1307.0981119

Althobaiti, M. J. (2022). A Simple Yet Robust Algorithm for Automatic Extraction of Parallel Sentences: A Case Study on Arabic-English Wikipedia Articles. IEEE Access, 10, 401–420. https://doi.org/10.1109/ACCESS.2021.3137830

Arruda, H. M., Bavaresco, R. S., Kunst, R., Bugs, E. F., Pesenti, G. C., & Barbosa, J. L. V. (2023). Data Science Methods and Tools for Industry 4.0: A Systematic Literature Review and Taxonomy. Sensors, 23(11), 5010. https://doi.org/10.3390/s23115010

Hidayat, R., Sulaimah Saleh, U., Satya, I., & Wargadinata, W. (2024). Idiomatic Phrase Processing in Arabic: A Psycholinguistic Study.” International Journal of Language and Ubiquitous Learning, 1(3), 209–221. https://doi.org/10.70177/ijlul.v1i3.668

Holes, C. (2020). Arabic corpus linguistics ed. by Tony McEnery, Andrew Hardie, and Nagwa Younis. Language, 96(1), 202–206. https://doi.org/10.1353/lan.2020.0007

Jarad, N. I., & SSaydeh, A. (2017). Idiom In The Arabic - English Dictionary. International Journal of Arabic-English Studies (IJAES), 7.

Maura Syafa’ah, D., & Hizbullah, N. (2023). Compiling an Arabic-English Transport Dictionary through Corpus Linguistic Methods. Scaffolding: Jurnal Pendidikan Islam Dan Multikulturalisme, 5(1), 251–270. https://doi.org/10.37680/scaffolding.v5i1.2546

Meyer, C. F. (2023). English Corpus Linguistics. Cambridge University Press. https://doi.org/10.1017/9781107298026

R.U., N. I., Mahliatussikah, H., & Ahsanuddin, M. (2025). Students’ Perception of the Development of a Digital Dictionary of Arabic Idioms Based on Corpus Linguistics. Arabiyat : Jurnal Pendidikan Bahasa Arab Dan Kebahasaaraban, 11(2), 234–244. https://doi.org/10.15408/a.v11i2.41665

Schierholz, S. J. (2015). Methods in Lexicography and Dictionary Research. Lexikos, 25. https://doi.org/10.5788/25-1-1302

Sugiyono. (2018). Metodologi Penelitian Kuantitatif, Kualitatif dan R&D. Alfabeta.

Wiegand, H. E. (1998). Wörterbuchforschung. DE GRUYTER. https://doi.org/10.1515/9783110802467


Refbacks

  • There are currently no refbacks.


ISSN: 2598-0653