Parsing Arabic Nominal Sentences Using Context Free Grammar and Fundamental Rules of Classical Grammar
Автор: Nabil Ababou, Azzeddine Mazroui, Rachid Belehbib
Журнал: International Journal of Intelligent Systems and Applications(IJISA) @ijisa
Статья в выпуске: 8, 2017 года.
Бесплатный доступ
This work falls within the framework of the Arabic natural language processing. We are interested in parsing Arabic texts. Existing parsers generate parse trees that give an idea about the structure of the sentence without considering the syntactic functions specific to the Arabic language. Thus, the results are still insufficient in terms of syntactic information. The system we have developed in this article takes into consideration all these syntactic functions. This system begins with a morphological analysis in the context. Then, it uses a CFG grammar to extract the phrases and ends by exploiting the formalism of unification grammar and traditional grammar to combine these phrases and generate the final sentence structure.
POS tagger, Parser, Arabic phrase, grammar, syntax tree, syntactic functions
Короткий адрес: https://sciup.org/15010953
IDR: 15010953
Список литературы Parsing Arabic Nominal Sentences Using Context Free Grammar and Fundamental Rules of Classical Grammar
- H. Bais, M. Machkour and L. Koutti, "A Model of a Generic Natural Language Interface for Querying Database", International Journal of Intelligent Systems and Applications (IJISA), vol. 8, no. 2, pp. 35-44, 2016. DOI: 10.5815/ijisa.2016.02.05
- J. Avinash, O. Agrawal and Kakde, G, "A Semantic Analysis of Natural Language Queries Using Domain Ontology for Information Access from Database", International Journal of Intelligent Systems and Applications (IJISA), vol. 5, no. 12, pp. 81-90, 2013. DOI: 10.5815/ijisa.2013.12.07
- Z. Žabokrtský and O. Smrž, "Arabic Syntactic Trees: From Constituency to Dependency", in Proceedings of the Tenth Conference on European Chapter of the Association for Computational Linguistics - Volume 2, Stroudsburg, PA, USA, 2003, pp. 183–186.
- N. Khoufi and M. Boudokhane, "Statistical-based System for Morphological Annotation of Arabic Texts", in RANLP, 2013, pp. 100–106.
- S. and C. D. Manning, "Better Arabic parsing: Baselines, evaluations, and analysis", in Proceedings of the 23rd International Conference on Computational Linguistics, 2010, pp. 394–402.
- D. M. Bikel, "On the parameter space of generative lexicalized statistical parsing models", University of Pennsylvania, 2004.
- N. Ababou and A. Mazroui, "A hybrid Arabic POS tagging for simple and compound morphosyntactic tags", Int J Speech Technol, pp. 1–14, Oct. 2015.
- E. Othman, K. Shaalan, and A. Rafea, "A chart parser for analyzing modern standard Arabic sentence", in Proceedings of the MT Summit IX Workshop on Machine Translation for Semitic Languages, 2003, pp. 37–44.
- S. Alqrainy, H. Muaidi, and M. S. Alkoffash, "Context-free grammar analysis for Arabic sentences", International Journal of Computer Applications, vol. 53, no. 3, 2012.
- E. Al-Daoud and A. Basata, "A framework to automate the parsing of Arabic language sentences", Int. Arab J. Inf. Technol., vol. 6, no. 2, pp. 191–195, 2009.
- L. Tounsi and J. Van Genabith, "Arabic parsing using grammar transforms," 2010.
- N. Chomsky, Syntactic structures, 14. printing. The Hague: Mouton, 1957.
- L. Tesnière, Esquisse d'une syntaxe structurale. Paris: C. Klincksieck, 1953.
- L. Tesnière, Eléments de syntaxe structurale. Librairie C. Klincksieck, 1959.
- A. T. Al-Taani, M. M. Msallam, and S. A. Wedian, "A top-down chart parser for analyzing arabic sentences", Int. Arab J. Inf. Technol., vol. 9, no. 2, pp. 109–116, 2012.
- M. A. Attia, "Handling Arabic morphological and syntactic ambiguity within the LFG framework with a view to machine translation", University of Manchester, 2008.
- O. Nadim, T. Abeer, Moubaiddin Asma, and Hammo Bassam1, "Formal description of Arabic syntactic structure in the framework of the government and binding theory", Computación y Sistemas, vol. 18, no. 3, pp. 611–625, 2014.
- N. Khoufi, C. Aloulou, and L. H. Belguith, "Parsing Arabic using induced probabilistic context free grammar", International Journal of Speech Technology, Sep. 2015.
- S. Kulick and R. Gabbard, "Parsing the Arabic Treebank: Analysis and Improvements", 2006.
- "The Stanford Natural Language Processing Group", [Online Available]: http://nlp.stanford.edu/software/lex-parser.html. [Accessed: 10-Apr-2016].
- S Petrov. Coarse-to-Fine Natural Language Processing. University of California-Berkeley, 2009.
- G. Sampson and A. Babarczy, "A test of the leaf-ancestor metric for parse accuracy", Natural Language Engineering, vol. 9, no. 04, pp. 365–380, 2003.
- E. Black, Meeting of interest group on evaluation of broad-coverage grammars of English. LINGUIST List 3.587. 1992.
- Nivre and Johan Hall, "The CoNLL 2007 shared task on dependency parsing", in Proceedings of the CoNLL shared task session of EMNLP-CoNLL, 2007, pp. 915–932.
- Y. Marton, N. Habash, and O. Rambow, "Dependency parsing of Modern Standard Arabic with lexical and inflectional features", Computational Linguistics, vol. 39, no. 1, pp. 161–194, 2013.
- D. Chen and C. D. Manning, "A Fast and Accurate Dependency Parser using Neural Networks", in EMNLP, 2014, pp. 740–750.
- M. Boudchiche, A. Mazroui, M. O. A. O. Bebah, A. Lakhouaja, and A. Boudlal, "AlKhalil Morpho Sys 2: A robust Arabic morpho-syntactic analyzer", Journal of King Saud University-Computer and Information Sciences, 2016.
- M. Attia, M. Yaseen, and K. Choukri, Specifications of the Arabic Written Corpus produced within the NEMLAR project. Technical report, NEMLAR, Center for Sprogteknologi, 2005.
- A. Siraf, explanation of Sibawayh's Al-Kitab. Dar Al kotob AlMasriya, Egypt (2000).
- A. Mustafa, binding theory in Arabic grammar and Study of grammatical structure. Damascus University Journal for Literature and Humanities. 18, 41 (2002).
- E. Husserl, Introduction to the Logical Investigations: A Draft of a Preface to the Logical Investigations (1913). Springer Science & Business Media, 2012.
- K. Ajdukiewicz, "Die Syntaktische Konnexitat. Studia Philosophica 1: 1-27; translated as 'Syntactic Connecxion'in S. McCall", Polish Logic, 1935.
- Y. Bar-Hillel, "A quasi-arithmetical notation for syntactic description", Language, vol. 29, no. 1, pp. 47–58, 1953.
- A. Pasha et al., "MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic", in LREC, 2014, pp. 1094–1101.