ASR for Tajweed Rules: Integrated with Self-Learning Environments
Автор: Ahmed AbdulQader Al-Bakeri, Abdullah Ahmad Basuhail
Журнал: International Journal of Information Engineering and Electronic Business(IJIEEB) @ijieeb
Статья в выпуске: 6 vol.9, 2017 года.
Бесплатный доступ
Due to the recent progress in technology, the traditional learning setting in several fields has been renewed by different environments of learning, most of which involve the use of computers and networking to achieve a type of e-learning. With great interest surrounding the Holy Quran related research, only a few scientific research has been conducted on the rules of Tajweed (intonation) based on automatic speech recognition (ASR). In this research, the use of ASR and MVC design is proposed. This system enhances the learners’ basic knowledge of Tajweed and facilitates self-learning. The learning process that is based on ASR ensures that the students have the proper pronunciation of the verses of the Holy Quran. However, the traditional method requires that both students and teacher meet face-to-face. This requirement is a limitation to enhancing individuals’ learning. The purpose of this research is to use speech recognition techniques to correct students’ recitation automatically, bearing in mind the rules of Tajweed. In the final steps, the system is integrated with self-learning environments which depend on MVC architectures.
Automatic Speech Recognition (ASR), Acoustic model, Phonetic dictionary, Language model, Hidden Markov Model, Model View Controller (MVC)
Короткий адрес: https://sciup.org/15013544
IDR: 15013544
Текст научной статьи ASR for Tajweed Rules: Integrated with Self-Learning Environments
Published Online November 2017 in MECS DOI: 10.5815/ijieeb.2017.06.01
The Quran is the Holy book for the Muslims. The Quran contains guidance for life, which has to be applied by the Muslim people. In order to achieve this goal, it is important for the Muslims to understand the Quran clearly so they can be capable of applying it. Recitation is one of the Holy Quran related sciences. Previously, it was necessary to have a teacher of Quran and a student to meet face-to-face for the student to learn the recitation orally. It is the only certified way to guarantee that a certain verse of the Holy Quran is recited correctly.
Nowadays, because of the continuous increase and huge demand of people to learn the Quran, several organizations have started serving the learners by providing an online instructor in order to help them learn how to recite the verses of the Holy Quran correctly according to the rules of intonation (Tajweed science). Prophet Muhammad (peace be upon him) is the founder of the rules of this science, and certain Sahabah (companions of the prophet; may Allah be pleased with them) have learned from him the rules of pronunciation, and then those Sahabah have taught the second generation [1]. The process has continued up in the same manner till now.
The model proposed in this research facilitates teaching the recitation of the Holy Quran online so that students can practice Quran rules through using automatic speech recognition (ASR).
This approach of teaching has several benefits such as the delivery to individuals who cannot attend the Halaqat (sessions) held at the masjids (mosques), and it facilitates Quran teaching styles to receive more than the long-established model. The extreme importance of recitation and memorization of the Quran is due to the numerous benefits for readers and learners, as stated in Quran and Sunnah [2]. The learning of Quran is achieved by a qualified reciter (called sheikh qari) who has elder licensed linked to the transmission chain until it reaches the Messenger of Allah, Prophet Muhammad (peace be upon him). Detailed information about the Holy Quran and its sciences can be found in many resources; for example, see [3].
Due to the extensive use of the Internet and its availability, there is a strong need to develop a system that emulates the traditional way of the Quran teaching. There is some research that focuses on these issues, such as Miqra’ah, which is a server that uses virtually over the Internet.
The Holy Quran is written in the Arabic language, which is considered as a complex morphological language. From the perspective of ASR, the combination of letters is pronounced in the same way or different, depending on the Harakat used in upper and lower-case character [4]. Intrinsic motivation to develop ASR as participation to serve the Holy Quran sciences and its proposed approaches is needed to implement a system to correct the pronunciation mistakes and integrate it within a self-learning environment. Therefore, we suggest the use of the Model View Controller (MVC) as a base structure that helps in massive development such as this research. Phonetic Quran is a special case of Arabic phonemes where there is a guttural letter followed by any other letter. This case is called guttural manifestation.
Gutturalness, in Quran, relates to the quality of being guttural (i.e., producing a particular sound that comes from the back of the throat). The articulation of Quran emphatics affects adjacent vowels.
There are many commercial packages that are available, such as audio applications to recite (Tarteel) the Holy Quran. One among these packages is the Quran Auto Reciter (QAR) [5]; however, this application does not support the rules of Tajweed to verify and validate the Quranic recitation.
The field of speech recognition in Quranic voice recognition is a significant field, where the processing and acoustic model has a relation with Arabic phonemes and articulation of each word; thus, the research in the recitation of Quran could be taken from a different aspect.
In general, and especially in computer science, there are substantial research achieves meant to produce worthy results in the correction of the pronunciation of the Quran words according to the rules of Tajweed. Hassan Tabbal has done research on the topic of automated delimiters, which extracts ayah (verse) from an audio file and then converts verses of Quran into an audio file using the technology of speech recognition tools. The developed system depends on the framework of Sphinx IV [6].
Putra, Atmaja, and Prananto developed a learning system that used speech recognition for the recitation of the Quranic verses to reduce obstacles in learning the Quran and to facilitate the learning process. Their implementation depended on the Gaussian Mixture Model (GMM) and the Mel Frequency Cepstral (MFCC) features. The system produces good results for an effective and flexible learning process. The method of template referencing was used in that research [7]. Noor Jamaliah, in her master thesis in the field of speech recognition, used Mel Frequency Cepstral Coefficients (MFCC) for extracting feature from the input sound, and she used the Hidden Markov Model (HMM) for recognition and training purposes. The engine showed recognition rates that exceeded 86.41% (phonemes), and 91.95% (ayates) [8].
KAPD are available on 3 CDs for researchers [9]. A research on e-Halagat is demonstrated in [10]. Noureddine Aloui with other researchers used Discrete Walsh Hadamard Transform (DWHT), where the original speech is converted into stationary frames, and then applied the DWHT to the output signal. The performance is evaluated by using some objective criteria such as NRMSE, SNR, CR, and PSNR [11].
Nijhawan and Soni used the MFCC for feature extraction to build Speaker Recognition System (SRS) [12]. The training phase was done by calculating MFCC, executing VQ, finding the nearest neighbor using Euclidean distance, and then computing centroid and creating codebook for each speaker. After the completion of the training process, the testing phase is achieved through calculating MFCC, finding the nearest neighbor, finding minimum distance and then decision making.
Список литературы ASR for Tajweed Rules: Integrated with Self-Learning Environments
- سنن سعيد بن منصور، دراسة وتحقيق د. سعد بن عبدهللا بن عبدالعزيز آلحميد، دار الصميعي، المملكة العربية السعودية، ج: 5 ص: 2
- Islam City Website. [Online]. http://www.islamicity.com/mosque/quran/
- Islam Way Website. [Online]. The Most Famous Broadcast Website about Islam. http://en.islamway.net/
- H. Tabbal, W. El-Falou, B. Monla, 2006. “Analysis and Implementation of a “Quranic” verses delimitation system in audio files using speech recognition techniques”.In: Proceeding of the IEEE Conference of 2nd.
- http://www.searchtruth.com/download.php.
- A.-F. W. a. M. B. Tabbal Hassan" ،Analysis and Implementation of an Automated Delimiter of "Quranic" Verses in Audio Files using Speech Recognition Techniques "، "Robust Speech Recognition and Understanding: 351.
- B.Putra , B.T. Atmaja , D.Prananto,’’ Developing Speech Recognition System for Quranic Verse Recitation Learning Software’’, International Journal on Informatics for Development (IJID), Vol. 1, No. 2, 2012.
- N. Jamaliah, I. “Automated Tajweed Checking rules engine for Quranic verse Recitation”, MCS Thesis, University of Malaya, Faculty of Computer Science & Information Technology, Department of Computer System & Technology, Kuala Lumpur, April 2010.
- Alghmadi M., "KACST Arabic Phonetic Database, "The Fifteenth International Congress of Phonetics Science, Barcelona 2003, pp 3109-3112.
- Yahya O. Mohamed ELHADJ,’’ E-HALAGAT: AN E-LEARNING SYSTEM FOR TEACHING THE HOLY QURAN’’, TOJET: The Turkish Online Journal of Educational Technology – January 2010, volume 9 Issue 1.
- Noureddine Aloui, Souha Bousselmi, Adnane Cherif," Speech Compression Based on Discrete Walsh Hadamard Transform", IJIEEB, vol.5, no.3, pp.59-65, 2013. DOI: 10.5815/ijieeb.2013.03.07
- Geeta Nijhawan, M.K Soni,"Real Time Speaker Recognition System for Hindi Words", IJIEEB, vol.6, no.2, pp.35-40, 2014. DOI: 10.5815/ijieeb.2014.02.04
- Alkhatib. B, Kawas. M, Alnahhas. A, Bondok. R, Kannous R,"BUILDING AN ASSISTANT MOBILE APPLICATION FOR TEACHING ARABIC PRONUNCIATION USING A NEW APPROACH FOR ARABIC SPEECH RECOGNITION", Journal of Theoretical and Applied Information Technology, Vol.95, No.3, pp.478-489, 2017.
- K. Seymore, S. Chen, S. Doh, M. Eskenazi, E. Gouvea, B. Raj, M. Ravishankar, R. Rosen-feld, M. Siegler, R. Stern, and E. Thayer, “The 1997 CMU Sphinx-3 English BroadcastNews Transcription System,” in Proc. of the DARPA Broadcast News Transcription andUnderstanding Workshop, Lansdowne, USA, 1998.
- D. Huggins-Daines, M. Kumar, A. Chan, A. Black, M. Ravishankar, and A. Rudnicky, “Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand Held Devices,” in Proc. of the ICASSP, Toulouse, France, 2006.
- The CMU-Cambridge Statistical Language Modeling Toolkit v2, [Online], http://www.speech.cs.cmu.edu/SLM/toolkit_documentation.html.
- M. Elshafei Ahmed, " Toward an Arabic Text-to-Speech System", in the special issue on Arabization, the Arabian Journal of Science and Engineering, Vol. 16, No. 4B, pp.565-583, October 1991.
- Moustafa, Al-Muhtaseb Husni, Al-Ghamdi Mansour," Techniques for high quality Arabic speech synthesis ", Information Sciences 140(3-4): 255-267 (2002).