Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents
DOI:
https://doi.org/10.26438/ijcse/v7i6.314318Keywords:
Spell Checking, Hybrid approach for Spell Checking, N-Gram Approach, Rule Based Approach, Edit distance approachAbstract
Spell checking is a very important phase of any document processing system and Natural Language Processing. Spell Checking is a process to find the incorrect spells in a text document and to correct that particular incorrect spelling. There are various spell checking systems for various languages Like Hindi, Punjabi, English, French, Germen that can detect and correct the spell from a particular document. In this paper, we proposed a hybrid algorithm to detect and correct misspelled words from a text document written in three languages Hindi, English and Punjabi. Hybrid approach is a combination of various approaches like Dictionary lookup approach, Edit Distance Approach, Rule based approach and N-Gram approach. Proposed system can detect and correct the misspelled words from three given languages. A collision detection and correction system for alternates for misspell words has been also provided. Performance of proposed system is checked on various inputs collected from various books, websites etc. Results of the proposed system are evaluated on these outputs which have accuracy values higher than that of existing system.
References
[1] Ritika Mishra, Navjot Kaur, Design and Implementation of Online Punjabi Spell Checker Based on Dynamic Programming, Volume 3, Issue 8, August 2013, ISSN: 2277 128X, International Journal of Advanced Research in Computer Science and Software Engineering
[2] Neha Gupta, Pratistha Mathur, Spell Checking Techniques in NLP: A Survey, Volume 2, Issue 12, December 2012 , ISSN: 2277 128X, International Journal of Advanced Research in Computer Science and Software Engineering
[3] Baljeet Kaur, Review On Error Detection and Error Correction Techniques in NLP: Volume 4, Issue 6, June 2014 ISSN: 2277 128X, International Journal of Advanced Research in Computer Science and Software Engineering.
[4] Rupinderdeep Kaur and Parteek Bhatia, “Design and Implementation of SUDHAAR-Punjabi Spell Checker,” International Journal of Information and Telecommunication Technology, Vol. 1, Issue 15 May, 2010.
[5] S. Dasgupta, C.H. Papadimitriou, and U.V. Vazirani, `Algorithms`, p173, available at http:/ / www.cs.berkeley.edu/ ~vazirani/ algorithms.html.
[6] Neha Gupta &PratisthaMathur,“Spell Checking Techniques in NLP: A Survey,” International Journal of Advanced Research in Computer Science and Software Engineering, Vol. 2, Issue 12, December 2012.
[7] Gurpreet Singh Lehal, “Design and Implementation of Punjabi Spell Checker”, International Journal of Systemics, Cybemetics and Infomatics, 2007.
[8] Amit Sharma & Pulkit Jain, “Hindi Spell Checker”, Indian Institute of Technology Kanpur, April 17, 2013.
[9] MeenuBhagat, (2007), “Spelling Error Pattern Analysis of Punjabi Typed Text”, Thesis Report, Thapar University, Patiala.
[10] F.J. Damerau (1964), “A Technique for Error Detection and Correction of Spelling Errors”, Communication ACM, pp. 171-176.
[11] Monisha Das, S. Borgohain, JuliGogoi, S. B. Nair (2002), “Design and Implementation of a Spell Checker for Assamese”,lec, pp. 156, Language Engineering Conference (LEC’02).
[12] Morris, Robert & Cherry, Lorinda L, “Computer Detection of typographic errors”, IEEE Trans Professional Communications, vol. PC-18, no. 1, pp 54-64, March 1975.
[13] R.E. Gorin (1971), “SPELL: A spelling checking and correction program”, Online documentation for the DEC-10 computer.
[14] K. Kukich (1992) “Techniques for automatically correcting words in text”. ACM Computing Surveys. 24(4): 377-439.
[15] Peterson James (1980), “Computer Programs for Detecting and Correcting Spelling Errors”, Computing Practices, Communications of the ACM.
[16] G S Lehal & MeenuBhagat, “Spelling Error Pattern Analysis of Punjabi Typed Text”, In Proceedings of International Symposum on Machine Translation, NLP and TSS, pp. 128-141, 2007.
[17] Jesus Vilares& Manuel Vilares, “Managing Misspelled Queries in IR Application,” Issue 8, October 2010.
[18] Youssef Bassil& Mohammad Alwani, “Context-sensitive Spelling Correction using Google Web IT 5-Gram Information,” Department of Computer and Information Science, Vol. 5,No.3, May 2012.G. Eason, B. Noble, and I.N. Sneddon, “On certain integrals of Lipschitz-Hankel type involving products of Bessel functions,” Phil. Trans. Roy. Soc. London, vol. A247, pp. 529-551, April 1955.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
