Improving Existing Punjabi Morphological Analyzer using N-gram

Authors

  • SK Sharma Dept. of Computer Science and Applications, DAV University, Jalandhar, India

DOI:

https://doi.org/10.26438/ijcse/v5i9.171174

Keywords:

Morphological analyzer, Morph, N-gram approach

Abstract

Morphological analysis is an essential tool for almost all Natural Language Processes like POS tagging, Grammar checking, Sentence simplification, generation of Treebank and parsing. In this research article, author has used N-gram statistical technique to improve the existing morphological analyzer. The main factor that reduces the accuracy of morphological analyzer is presence of unknown words. In this research article author has used n-gram approach for detecting the POS tag of unknown word. The results shows an average precision of 82.34, recall 70.20 and F-measure 75.74.

References

. Bharati, Akshar, Amba P. Kulkarni, Vineet Chaitanya. (1998a).Challenges in Developing Word Analyzers for Indian Languages, Presented at Workshop on Morphology, CIEFL, Hyderabad, July 1998.

. Bharati, Akshar, Rajeev Sangal and S.M. Bendre (1998b). Some Observations on Corpora of Some Indian Languages. Knowledge Based Computing Systems, Tata McGraw-Hill.

. Goldsmith, John. (2001). Unsupervised Learning of the Morphology of a Natural Language. Computational Linguistics, Vol 27, No. 2, pp 153-198.

. Daniel Jurafsky, James H. Martin. Speech and Language Processing:An introduction to speech recognition, Natural Language Processing, and Computational Linguistics. LTRC, IIIT Hyderabad http://ltrc.iiit.ac.in

. Gill Mandeep Singh, Lehal Gurpreet Singh, Joshi S.S., A full form lexicon based Morphological Analysis and generation tool for Punjabi, International Journal of Cybernatics and Informatics, Hyderabad, India,October 2007, pp. 38-47

. Brants, TnT – A statistical part-of-speech tagger. In Proc. Of the 6th Applied NLP Conference, pp. 224-231, 2000

. Cutting, J. Kupiec, J. Pederson and P. Sibun, A practical part of-speech tagger. In Proc. of the 3rd Conference on Applied NLP, pp. 133-140, 1992

. Dermatas and K. George, Automatic stochastic tagging of natural language texts. Computational Linguistics, 21(2): 137-163, 1995

. Ekbal, Asif, and S. Bandyopadhyay,”Lexicon Development and POS tagging using a Tagged Bengali News Corpus”, In Proc. of FLAIRS-2007, Florida, 261-263, 2007

. E. Dermatas and K. George, Automatic stochastic tagging of Natural language texts, Computational Linguistics, 21(2): 137-163, 1995

. Ekbal Asif, et.al, “Bengali Part of Speech Tagging using Conditional Random Field” in Proceedings of the 7th International Symposium of Natural Language Processing (SNLP-2007), Pattaya, Thailand, 15 December 2007, pp.131-136

Downloads

Published

2025-11-12
CITATION
DOI: 10.26438/ijcse/v5i9.171174
Published: 2025-11-12

How to Cite

[1]
S. Sharma, “Improving Existing Punjabi Morphological Analyzer using N-gram”, Int. J. Comp. Sci. Eng., vol. 5, no. 9, pp. 171–174, Nov. 2025.

Issue

Section

Research Article