Contribution of Word length in Substitution Error Pattern analysis of Punjabi Typed Text

Authors

  • Bhagat M, Department of Computer Science and Engineering, Punjab University SSG Regional Centre, Hoshiarpur, India

DOI:

https://doi.org/10.26438/ijcse/v6i5.11831185

Keywords:

Addak, Gurmukhi, Non-word, Bindi

Abstract

Spelling error pattern analysis of a language is useful in language-related technologies such as natural language interfaces, machine translation, optical character recognition, and spell checking and correction. It covers the analysis of various error types (insertion, deletion, transposition, substitution, run-on, and split-word errors), positional analysis, word-length effects, phonetic errors, first-position error analysis, and keyboard effects. This paper focuses on the effect of word length on substitution error patterns in Punjabi, using statistical error analysis of Punjabi typed text. It also gives a brief overview of the effect of word length on non-word errors in Punjabi typed text. The analysis is based on 20,000 misspelled words generated by typists.
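
As an illustration of the kind of tally the abstract describes, the following is a minimal sketch and not the paper's actual procedure. It assumes each misspelling is paired with its intended word, classifies the pair as one of the four single-error types named above, and counts substitution errors by the length of the intended word. The function names and sample pairs are hypothetical, and word length is measured in Unicode code points, which for Gurmukhi text counts vowel signs and marks such as Addak and Bindi as separate units.

```python
from collections import Counter


def classify_single_error(wrong: str, right: str) -> str:
    """Classify a (misspelling, intended word) pair as a single-error type.

    Returns "substitution", "transposition", "insertion", "deletion",
    "none" (identical strings), or "other" (multi-error or unclassified).
    """
    if wrong == right:
        return "none"
    lw, lr = len(wrong), len(right)
    if lw == lr:
        # Same length: look at the positions where the strings differ.
        diffs = [i for i in range(lr) if wrong[i] != right[i]]
        if len(diffs) == 1:
            return "substitution"
        if (len(diffs) == 2 and diffs[1] == diffs[0] + 1
                and wrong[diffs[0]] == right[diffs[1]]
                and wrong[diffs[1]] == right[diffs[0]]):
            return "transposition"
        return "other"
    if lw == lr + 1:
        # One extra character typed: removing it recovers the intended word.
        if any(wrong[:i] + wrong[i + 1:] == right for i in range(lw)):
            return "insertion"
    if lw + 1 == lr:
        # One character missing: removing it from the intended word gives the typo.
        if any(right[:i] + right[i + 1:] == wrong for i in range(lr)):
            return "deletion"
    return "other"


def substitution_counts_by_length(pairs):
    """Count substitution errors keyed by the length of the intended word."""
    counts = Counter()
    for wrong, right in pairs:
        if classify_single_error(wrong, right) == "substitution":
            counts[len(right)] += 1
    return dict(counts)


# Hypothetical sample data (Latin placeholders, not from the paper's corpus).
sample_pairs = [("cot", "cat"), ("hoise", "house"), ("cta", "cat"), ("dig", "dog")]
length_table = substitution_counts_by_length(sample_pairs)
```

Dividing each count by the number of misspellings of that length would give a per-length substitution rate; whether the paper normalizes in this way is not stated in the abstract.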

Published

2025-11-13

How to Cite

[1]
M. Bhagat, “Contribution of Word length in Substitution Error Pattern analysis of Punjabi Typed Text”, Int. J. Comp. Sci. Eng., vol. 6, no. 5, pp. 1183–1185, Nov. 2025.

Section

Research Article