Development of Thesaurus for Hindi

Authors

  • Kaur M Department of Computer Science, Punjabi university, Patiala, India

DOI:

https://doi.org/10.26438/ijcse/v7i1.6772

Keywords:

Thesaurus, Hindi language, Synonyms, Antonyms

Abstract

NLP is a vast field and thesaurus is its integral part. Thesaurus is a software tool which is inbuilt in few word processors that provides synonyms for selected words. A thesaurus is used on a computer while writing an e-mail, letter, or paper to find an alternative meaning for words. A thesaurus is a reference work that lists the synonyms and sometimes antonyms of words. Synonyms are words with similar meanings, and antonyms are words with opposite meanings. The research work in the paper elaborates the development of thesaurus in relevance with the Hindi language. The paper focuses on the development of framework which may assist the people finding it difficult to write and dealing with Hindi.

References

[1] Chancharoen, K., Tannin, N., &Sirinaovakul, B., Sentence based machine translation for English-Thai. “Circuits and Systems”, 1998. IEEE APCCAS 1998. The 1998 IEEE Asia-Pacific Conference on (pp. 141-144). IEEE, 1998.

[2] Shirai, S., & Yamamoto, K, “Linking English words in two bilingual dictionaries to generate another language pair dictionary”, Proceedings of ICCPOL, pp. 174-179, 2001.

[3] Sahoo, K.,Vidyasagar, V. E., “Kannada WordNet-A lexical database”, TENCON 2003.Conference on Convergent Technologies for the Asia-Pacific Region (Vol. 4, pp. 1352-1356).IEEE, 2003.

[4] Annam, S. R., Choudhury, M., Sarkar, S., &Basu, A., “ABHIDHA: an extended WordNet for Indo Aryan languages”, Research Issues in Data Engineering: Multi-lingual Information Management,RIDE-MLIM Proceedings. 13th International Workshop (pp. 1-8), 2003.

[5] Fattah, M. A., Ren, F., & Shingo, K., “Internet archive as a source of a bilingual dictionary”, In Information Technology: Coding and Computing, Proceedings ITCC 2004.International Conference on (Vol. 2, pp. 298-302).IEEE, 2004.

[6] Banek, M., Vrdoljak, B., &Tjoa, A. M. (2007, June). Using ontologies for measuring semantic similarity in data warehouse schema matching process. In Telecommunications, ConTel 2007. 9th International Conference on (pp. 227-234). IEEE, 2007.

[7] Dai, L., Liu, B., Xia, Y., & Wu, S. (2008, August). Measuring semantic similarity between words using HowNet.In Computer Science and Information Technology, ICCSIT`08. International Conference on (pp. 601-605). IEEE, 2008.

[8] Isahara, H., Bond, F., Uchimoto, K., Utiyama, M., & Kanzaki, K. Development of the Japanese WordNet, 2008.

[9] Jin, P., Li, F., Zhu, D., Wu, Y., & Yu, S. (2008, October). Exploiting external knowledge sources to improve kernel-based word sense disambiguation. In Natural Language Processing and Knowledge Engineering,. NLP-KE`08. International Conference on (pp. 1-8). IEEE, 2008.

[10] Kulkarni, M., Dangarikar, C., Kulkarni, I., Nanda, A., & Bhattacharyya, P. (2010, January). Introducing Sanskrit wordnet.In Proceedings of the 5th Global Wordnet Conference (GWC 2010), Narosa, Mumbai, (pp. 287-294), 2010.

[11] Chowdhury, G. G., “Natural language processing”, Annual review of information science and technology, 37(1), 51-89, 2003.

[12] Slawsky, D., “Building a keyword library for a description of visual assets: Thesaurus basics”, Journal of Digital Asset Management, 3(3), pp. 130-138, 2007.

[13] Kilgarriff, A., “Thesauruses for natural language processing.”, In Natural Language Processing and Knowledge Engineering, 2003.Proceedings. 2003 International Conference on (pp. 5-13), 2003.

[14] Bradeško, L., Dali, L., Fortuna, B., Grobelnik, M., Mladenić, D., Novalija, I., &Pajntar, B., “Contextualized question answering”, Journal of computing and information technology, 18(4), pp. 325-332, 2010.

[15] Tayal, A., “THESAURUS FOR INDIAN LANGUAGES AND CONVERSION RULES DURING DESIGN OF PUNJABI THESAURUS”, Journal of Global Research in Computer Science, 2(7), pp. 38-41, 2011.

[16] Ramírez, J., Asahara, M., & Matsumoto, Y., Japanese-Spanish thesaurus construction using English as a pivot. arXiv preprint arXiv:1303.1232, 2013.

[17] Mohd, M., Zakr, H., Abidin, N. Z., Tiun, S., &Hisham, A. I. I. (2013, December).Word sense disambiguation for English Quranic IR system. NOORIC 2013: Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences, 2013.

[18] Panchal, P., Panchal, N., &Samani, H., Development of Gujarati WordNet for Family of Words. Development, 1(4), 2014.

[19] Kanakaraj, M., &Kamath, S. S. (2014, December). NLP based intelligent news search engine using information extraction from e-newspapers. In Computational Intelligence and Computing Research (ICCIC), IEEE International Conference on (pp. 1-5), 2014.

[20] Redkar, H., Singh, S., Joshi, N., Ghosh, A., & Bhattacharyya, P., “Indowordnet Dictionary: An Online Multilingual Dictionary using Indowordnet”, Proceedings of the 12th International Conference on Natural Language Processing (pp. 71-78), 2015.

Downloads

Published

2019-01-31
CITATION
DOI: 10.26438/ijcse/v7i1.6772
Published: 2019-01-31

How to Cite

[1]
M. Kaur, “Development of Thesaurus for Hindi”, Int. J. Comp. Sci. Eng., vol. 7, no. 1, pp. 67–72, Jan. 2019.

Issue

Section

Research Article