Development of Thesaurus for Hindi

Authors

Kaur M Department of Computer Science, Punjabi university, Patiala, India

DOI:

https://doi.org/10.26438/ijcse/v7i1.6772

Keywords:

Thesaurus, Hindi language, Synonyms, Antonyms

Abstract

NLP is a vast field and thesaurus is its integral part. Thesaurus is a software tool which is inbuilt in few word processors that provides synonyms for selected words. A thesaurus is used on a computer while writing an e-mail, letter, or paper to find an alternative meaning for words. A thesaurus is a reference work that lists the synonyms and sometimes antonyms of words. Synonyms are words with similar meanings, and antonyms are words with opposite meanings. The research work in the paper elaborates the development of thesaurus in relevance with the Hindi language. The paper focuses on the development of framework which may assist the people finding it difficult to write and dealing with Hindi.

References

[1] Chancharoen, K., Tannin, N., &Sirinaovakul, B., Sentence based machine translation for English-Thai. “Circuits and Systems”, 1998. IEEE APCCAS 1998. The 1998 IEEE Asia-Pacific Conference on (pp. 141-144). IEEE, 1998.

[2] Shirai, S., & Yamamoto, K, “Linking English words in two bilingual dictionaries to generate another language pair dictionary”, Proceedings of ICCPOL, pp. 174-179, 2001.

[3] Sahoo, K.,Vidyasagar, V. E., “Kannada WordNet-A lexical database”, TENCON 2003.Conference on Convergent Technologies for the Asia-Pacific Region (Vol. 4, pp. 1352-1356).IEEE, 2003.

[4] Annam, S. R., Choudhury, M., Sarkar, S., &Basu, A., “ABHIDHA: an extended WordNet for Indo Aryan languages”, Research Issues in Data Engineering: Multi-lingual Information Management,RIDE-MLIM Proceedings. 13th International Workshop (pp. 1-8), 2003.

[5] Fattah, M. A., Ren, F., & Shingo, K., “Internet archive as a source of a bilingual dictionary”, In Information Technology: Coding and Computing, Proceedings ITCC 2004.International Conference on (Vol. 2, pp. 298-302).IEEE, 2004.

[6] Banek, M., Vrdoljak, B., &Tjoa, A. M. (2007, June). Using ontologies for measuring semantic similarity in data warehouse schema matching process. In Telecommunications, ConTel 2007. 9th International Conference on (pp. 227-234). IEEE, 2007.

[7] Dai, L., Liu, B., Xia, Y., & Wu, S. (2008, August). Measuring semantic similarity between words using HowNet.In Computer Science and Information Technology, ICCSIT`08. International Conference on (pp. 601-605). IEEE, 2008.

[8] Isahara, H., Bond, F., Uchimoto, K., Utiyama, M., & Kanzaki, K. Development of the Japanese WordNet, 2008.

[9] Jin, P., Li, F., Zhu, D., Wu, Y., & Yu, S. (2008, October). Exploiting external knowledge sources to improve kernel-based word sense disambiguation. In Natural Language Processing and Knowledge Engineering,. NLP-KE`08. International Conference on (pp. 1-8). IEEE, 2008.

[10] Kulkarni, M., Dangarikar, C., Kulkarni, I., Nanda, A., & Bhattacharyya, P. (2010, January). Introducing Sanskrit wordnet.In Proceedings of the 5th Global Wordnet Conference (GWC 2010), Narosa, Mumbai, (pp. 287-294), 2010.

[11] Chowdhury, G. G., “Natural language processing”, Annual review of information science and technology, 37(1), 51-89, 2003.

[12] Slawsky, D., “Building a keyword library for a description of visual assets: Thesaurus basics”, Journal of Digital Asset Management, 3(3), pp. 130-138, 2007.

[13] Kilgarriff, A., “Thesauruses for natural language processing.”, In Natural Language Processing and Knowledge Engineering, 2003.Proceedings. 2003 International Conference on (pp. 5-13), 2003.

[14] Bradeško, L., Dali, L., Fortuna, B., Grobelnik, M., Mladenić, D., Novalija, I., &Pajntar, B., “Contextualized question answering”, Journal of computing and information technology, 18(4), pp. 325-332, 2010.

[15] Tayal, A., “THESAURUS FOR INDIAN LANGUAGES AND CONVERSION RULES DURING DESIGN OF PUNJABI THESAURUS”, Journal of Global Research in Computer Science, 2(7), pp. 38-41, 2011.

[16] Ramírez, J., Asahara, M., & Matsumoto, Y., Japanese-Spanish thesaurus construction using English as a pivot. arXiv preprint arXiv:1303.1232, 2013.

[17] Mohd, M., Zakr, H., Abidin, N. Z., Tiun, S., &Hisham, A. I. I. (2013, December).Word sense disambiguation for English Quranic IR system. NOORIC 2013: Taibah University International Conference on Advances in Information Technology for the Holy Quran and Its Sciences, 2013.

[18] Panchal, P., Panchal, N., &Samani, H., Development of Gujarati WordNet for Family of Words. Development, 1(4), 2014.

[19] Kanakaraj, M., &Kamath, S. S. (2014, December). NLP based intelligent news search engine using information extraction from e-newspapers. In Computational Intelligence and Computing Research (ICCIC), IEEE International Conference on (pp. 1-5), 2014.

[20] Redkar, H., Singh, S., Joshi, N., Ghosh, A., & Bhattacharyya, P., “Indowordnet Dictionary: An Online Multilingual Dictionary using Indowordnet”, Proceedings of the 12th International Conference on Natural Language Processing (pp. 71-78), 2015.

Downloads

PDF ⁰

Published

2019-01-31

CITATION

DOI: 10.26438/ijcse/v7i1.6772

Published: 2019-01-31

How to Cite

[1]

M. Kaur, “Development of Thesaurus for Hindi”, Int. J. Comp. Sci. Eng., vol. 7, no. 1, pp. 67–72, Jan. 2019.

Download Citation

Issue

Vol. 7 No. 1 (2019): IJCSE January Edition

Section

Research Article

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.

Development of Thesaurus for Hindi

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Journal Information

UGC Gazette Regulation

Join Editorial Board

Information

Current Issue

Keywords