An Approach for Improving Accuracy of Machine Translation using WSD and GIZA
DOI:
https://doi.org/10.26438/ijcse/v5i10.256259Keywords:
WSD, Machine Translation, Corpus, Supervised, UnsupervisedAbstract
Word Sense Disambiguation (WSD) is a challenging problem of Natural Language Processing (NLP). Though there are lots of algorithms for WSD available, still little work is carried out for choosing optimal algorithm for that. The job of word sense disambiguation is to decide the accurate meaning of an ambiguous term in a particular circumstance. When WSD is used in machine translation, an accurate translation in the resultant linguistic must be determined for an ambiguous term entry in the original language. Therefore Word Sense Disambiguation remains one of the most common real life problems that are associated to natural language processing which needs to be resolved efficiently.Unsupervised techniques use online dictionary for learning, and supervised techniques use manual learning sets. As there are some advantages and disadvantages of supervised learning and unsupervised learning, aim of this paper is to disambiguate the ambiguous word by using the hybrid approach for WSD. We have made use of parallel corpus and aligned the text by using GIZA.
References
A.R. Pal and D. Saha, “Word Sense Disambiguation: A Survey”, International Journal of Control Theory and Computer Modeling (IJCTCM), Vol.5, No.3, pp. 1-16, 2015.
A. Kundu, A. Singh, R. Shekhar, “A Hybrid Approach to Word Sense Disambiguation Combining Supervised and Unsupervised Learning”, International Journal of Artificial Intelligence & Applications (IJAIA), Vol. 4, No. 4, pp. 89-101, 2013.
A.R. Pal, A. Munshi, D. Saha, “An Approach To Speed-Up The Word Sense Disambiguation Procedure Through Sense Filtering”, International Journal of Instrumentation and Control Systems (IJICS) Vol.3, No.4, pp. 29-41, 2013.
E. Agirre & G. Rigau, "Word sense disambiguation using conceptual density", In the Proceedings of the 16th International Conference on Computational Linguistics (COLING), Copenhagen, Denmark, pp. 16-22, 1996.
E. Agirre, & D. Martínez, “Learning class-to-class selectional preferences”, In the Proceedings of the Conference on Natural Language Learning, Toulouse, France, pp. 15–22, 2001.
S. Banerjee & T. Pedersen, “An adaptive Lesk Algorithm for Word Sense Disambiguation Using WordNet”, In the Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing, London, UK, pp. 136-145, 2002, ISBN: 3-540-43219-1..
G. Escudero, L.M`arquez and G. Rigau, “Naïve Bayes and Exemplar-based approaches to Word Sense Disambiguation Revisited”, In the Proceedings of the 14th European Conference on Artificial Intelligence, pp. 421-425, 2000.
R. Navigli, “word sense disambiguation: a survey”, ACM computing surveys, 41(2), ACM press, pp. 1-69, 2009.
E. Agirre and A. Soroa, “Personalizing PageRank for Word Sense Disambiguation,” In the Proceedings of the 12th Conference European Chapter of the Association for Computational Linguistics, Greece, pp. 33–41, 2009.
R. Mihalcea and D.I. Moldovan, “Pattern Learning and Automatic Feature Selection for Word Sense Disambiguation”, In the Proceedings of the Second international Workshop on Evaluating Word Sense Disambiguation Systems (SENSEVAL-2), Texas, pp. 127-130, 2001.
R. Navigli and P. Velardi, “Structural Semantic Interconnections: A Knowledge-Based Approach to Word Sense Disambiguation”, Ieee Transactions On Pattern Analysis And Machine Intelligence, Washington, DC, USA, Vol. 27, No. 7, 2005.
X. Zhou and H. Han, “Survey of Word Sense Disambiguation Approaches”, In the Proceedings of the 18th International FLAIR Conference, American Association for Artificial Intelligence, Philadelphia, pp. 307-313, 2005.
D. Chiang, “A hierarchical phrase-based model for statistical machine translation”, In Proceedings of the ACL-05, USA, pp. 263–270, 2005.
M. Galley, M. Hopkins, K. Knight, and D. Marcu. “What’s in a translation rule?”, In the Proceedings of the NAACL-04, pp. 273–280, 2004.
K. Philipp, et.al, “Moses: Open source toolkit for statistical machine translation”, In the Proceedings of the ACL, Demonstration Session, pp. USA, 177–180. 2007.
F. Och and H. Ney, “A systematic comparison of various statistical alignment models”, International Journal of Computational Linguistics, USA, Vol. 29, Issue 1, pp. 19–51, 2003.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
