Using Lexicon and Random Forest Classifier for Twitter Sentiment Analysis

Authors

  • M Thenmozhi Dept of Computer Science and Engineering, Pondicherry Engineering College, Puducherry, India
  • R Indira Dept of Computer Science and Engineering, Pondicherry Engineering College, Puducherry, India
  • R Dharani Dept of Computer Science and Engineering, Pondicherry Engineering College, Puducherry, India

DOI:

https://doi.org/10.26438/ijcse/v7i6.591594

Keywords:

Sentiment Analysis, Sentiment Classification, Lexicon based Analysis, Sentiment Score

Abstract

Today users prefer blogs and review sites to purchase products online. Thus, user reviews are considered as an important source of information in sentiment analysis applications for decision making. Machine Learning and Lexicon based sentiment analysis are the two popular methods available in the literature. The Machine Learning based classifiers does not work for unlabelled dataset such as tweets. On the other hand existing Lexicon based sentiment analysis approaches are becoming less efficient due to data sparseness, low accuracy and non-consideration n-gram words. N-grams can improve the accuracy of sentiment classification. Following these limitations the proposed work provides a combination of Lexicon and Machine learning based approach to perform sentiment analysis on Twitter datasets.

References

[1] Chen, Yubo, Scott Fay, and Qi Wang. "The role of marketing in social media: How online consumer reviews evolve." Journal of interactive marketing, Vol.25, No. 2, pp. 85-94, 2011.

[2] Pang, Bo, and Lillian Lee. "Opinion mining and sentiment analysis." Foundations and Trends® in Information Retrieval, Vol.2, No. 1–2, pp. 1-135, 2008.

[3] Fang, Xing, and Justin Zhan. "Sentiment analysis using product review data." Journal of Big Data, Vol.2, no. 1, pp. 5, 2015.

[4] Vohra, S. M., and J. B. Teraiya. "A comparative study of sentiment analysis techniques." Journal JIKRCE, Vol.2, no. 2, pp.313-317, 2013.

[5] Agarwal, Basant, and Namita Mittal. "Machine learning approach for sentiment analysis." In Prominent feature extraction for sentiment analysis, pp. 21-45. Springer, Cham, 2016.

[6] Dey, Atanu, Mamata Jenamani, and Jitesh J. Thakkar. "Senti-N-Gram: An n-gram lexicon for sentiment analysis." Expert Systems with Applications, Vol. 103, pp.92-105, 2018.

[7] Al-Ayyoub, Mahmoud, Safa Bani Essa, and Izzat Alsmadi. "Lexicon-based sentiment analysis of Arabic tweets." IJSNM, Vol.2, no. 2, pp. 101-114, 2015.

[8] Trinh, Son, Luu Nguyen, Minh Vo, and Phuc Do. "Lexicon-based sentiment analysis of Facebook comments in Vietnamese language." In Recent developments in intelligent information and database systems, pp. 263-276. Springer, Cham, 2016.

[9] Asghar, Muhammad Zubair, Shakeel Ahmad, Maria Qasim, Syeda Rabail Zahra, and Fazal Masud Kundi. "SentiHealth: creating health-related sentiment lexicon using hybrid approach." SpringerPlus 5, no. 1, pp. 1139, 2016.

[10] Hamilton, William L., Kevin Clark, Jure Leskovec, and Dan Jurafsky. "Inducing domain-specific sentiment lexicons from unlabeled corpora." In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing, vol. 2016, p. 595. NIH Public Access, 2016.

[11] Khan, Farhan Hassan, Usman Qamar, and Saba Bashir. "A semi-supervised approach to sentiment analysis using revised sentiment strength based on SentiWordNet." Knowledge and information Systems, Vol. 51, no. 3, pp. 851-872, 2017.

[12] Zhang, Shunxiang, Zhongliang Wei, Yin Wang, and Tao Liao. "Sentiment analysis of Chinese micro-blog text based on extended sentiment dictionary." Future Generation Computer Systems, Vol. 8, pp. 395-403, 2018.

[13] Asghar, Muhammad Zubair, Aurangzeb Khan, Shakeel Ahmad, Maria Qasim, and Imran Ali Khan. "Lexicon-enhanced sentiment analysis framework using rule-based classification scheme." PloS one, Vol. 12, no. 2, e0171649, 2017.

[14] C. Nanda, M. Dua, “A Survey on Sentiment Analysis” International Journal of Scientific Research in Computer Science and Engineering, Vol.5, Issue.2, pp.67-70, 2017.

[15] Amit Palve, Rohini D.Sonawane, Amol D. Potgantwar, “Sentiment Analysis of Twitter Streaming Data for Recommendation using, Apache Spark“, International Journal of Scientific Research in Network Security and Communication, Vol.5 , Issue.3 , pp.99-103, Jun-2017.

Downloads

Published

2019-06-30
CITATION
DOI: 10.26438/ijcse/v7i6.591594
Published: 2019-06-30

How to Cite

[1]
M. Thenmozhi, I. R, and R. Dharani, “Using Lexicon and Random Forest Classifier for Twitter Sentiment Analysis”, Int. J. Comp. Sci. Eng., vol. 7, no. 6, pp. 591–594, Jun. 2019.

Issue

Section

Research Article