Data Classification Approach For Text Analysis and Its Ambiguity

Authors

  • Supriya M Yawalkar Dept. of Computer Science and Engineering, P. R. Pote (Patil) College Of Engineering And Management, Amravati, Maharashtra, India
  • AS Kapse Dept. of Computer Science and Engineering, P. R. Pote (Patil) College Of Engineering And Management, Amravati, Maharashtra, India

DOI:

https://doi.org/10.26438/ijcse/v8i1.141145

Keywords:

Ambiguity, cyber hate, fuzzy, Sentiment analysis

Abstract

Sentiment analysis or opinion mining is one of the fastest growing fields with its demand and potential benefits that is increasing every day. With the onset of the internet and modern technology, there has been a vigorous growth in the amount of data. Each individual is able to express his/her own ideas freely on social media. All of this data can be analysed and used in order to draw benefits and quality information. In this paper, the focus is on cyber-hate classification based on for public opinion or views, since the spread of hate speech using social media can have disruptive impacts on social sentiment analysis. In particular, here proposing a modified fuzzy approach with two stage training for dealing with text ambiguity and classifying three type approach positive, negative and neutral sentiment, and compare its performance with those popular methods as well as some existing fuzzy approaches.

References

[1] R. Zhao, A. Zhou, and K. Mao, “Automatic detection of cyber bullying on social networks based on bullying features,” in the Proceedings of the 17th International Conference on Distributed Computer Network, pp. 4-7, Jan. 2016.

[2] K. Reynolds, A. Kontostathis, and L. Edwards, “Using machine learning to detect cyberbullying,” in Proceedings of 10th International Conference on Machine Learning Application, Honolulu, USA, pp. 241-244,Dec. 2011.

[3] J. H. Park and P. Fung, “One-step and two-step classification for abusive language detection on Twitter,” in Proc. 1st Workshop Abusive Lang. Online, Vancouver, BC, Canada, pp. 41–45,Aug 2016.

[4] H. Mubarak, K. Darwish, and W. Magdy, “Abusive language detection on Arabic social media,” in Proceeding of 1st Workshop Abusive Lang. Online, Vancouver, BC, Canada, pp. 52–56,Aug 2017.

[5] B. Pang, L. Lee, and S. Vaithyanathan, “Thumbs up? Sentiment classification using machine learning techniques,” in Proceedings of Conference Empirical Methods Natural Lang. Process. (ACL), pp. 79-86,2002.

[6] C. Jefferson, H. Liu, and M. Cocea, “Fuzzy approach for sentiment analysis,” in Proc. IEEE International Conference on Fuzzy Syst., Naples, Italy, pp. 1–6,July 2017.

[7] P. Burnap and M. L. Williams, “Cyber hate speech on Twitter: An application of machine classification and statistical modeling for policy and decision making,” Policy Internet, vol. 7, issue 2, pp. 223–242, 2015.

[8] P. Burnap and M. L. Williams, “Us and them: Identifying cyber hate on Twitter across multiple protected characteristics,” EPJ Data Sci., vol. 5, Issue 1, pp. 11-15, 2016.

[9] A. Agarwal, B. Xie, I. Vovsha, O. Rambow, and R. Passonneau, “Sentiment analysis of Twitter data,” in Proceedings of Workshop Language on Social Media, Stroudsburg, PA, USA, pp. 30–38,June 2011.

[10] I. Kwok and Y. Wang, “Locate the hate: Detecting tweets against blacks,” in Proceedings of 27th AAAI Conference on Artificial Intelligence, Bellevue, WA, USA, pp. 1621–1622,July 2013.

[11] A. Mahmud, K. Z. Ahmed, and M. Khan, “Detecting flames and insults in text,” in Proeeding of 6th International Conference on Natural Language Processing., Gothenburg, Sweden, pp. 25–27, Aug 2008.

[12] B. Gambäck and U. K. Sikdar, “Using convolutional neural networks to classify hate-speech,” in Proc. 1st Workshop Abusive Lang. Online, Vancouver, BC, Canada, Aug, pp. 85–90,Aug 2017.

[13] D. Chandran, K. A. Crockett, D. Mclean, and A. Crispin, “An automatic corpus based method for a building multiple fuzzy word dataset,” in Proceedings of IEEE International Conference on Fuzzy Syst., Istanbul, Turkey, pp. 1–8, Aug 2015.

[14] I. B. Sassi, S. B. Yahia, and S. Mellouli, “Fuzzy classification-based emotional context recognition from online social networks messages,” in Proc. IEEE Int. Conf. Fuzzy Syst., Naples, Italy, pp. 1–6,July 2017.

[15] S. Asur and B. Huberman, “Predicting the future with social media,” in International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010, vol. 1, pp. 492–499,2010.

[16] D. Hardt and J. Wulff, “What is the meaning of 5 *’s? An investigation of the expression and rating of sentiment,” in Proceedings of KONVENS 2012, J. Jancsary, Ed. OGAI, pATHOS 2012 workshop,pp. 319-326, 2012.

[17] H. Chen, P. De, Y. Hu, and B.-H. Hwang, “Sentiment revealed in social media and its effect on the stock market,” in Statistical Signal Processing Workshop (SSP), 2011 IEE, pp.25-28, 2011.

[18] S. P. Robertson, “Changes in referents and emotions over time in election-related social networking dialog,” in System Sciences (HICSS), on 44th Hawaii International Conference, pp. 1–9, 2011.

[19]M. Salath´e and S. Khandelwal, “Assessing vaccination sentiments with online social media: implications for infectious disease dynamics and control,” PLoS computational biology, vol. 7, no. 10, 2011.

[20] T. Menezes, C. Roth, J.-P. Cointet, “Finding the semantic-level precursors on a blog network.” IJSCCPS, vol. 1, pp. 115–134, 2011.

[21] Han Liu, Pete Burnap, Wafa Alorainy, Matthew L.Williams, ”A fuzzy approach to text classification with two stage training for ambiguous instances”, in proceeding of IEEE Transactions On Computational Social Systems ,vol. 6,no.2 ,pp. 227-240,2019.

[22] R.S. Walse, G.D. Kurundkar, P. U. Bhalchandra,”A review: Design and development of novel techniques for clustering and classification of data”, International journal of scientific research in computer sciences and engineering,Vol.06, Special Issue.01 , pp.19-22, Jan-2018.

[23]V.K.Jin, N.Tripathi,”Speech feature analysis and biometric person identification in multilingual environment”, International journal of scientific research in network security and communication,Vol.6, Issue 1,pp.7-11, Feb 2011.

[24]Bindushree V,Rashmi G.R, Uma H.R, ”Analysis of text recognition with data mining technique”, International journal of scientific research in computer science and engineering,Vol.7,Issue.6,pp. 40-42,Dec 2019.

Downloads

Published

2020-01-31
CITATION
DOI: 10.26438/ijcse/v8i1.141145
Published: 2020-01-31

How to Cite

[1]
S. M. Yawalkar and A. Kapse, “Data Classification Approach For Text Analysis and Its Ambiguity”, Int. J. Comp. Sci. Eng., vol. 8, no. 1, pp. 141–145, Jan. 2020.