Dengue Prediction Using Tweets in India

Authors

  • Kumari S Dept of Computer Science and Information Technology, Vaugh Institute of Agricultural Engineering and Technology, Sam Higginbottom University of Agriculture, Technology & Sciences, Allahabad, Uttar Pradesh, India
  • K Jeberson Dept of Computer Science and Information Technology, Vaugh Institute of Agricultural Engineering and Technology, Sam Higginbottom University of Agriculture, Technology & Sciences, Allahabad, Uttar Pradesh, India
  • W Jeberson Dept of Computer Science and Information Technology, Vaugh Institute of Agricultural Engineering and Technology, Sam Higginbottom University of Agriculture, Technology & Sciences, Allahabad, Uttar Pradesh, India

DOI:

https://doi.org/10.26438/ijcse/v7i10.5763

Keywords:

Dengue, Weka, Classification

Abstract

In India, people have started using twitter and nowadays, its craze has overshadowed the users all day. In India, a Twitter user across India was predicted to be more than 34 million in 2019. Twitter data is a very huge amount of data that can be used for the prediction of various diseases. Tweets are strongly related to Dengue cases. Dengue is a viral-borne disease that is also one of the widespread waterborne diseases. Nowadays people are trying a lot to avoid being a victim of dengue. But this communicable disease has highly increased alongside the urbanization rate in the tropical rain forest region. In this research paper, we focused on the retrieval of tweets using a hashtag keyword using a free analytic tool Vicinitas. We collected a set of 102 tweets to train a classifier to identify dengue, record and predict the emergence and transmission of dengue in a population. WEKA is a collection-set for machine learning and it is free open-source software. In this research, we used the dengue datasets with a total of one hundred two instances of dengue and two attributes i.e., text and class to determine accuracy using the various classifying algorithms. For the best outcome, we used seven classification techniques for accuracy. The main methodology and the techniques we used for predicting the dengue are J48, Naïve Bayesian, SMO, and Random tree, ZeroR, Random Forest and REP Tree. We after evaluating various attributes of the result finally concluded that Bayes obtained the highest accuracy rate.

References

[1] Andrea villanes, Emily Giffiths, Michael Rappa, Christopher G. Healey, “Dengue fever surveillance in India using text mining in public media”, The American Journal of Tropical Medicine & Hygiene, Vol.98, Issue.1, pp.181-191, 2018.

[2] Wajeeha Farooqi, Sadaf Ali, “A critical study of selected classification algorithms for dengue fever and dengue hemorrhagic fever”, In the Proceedings of the 2013 IEEE 11th International conference on Frontiers of Information Technology (FIT 2013), USA, pp.140-145, 2013.

[3] M. Vidhyalakshmi, P. Radha, “Social HashTag Techniques Using Data Mining-A Survey” International Journal of Scientific Research in Computer Science and Engineering, Vol.6, Issue.3, pp.86-92, 2018

[4] N.Saravanan, Dr. V. Gayathri, “Classification of dengue dataset using J48 algorithm and ant colony based a J48 algorithm”, In the Proceeding of the 2017 International Conference on Inventive Computing and Informatics (ICICI 2017), India, pp.1062-1067, 2017.

[5] Tina R.Patil, S.S.Sherekar, “Performance analysis of naive Bayes and j48 classification for data classification”, International journal of computer science and applications, Vol.6, No.2, pp.256-267, 2013.

[6] Shameem Fathima, Nisar Hundewale, “Comparison of classification techniques-SVM and naive Bayes to predict the arboviral disease-dengue”, In the Proceedings of the 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW), Atlanta, GA, pp.538-539, 2011.

[7] Thypparampil Karunakaran Sajana, MarojuNavya, YVSSV.Gayathri, Nimgire Reshma, “Classification of dengue using machine learning techniques”, International Journal of Engineering and Technology, Vol.7, Issue.2.32, pp.212-218, 2018.

[8] Nandini. V, Sriranjitha. R, Yazhini. T. P, “Dengue detection and prediction system using data mining with frequency analysis”, 6th International Conference on Advances in Computing and Information Technology, pp.53-67, 2016.

[9] Nelofar Rehman, “Data Mining Techniques Method Algorithms and Tools”, International Journal of Computer Science and Mobile Computing, Vol.6, Issue.7, pp.227-231, 2017.

[10] Kashish Ara Shakil, Samiya Khan, Shadma Anis, Mansafalam, “Dengue disease prediction using weka data mining tool”, In the Proceeding of IIRAJ International Conference (ICCI-SEM-2K17), India, pp.48-59, 2015.

[11] Iqra Jahangir, Abdul-Basit, Abdul Hannan, Sameen Javed, “Prediction of dengue disease through data mining by using modified apriori algorithm”, In the Proceeding of ACM 4th International Conference of Computing for Engineering and Sciences (ICCES 2018), Malaysia, pp.1-4, 2018.

[12] R. Sanjudevi, D.Savitha, “Dengue Fever Prediction Using Classification Techniques”, International Research Journal of Engineering and Technology (IRJET 2018), Vol.6, Issue.2, pp.558-563, 2018.

[13] M.Bhavani, S.Vinod Kumar, “A data mining approach for precise diagnosis of dengue fever”, International Journal of Latest Trends in Engineering and Technology, Vol.7, Issue.4, pp. 352-359, 2016.

[14] Kamran Shaukat, Nayyer Masood, SundasMehreen, UlyaAzmeen, “Dengue fever prediction: a data mining problem”, Journal of Data Mining in Genomics and Proteomics, Vol.6, Issue.3, pp.1-5, 2015.

[15] Amit Palve, Rohini D.Sonawane, Amol D. Potgantwar, “Sentiment Analysis of Twitter Streaming Data for Recommendation using Apache Spark” International Journal of Scientific Research in Network Security and Communication, Vol.5, Issue.3, pp.99-103, 2017.

Downloads

Published

2019-10-31
CITATION
DOI: 10.26438/ijcse/v7i10.5763
Published: 2019-10-31

How to Cite

[1]
S. Kumari, J. K, and J. W, “Dengue Prediction Using Tweets in India”, Int. J. Comp. Sci. Eng., vol. 7, no. 10, pp. 57–63, Oct. 2019.

Issue

Section

Research Article