An Experimental Study of Recall and Precision Rates in Retrieval of Text Documents Using Different Distance Measures

Authors

  • US Patki Dept. of Computer Science, NES Science College, Nanded, India
  • AB Kurhe Dept. of Computer Science, SGBS College Purna(Jn), India
  • PG Khot Dept. of Statistics, RSTM University, Nagpur, India

DOI:

https://doi.org/10.26438/ijcse/v5i12.7983

Keywords:

Text Mining, Information retrieval, distance measure, recall rate, precession rate, document

Abstract

Searching is the most important process in an information retrieval from available large databases. Many times we search for a set of documents which is relevant to the given search document. Text mining helps us to mine the information from a given set of documents and it is most popular technique in Information retrieval. In this research paper we have applied distinct distance measures for retrieval of most similar documents to the queried document from a set of given document. For obtaining optimality for required search, we have gone through pre-processing of documents, creating vector space model and used distance measure techniques. Precision and recall are the basic measures used in evaluating search strategies. We have presented five distance measure technique applied on hundred text documents from standard database 20NewsGroup and calculated Recall and precision rate for text documents retrieval. We have used MatLab 10a as a development tool for our experiment.

References

L. Kumar, et.al ,”Text Mining: Concept, Process and Applications” JGRCS,Vol-4,No.3, March 2013

K Mugunthadevi, et.al, “Survey on Feature Selection in Document Clustering” IJCSE, Vol-3,3 March 2011,

Sowmya P, et.al , “Survey On Algorithms Used for Text Document Clustering”, IJAEC Special Issue September 2016

A. Sudha Ramkumar et. al, “Text Document Clustering Using Dimension Reduction Technique”, IJAER Vol -11, November 7, 2016,

A. Singh, et.al ,”K-means with Three different Distance Metrics”, IJCA, (0975 – 8887) Volume 67– No.10, April 2013

A.Huang, ,” Similarity Measures for Text Document Clustering”, NZCSRSC 2008, April 2008, Christchurch, New Zealand

S. Goswami et.al, “A Fuzzy Based Approach To Text Mining And Document Clustering”2013,

A Text Book “ Text Mining and Application Programming” Manu Konchady ,Ed. 3 Indian Edition

Downloads

Published

2025-11-12
CITATION
DOI: 10.26438/ijcse/v5i12.7983
Published: 2025-11-12

How to Cite

[1]
U. Patki, A. Kurhe, and P. Khot, “An Experimental Study of Recall and Precision Rates in Retrieval of Text Documents Using Different Distance Measures”, Int. J. Comp. Sci. Eng., vol. 5, no. 12, pp. 79–83, Nov. 2025.

Issue

Section

Research Article