Discoveries of Research Genealogy from Large-Scale Academic Dataset: Issues, Challenges and Application
Keywords:
Genealogy tree, Author name disambiguation, CitationAbstract
Genealogical research is the tracing of an individual’s ancestral history using historical records, both official and unofficial. Challenges about genealogy problem like spelling names, legacy of a researcher can be measured not only in terms of his/her publications and scientific discoveries, in terms of the formation of other researchers. Now, research work is improving than oldest research. So population of researcher and scientist is increasing rapidly and it was more important now a days that to finding out who is better among all researcher. Author ranking can be solved this problem. Author ranking will not be perfect due to some causes, like naming disambiguation problem and uses of multiple name in paper. In Academic genealogy, is the relationship between advisor and advisee. Research area of advisor is more popular than his advisee research area may be good. From there we can do future prediction of an author. Another problem of author name disambiguity can be solved using genealogy tree hierarchy, as there are less chances of conflict in identifying an author based on his unique academic records. Another important challenge is that how much level (generation) we can visit from the genealogy tree. From the big dataset, we extract different metrics for an author. In this paper, we extract data of a particular author and from there we have analyze effects of an author rank.
References
[1] Mehmet Ali Abdulhayoglu and Bart Thijs. 2017. Use of ResearchGate and Google CSE for author name disambiguation. Scientometrics 111, 3 (2017), 1965–1985.
[2] Wellington Dores, Fabrício Benevenuto, and Alberto HF Laender. 2016. Extracting academic genealogy trees from the networked digital library of theses and dissertations. In Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries. ACM, 163–166.
[3] Janaína Gomide, Hugo Kling, and Daniel Figueiredo. 2017. Name usage pattern in the synonym ambiguity problem in bibliographic data. Scientometrics 112, 2 (2017), 747–766.
[4] Rasmus AX Persson. 2017. Bibliometric author evaluation through linear regression on the coauthor network. Journal of Informetrics 11, 1 (2017), 299–306.
[5] Luciano Rossi, Rafael JP Damaceno, Igor L Freire, Etelvino JH Bechara, and Jesús P Mena-Chalco. 2018. Topological metrics in academic genealogy graphs. Journal of Informetrics 12, 4 (2018), 1042–1058.
[6] Luciano Rossi, Igor L Freire, and Jesús P Mena-Chalco. 2017. Genealogical index: A metric to analyze advisor–advisee relationships. Journal of Informetrics 11, 2 (2017), 564–582.
[7] Min Song, Erin Hea-Jin Kim, and Ha Jin Kim. 2015. Exploring author name disambiguation on PubMed-scale. Journal of informetrics 9, 4 (2015), 924–941.
[8] Besiki Stvilia, Charles C Hinnant, Shuheng Wu, Adam Worrall, Dong Joon Lee, Kathleen Burnett, Gary Burnett, Michelle M Kazmer, and Paul F Marty. 2017. Toward collaborator selection and determination of data ownership and publication authorship in research collaborations. Library & Information Science Research 39, 2 (2017), 85–97.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
