Technical Report - Detecting Federally Funded Research and Development Trends Using Machine Learning and Information Retrieval Methods

Report
Authors:Linehan, Kathryn, PV-Biocomplexity InitiativeUniversity of Virginia ORCID icon orcid.org/0000-0001-9012-6261Oh, Eric, PV-Biocomplexity InitiativeUniversity of Virginia Thurston, Joel, PV-Biocomplexity InitiativeUniversity of Virginia ORCID icon orcid.org/0000-0002-3923-9065SIWE, Guy Leonel, PV-Biocomplexity InitiativeUniversity of Virginia ORCID icon orcid.org/0000-0002-9275-6416Garrett, MadelineUniversity of Colorado Keller, Sallie, PV-Biocomplexity InitiativeUniversity of Virginia ORCID icon orcid.org/0000-0001-7303-7267Shipp, Stephanie, PV-Biocomplexity InitiativeUniversity of Virginia ORCID icon orcid.org/0000-0002-2142-2136Kindlon, Audrey, National Center for Science and Engineering StatisticsNational Science Foundation Jankowski, John, National Center for Science and Engineering StatisticsNational Science Foundation
Abstract:

Federal RePORTER, a recently retired federally funded research and development (R&D) grant database, contained a vast amount of information on federally funded R&D and was utilized by researchers, citizens, and policymakers alike to uncover insights. In this report, we provide a classification of research topics contained within Federal RePORTER project abstracts, as well as trends in these topics over time, using natural language processing (NLP) and machine learning techniques. In collaboration with the National Center for Science and Engineering Statistics (NCSES), we examined how the topics and their trends change as a result of the number of topics produced by the model. In addition, we utilized information retrieval techniques to find theme-related topics and their trends over time. This is realized through two case-studies, the first using the theme of pandemics and the second using the theme of coronavirus.

Contributor:Lyman, Kimberly, PV-Biocomplexity InitiativeUniversity of Virginia
Language:
English
Source Citation:

Linehan K, Oh E, Thurston J, Siwe GL, Garrett M, Keller S, Shipp S, Kindlon A, Jankowski, J. Technical Report - Detecting Federally Funded Research and Development Trends Using Machine Learning and Information Retrieval Methods, Technical Report. BI-2022-1531. Proceedings of the Biocomplexity Institute. University of Virginia; 2022 May. DOI: https://doi.org/10.18130/4c3g-k017.

Publisher:
University of Virginia
Published Date:
05/20/2022