Wikinformetrics: Construction and description of an open Wikipedia knowledge graph data set for informetric purposes
Metadata
Show full item recordEditorial
MIT
Materia
Altmetrics Data Informetrics Knowledge graph Metrics Wikipedia
Date
2022-12-20Referencia bibliográfica
Arroyo-Machado, W., Torres- Salinas, D., & Costas, R. (2022). Wikinformetrics: Construction and description of an open Wikipedia knowledge graph data set for informetric purposes. Quantitative Science Studies, 3(4), 931–952. [https://doi.org/10.1162/qss_a_00226]
Sponsorship
Ministry of Science and Innovation, Spain (MICINN) Spanish Government PID2019-109127RB-I00/SRA Spanish Government FPU18/05835; Reincorporation Programme for Young Researchers of the University of Granada; South African DSI-NRF Centre of Excellence in Scientometrics and Science, Technology and Innovation Policy (SciSTIP)Abstract
Wikipedia is one of the most visited websites in the world and is also a frequent subject of
scientific research. However, the analytical possibilities of Wikipedia information have not yet
been analyzed considering at the same time both a large volume of pages and attributes. The
main objective of this work is to offer a methodological framework and an open knowledge
graph for the informetric large-scale study of Wikipedia. Features of Wikipedia pages are
compared with those of scientific publications to highlight the (dis)similarities between the two
types of documents. Based on this comparison, different analytical possibilities that Wikipedia
and its various data sources offer are explored, ultimately offering a set of metrics meant
to study Wikipedia from different analytical dimensions. In parallel, a complete dedicated
data set of the English Wikipedia was built (and shared) following a relational model. Finally,
a descriptive case study is carried out on the English Wikipedia data set to illustrate the
analytical potential of the knowledge graph and its metrics.