FACTors: a new dataset for studying the fact-checking ecosystem
Conference paper
Altuncu, E., Baskent, C., Bhattacherjee, S., Li, S. and Roy, D. 2025. FACTors: a new dataset for studying the fact-checking ecosystem. 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. Padua, Italy 13 - 18 Jul 2025 https://doi.org/10.1145/3726302.3730339
Type | Conference paper |
---|---|
Title | FACTors: a new dataset for studying the fact-checking ecosystem |
Authors | Altuncu, E., Baskent, C., Bhattacherjee, S., Li, S. and Roy, D. |
Abstract | Our fight against false information is spearheaded by fact-checkers. They investigate the veracity of claims and document their findings as fact-checking reports. With the rapid increase in the amount of false information circulating online, the use of automation in fact-checking processes aims to strengthen this ecosystem by enhancing scalability. Datasets containing fact-checked claims play a key role in developing such automated solutions. However, to the best of our knowledge, there is no fact-checking dataset at the ecosystem level, covering claims from a sufficiently long period of time and sourced from a wide range of actors reflecting the entire ecosystem that admittedly follows widely-accepted codes and principles of We present a new dataset FACTors, the first to fill this gap by presenting ecosystem-level data on fact-checking. It contains 118,112 claims from 117,993 fact-checking reports in English (co-)authored by 1,953 individuals and published during the period of 1995-2025 by 39 fact-checking organisations that are active signatories of the IFCN (International Fact-Checking Network) and/or EFCSN (European Fact-Checking Standards Network). It contains 7,327 overlapping claims investigated by multiple fact-checking organisations, corresponding to 2,977 unique claims. It allows to conduct new ecosystem-level studies of the fact-checkers (organisations and individuals). To demonstrate the usefulness of our dataset, we present three example applications. They include a first-of-its-kind statistical analysis of the fact-checking ecosystem, examining the political inclinations of the fact-checking organisations, and attempting to assign a credibility score to each organisation based on the findings of the statistical analysis and political leanings. Our methods for constructing FACTors are generic and can be used to maintain a live dataset that can be updated dynamically. |
Keywords | False information; misinformation; disinformation; fact-checking; dataset; resources; ecosystem |
Sustainable Development Goals | 9 Industry, innovation and infrastructure |
Middlesex University Theme | Sustainability |
Conference | 48th International ACM SIGIR Conference on Research and Development in Information Retrieval |
ISBN | 9798400715921 |
Publication process dates | |
Accepted | May 2025 |
Deposited | 05 Jun 2025 |
Output status | Accepted |
Accepted author manuscript | File Access Level Open |
Digital Object Identifier (DOI) | https://doi.org/10.1145/3726302.3730339 |
https://repository.mdx.ac.uk/item/25w4zz
Restricted files
Accepted author manuscript
3
total views1
total downloads3
views this month1
downloads this month