“This ground truth is muddy anyway”

Ground Truth Data Assemblages for Medical AI Development

Författare

  • Charlotte Högberg Lund University

DOI:

https://doi.org/10.37062/sf.62.27826

Nyckelord:

artificial intelligence, data, ground truth, medicine, epistemology

Abstract

This article explores assemblages of ground truth datasets for the development of medical artificial intelligence (AI). By drawing from interviews and observations, I examine how AI experts developing medical AI relate to the referential truth basis of their work, their ground truths, as an epistemic concern. By addressing how datasets are assembled from different sources, and produced, augmented and synthesised, this study shows how ground truths are valued based on humanness, quality of medical expert judgements, temporality and technical qualities. Moreover, this article analyses truth practices as productive moments in AI development, the role of human expertise and the perceived strengths and limits of expert-based annotations. The valuations of ground truths shatter the image of medical classifications, and AI models, as stable neutral entities. Moreover, this article shows how valuations of ground truths encompass more than alignment with standardised expertise. To better understand the possibilities for medical AI to live up to ideals of accuracy, fairness, trustworthiness and transparency, we need more
knowledge on assumptions, negotiations and epistemic concerns upon which medical AI is built.

Referenser

Amoore, L. (2013) The politics of possibility: Risk and security beyond probability. Duke University Press.

Amoore, L. (2020) Cloud ethics. Algorithms and the attributes of ourselves and others. Duke University Press.

Avlona, N.-R. & I. Shklovski (2024) "Torquing patients into data: Enactments of care about, for and through medical data in algorithmic systems", Information, Communication & Society 27 (4):735–757.

Bates, J. (2018) "The politics of data friction", Journal of Documentation 74 (2):412–429.

Bertelsen, P.S., C. Bossen, C. Knudsen & A. Pedersen (2024) "Data work and practices in healthcare: A scoping review", International Journal of Medical Informatics 184: 105348.

Bowker, G.C. & S.L. Star (1999) Sorting things out : classification and its consequences. MIT Press.

Campolo, A. & K. Schwerzmann (2023) "From rules to examples: Machine learning's type of authority", Big Data & Society 10 (2): 20539517231188725.

Collins, H.M. (1990) Artificial experts : Social knowledge and intelligent machines. MIT Press.

Crawford, K. (2021) Atlas of AI : power, politics, and the planetary costs of artificial intelligence. Yale University Press.

Daston, L. & P. Gailson (2007) Objectivity. New York: Zone Books.

Denton, E., A. Hanna, R. Amironesei, A. Smart & H.Nicole (2021) "On the genealogy of machine learning datasets: A critical history of ImageNet", Big Data & Society 8 (2):20539517211035955.

D'Ignazio, C. & L.F. Klein (2020) Data feminism. MIT Press.

Fine, G.A. (2006) "Ground truth: Verification games in operational meteorology", Journal of Contemporary Ethnography 35 (1):3–23.

Forsythe, D.E. (2001) Studying those who study us: An anthropologist in the world of artificial intelligence. Stanford University Press.

Fujimura, J.H. (1987) "Constructing `do-able' problems in cancer research: Articulating alignment", Social Studies of Science 17 (2):257–293.

Gitelman, L. (2013) Raw data is an oxymoron. Infrastructures Series. MIT Press. Cambridge, 1 Online Resource (203 Seiten).

Green, S. & M.N. Svendsen (2021) "Digital phenotyping and data inheritance", Big Data & Society 8 (2): 20539517211036799.

Grundmann, R. (2017) "The problem of expertise in knowledge societies", Minerva 55 (1):25–48.

Henriksen, A. & A. Bechmann (2020) "Building truths in AI: Making predictive algorithms doable in healthcare", Information, Communication & Society 23 (6):802–816.

Hoeyer, K. (2023) Data paradoxes the politics of intensified data sourcing in contemporary healthcare. MIT Press.

Jaton, F. (2017) "We get the algorithms of our ground truths: Designing referential databases in digital image processing", Social Studies of Science 47 6):811–840.

Jaton, F. (2021) The Constitution of Algorithms : Ground-truthing, programming, formulating. MIT Press.

Jaton, F. (2023) "Groundwork for AI: Enforcing a benchmark for neoantigen prediction in personalized cancer immunotherapy", Social Studies of Science 53 (5):787–810.

Kang, E.B. (2023) "Ground truth tracings (GTT): On the epistemic limits of machine learning", Big Data & Society 10 (1): 20539517221146122.

Kitchin, R. (2017) "Thinking critically about and researching algorithms", Information, Communication & Society 20 (1):14–29.

Kitchin, R. & T. Lauriault (2018) "Toward critical data studies: Charting and unpacking data assemblages and their work", 3–20 in J. Thatcher, J. Eckert & A. Shears (Eds) Thinking big data in geography: New regimes, new research. University of Nebraska Press.

Knorr-Cetina, K. (1999) Epistemic cultures : How the sciences make knowledge. Harvard University Press.

Latour, B. (1987) Science in action : How to follow scientists and engineers through society. Harvard University Press.

Latour, B. (2005) Reassembling the social: An introduction to actor-network-theory. Oxford University Press.

Lebovitz, S., N. Levina & H. Lifshitz-Assaf (2021) "Is AI ground truth really true? The dangers of traning and evaluating AI tools based on experts’ know-what", MIS Quarterly 45 (3):1501–1525.

Lee, F. (2021) "Enacting the pandemic: Analyzing agency, opacity, and power in algorithmic assemblages", Science & Technology Studies 34 (1):65–90.

Lee, F. & L. Björklund Larsen (2019) "How should we theorize algorithms? Five ideal types in analyzing algorithmic normativities", Big Data & Society 6 (2):2053951719867349.

Lee, F. & C.-F. Helgesson (2019) "Styles of valuation: Algorithms and agency in high-throughput bioscience", Science, Technology, & Human Values 45 (4):659–685.

Loukissas, Y.A. (2019) All data are local: Thinking critically in a data-driven society.MIT Press.

Mackenzie, A. (2017) Machine learners : Archaeology of a data practice. MIT Press.

Mackenzie, A., C. Waterton, R. Ellis, E. Frow, R. McNally, L. Busch & B. Wynne (2013) "Classifying, constructing, and identifying life: Standards as Ttansformations of 'The Biological'", Science, Technology, & Human Values 38 (5):701–722.

MacKenzie, D.A. (1990) Inventing accuracy : A historical sociology of nuclear missile guidance. MIT Press.

Meuser, M. & U. Nagel (2009) "The expert interview and changes in knowledge production", 17–42 in A. Bogner, B. Littig & W. Menz (Eds) Interviewing experts. London: Palgrave Macmillan.

Mol, A. (1999) "Ontological Politics: A Word and Some Questions", The Sociological Review 47(1):74–89

Mol, A. (2002) The body multiple : Ontology in medical practice. Duke University Press.

Pinch, T.J. & W.E. Bijker (1984) "The social construction+ of facts and artefacts: Or how the sociology of science and the sociology of technology might benefit each other", Social Studies of Science 14 (3):399–441.

Pink, S. & J. Morgan (2013) "Short-term ethnography: Intense routes to knowing", Symbolic Interaction 36 (3):351–361.

Rajpurkar, P., E. Chen, O. Banerjee & E. Topol (2022) "AI in health and medicine", Nature medicine 28 (1):31–38.

Ribes, D., A. Hoffman, S. Slota & G. Bowker (2019) "The logic of domains", Social Studies of Science 49 (3):281–309.

Ryan, G.W. & H.R. Bernard (2003) "Techniques to identify themes", Field Methods 15 (1):85–109.

Seaver, N. (2017) "Algorithms as culture: Some tactics for the ethnography of algorithmic systems", Big Data & Society 4(2): 2053951717738104.

Seaver, N. (2018) "What should an anthropology of algorithms do?" Cultural Anthropology 33 (3):375–385.

Shapin, S. (1994) A social history of truth : Civility and science in seventeenth-century England. University of Chicago Press.

Shapin, S. & S. Schaffer (1985) Leviathan and the air-pump : Hobbes, Boyle, and the experimental life : including a translation of Thomas Hobbes, Dialogus physicus de natura aeris by Simon Schaffer. Princeton University Press.

Siebert, J., L. Joeckel, J. Heidrich, K. Nakamichi, K. Ohashi, I. Namba, R. Yamamoto & M. Aoyama (2020) "Towards Guidelines for Assessing Qualities of Machine Learning Systems", 17–31 in M. Shepperd, F. Brito e Abreu, A. Rodrigues da Silva & R. Pérez-Castillo, R. (Eds) Quality of information and communications technology. Cham: Springer.

Thylstrup, N.B. (2022) "The ethics and politics of data sets in the age of machine learning: Deleting traces and encountering remains", Media, Culture & Society 44 (4):655–671.

Timmermans, S. & M. Berg (1997) "Standardization in action: Achieving local universality through medical protocols", Social Studies of Science 27 (2):273–305.

Timmermans, S. & M. Berg (2003) The gold standard. The challenge of evidence-based medicine. Temple University Press.

Weinkle, J. & R. Pielke (2016) "The truthiness about hurricane catastrophe models", Science, Technology, & Human Values 42 (4):547–576.

Zajac, H., N. Avlona, T. Andersen, F. Kensing & I. Shklovski (2023) "Ground truth or dare: Factors affecting the creation of medical datasets for training AI". Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, August 8–10, 2023, Palais des congrès de Montréal, Montreal, QC, Canada, pp. 351–362. Association for Computing Machinery.

Downloads

Publicerad

2025-06-12

Referera så här

Högberg, Charlotte. 2025. ”“This Ground Truth Is Muddy anyway”: Ground Truth Data Assemblages for Medical AI Development”. Sociologisk Forskning 62 (1-2):85-106. https://doi.org/10.37062/sf.62.27826.