Link-Lives, Historical Big Data: Reconstructing Millions of Life Courses from Archival Records Using Domain Experts and Machine Learning: Proceedings of Linked Archives International Workshop 2021

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Link-Lives, Historical Big Data: Reconstructing Millions of Life Courses from Archival Records Using Domain Experts and Machine Learning : Proceedings of Linked Archives International Workshop 2021. / Revuelta-Eugercios, Barbara A.; Robinson, Olivia; Løkke, Anne.

I: CEUR Workshop Proceedings, Bind 3019, 2021, s. 135-143.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Revuelta-Eugercios, BA, Robinson, O & Løkke, A 2021, 'Link-Lives, Historical Big Data: Reconstructing Millions of Life Courses from Archival Records Using Domain Experts and Machine Learning: Proceedings of Linked Archives International Workshop 2021', CEUR Workshop Proceedings, bind 3019, s. 135-143. <https://ceur-ws.org/Vol-3019/LinkedArchives_2021_paper_9.pdf>

APA

Revuelta-Eugercios, B. A., Robinson, O., & Løkke, A. (2021). Link-Lives, Historical Big Data: Reconstructing Millions of Life Courses from Archival Records Using Domain Experts and Machine Learning: Proceedings of Linked Archives International Workshop 2021. CEUR Workshop Proceedings, 3019, 135-143. https://ceur-ws.org/Vol-3019/LinkedArchives_2021_paper_9.pdf

Vancouver

Revuelta-Eugercios BA, Robinson O, Løkke A. Link-Lives, Historical Big Data: Reconstructing Millions of Life Courses from Archival Records Using Domain Experts and Machine Learning: Proceedings of Linked Archives International Workshop 2021. CEUR Workshop Proceedings. 2021;3019:135-143.

Author

Revuelta-Eugercios, Barbara A. ; Robinson, Olivia ; Løkke, Anne. / Link-Lives, Historical Big Data: Reconstructing Millions of Life Courses from Archival Records Using Domain Experts and Machine Learning : Proceedings of Linked Archives International Workshop 2021. I: CEUR Workshop Proceedings. 2021 ; Bind 3019. s. 135-143.

Bibtex

@article{cc678bfcba21466a9569389e10c56ca0,
title = "Link-Lives, Historical Big Data: Reconstructing Millions of Life Courses from Archival Records Using Domain Experts and Machine Learning: Proceedings of Linked Archives International Workshop 2021",
abstract = "The Danish archives comprise some of the world{\textquoteright}s most comprehensive source coverage but despite large-scale digitization and transcription projects by diverse actors, there are no shared standards or possibilities for data linkage. The Denmark-based Link-Lives research project (2019-2024) is tackling this disparity by linking individual-level Danish records in census and parish record sources from 1787-1968 to create a multigenerational database for research using a combination of domain expertise and machine learning techniques. In contrast to small-sample linking or fully automated processes, Link-Lives is creating its own manually-linked data to train machine learning as well as exploring the impacts of different approaches to linking. Due to personal data protection legislation and propriety agreements, the data cannot be fully open access, but data outputs will be made available to both researchers and the general public via a website. The project{\textquoteright}s interdisciplinary team is based at the Danish National Archives and the University of Copenhagen, in partnership with Copenhagen City Archives, and funded by Carlsberg and Innovation Fund Denmark.",
author = "Revuelta-Eugercios, {Barbara A.} and Olivia Robinson and Anne L{\o}kke",
year = "2021",
language = "English",
volume = "3019",
pages = "135--143",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "ceur workshop proceedings",

}

RIS

TY - JOUR

T1 - Link-Lives, Historical Big Data: Reconstructing Millions of Life Courses from Archival Records Using Domain Experts and Machine Learning

T2 - Proceedings of Linked Archives International Workshop 2021

AU - Revuelta-Eugercios, Barbara A.

AU - Robinson, Olivia

AU - Løkke, Anne

PY - 2021

Y1 - 2021

N2 - The Danish archives comprise some of the world’s most comprehensive source coverage but despite large-scale digitization and transcription projects by diverse actors, there are no shared standards or possibilities for data linkage. The Denmark-based Link-Lives research project (2019-2024) is tackling this disparity by linking individual-level Danish records in census and parish record sources from 1787-1968 to create a multigenerational database for research using a combination of domain expertise and machine learning techniques. In contrast to small-sample linking or fully automated processes, Link-Lives is creating its own manually-linked data to train machine learning as well as exploring the impacts of different approaches to linking. Due to personal data protection legislation and propriety agreements, the data cannot be fully open access, but data outputs will be made available to both researchers and the general public via a website. The project’s interdisciplinary team is based at the Danish National Archives and the University of Copenhagen, in partnership with Copenhagen City Archives, and funded by Carlsberg and Innovation Fund Denmark.

AB - The Danish archives comprise some of the world’s most comprehensive source coverage but despite large-scale digitization and transcription projects by diverse actors, there are no shared standards or possibilities for data linkage. The Denmark-based Link-Lives research project (2019-2024) is tackling this disparity by linking individual-level Danish records in census and parish record sources from 1787-1968 to create a multigenerational database for research using a combination of domain expertise and machine learning techniques. In contrast to small-sample linking or fully automated processes, Link-Lives is creating its own manually-linked data to train machine learning as well as exploring the impacts of different approaches to linking. Due to personal data protection legislation and propriety agreements, the data cannot be fully open access, but data outputs will be made available to both researchers and the general public via a website. The project’s interdisciplinary team is based at the Danish National Archives and the University of Copenhagen, in partnership with Copenhagen City Archives, and funded by Carlsberg and Innovation Fund Denmark.

M3 - Journal article

VL - 3019

SP - 135

EP - 143

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

ER -

ID: 324598873