• EN English (UK)
inloggen

Facebook Twitter

  • Home
  • Over
    • Organisatie
      • Bestuur
      • Raad van Toezicht
      • International Advisory Panel
      • CLARIAH Centra
    • Partners
    • Internationaal
    • Wie-is-Wie
    • Geschiedenis
      • Bestaande Infrastructuur
      • Zaaigeld (2012-2014)
      • Voorstel
    • Deelnemers
      • Reisondersteuning
      • Evenement ondersteuning
      • FAQ
    • Acroniemen
    • Bestanden
      • Downloads
      • Categories
  • Werkpakketten
    • Disseminatie & Educatie
    • Techniek
    • Focusgebieden
      • Taalkunde
      • Sociaal-Economische Geschiedenis
      • Media Studies
      • Tekst
    • Formulieren
  • Projecten
    • Research Pilots
      • 2TBI
      • ACAD
      • CoDoSiS
      • CrossEWT
      • DB:CCC
      • DReAM
      • HHuCap
      • HUMIGEC
      • LinkSyr
      • M&M
      • MIMEHIST
      • NAMES
      • NarDis
      • OpenGazAm
      • ReSpoNs
      • SERPENS
      • Templaten
    • ADAH Project Call
    • DODH
    • Afgerond
      • Zaaigeld
        • CLIO-DAP
        • HLZ
        • Nederlab
        • TROVe
        • OHT
  • Disseminatie
    • Presentaties
    • CLARIAH-CORE Boek
    • Video's
    • Screencasts
    • Web Lectures
    • Folders
    • Flipboard
    • Archief
    • Maatschappelijke Impact
  • Evenementen
    • Agenda
    • CLARIAH evenementen
    • Toog-dag (2019)
    • Toog-dag 2 (2018)
    • Toog-dag 1 (2018)
    • Toog-dag (2017)
    • Tech-dag 2 (2018)
    • Tech-dag 1 (2018)
    • Tech-dag (2017)
    • Tech-dag (2016)
    • Informatiedag (2016)
    • Kickoff (2015)
    • Demodag (2013)
  • Nieuws & Blogs
    • Nieuws
    • Blogs
    • Artikelen
    • Nieuwsbrieven
  • Contact
    • Logo's
    • Persberichten
    • In de pers
      • Groene Amsterdammer
      • eData & Research
      • Overig
    • Audio

NAMES

  •   Abstract
  •   People
  •   Publications & Presentations
  •   Deliverables
  •   Other
  •   Abstract
  •   People
  •   Publications & Presentations
  •   Deliverables
  •   Other

Abstract

Dutch corpus of person name variants

 

This project aims to develop a gold standard for person name variants, mainly based on the LINKS corpus of 19/20th century person names from the vital register (63 million tokens). 25% of the 564.000 surnames and 189.000 first names have already been standardized, based on variants associated to the same individual. Expert review of this core set is necessary, however, which process will be assisted by the CLARIAH tool TICCL. This will also constitute the (statistical) learning phase of TICCL (to handle previously unseen variants), while a data structure will be established to deal with ambiguities and to accommodate different levels of standardization. In a second phase, the remaining 75% of the LINKS corpus will be standardized.
The corpus will both be delivered in RDF format for Linked Open Data, and as a lexical service. The usage of the corpus will be tested within the CLARIAH Anansi environment .

People

gerrit bloothooft

Principal Investigator

Gerrit Bloothooft
UiL-OTS Utrecht

Project Team

david onland martin reynaert katrien depuydt mathieu fannee Jauco  
David Onland
UiL-OTS Utrecht
Martin Reynaert
Tilburg School of Humanities
Katrien Depuydt
INT Leiden
Mathieu Fannee
INT Leiden
Jauco
Noordzij
Huygens-ING
 

Publications & Presentations

Uw tekst...

Deliverables

Uw tekst...

Other

Uw tekst...

subMenu

  • Research Pilots
    • 2TBI
    • ACAD
    • CoDoSiS
    • CrossEWT
    • DB:CCC
    • DReAM
    • HHuCap
    • HUMIGEC
    • LinkSyr
    • M&M
    • MIMEHIST
    • NAMES
    • NarDis
    • OpenGazAm
    • ReSpoNs
    • SERPENS
    • Templaten
  • ADAH Project Call
  • DODH
  • Afgerond
    • Zaaigeld
      • CLIO-DAP
      • HLZ
      • Nederlab
      • TROVe
      • OHT

Hoe kunnen we u helpen

  • Tools
  • Data
  • Standaarden
  • Educatie

Mede mogelijk gemaakt door

  • KNAW
  • NWO

Contact

Spinhuis - Huygens ING/CLARIAH-bureau
Oudezijds Achterburgwal 185
1012 DK AMSTERDAM

t + 31 (0)20 - 224 68 00



NWO projectnummer: 184.033.101


  • Sitemap
Creative Commons-LicentieDit werk van ©CLARIAH: Infrastructuur voor de Geesteswetenschappen, valt onder:
Creative Commons Naamsvermelding-NietCommercieel 4.0 Internationaal-licentie. 
Naar boven