CLARIAH is a research infrastructure that aims to support research by humanities researchers. It must therefore provide facilities for the workflow in a typical research project that provides & creates its own data next to enabling access to the large standard data sets that have been gathered over the years. In such a project, the focus is on research, and data and tools are just instrumental. Data created in a typical research project are relatively small in size (compared to e.g. Nederlab). Exact maximum size of external or newly created data for which CLARIAH wants to support their incorporation in the infrastructure still has to be determined, but it will be orders of magnitude lower than the Nederlab data. For incorporation of data of the size of the Nederlab data a dedicated project is required, and probably dedicated software has to be developed.
The strategy chosen for the CLARIAH Linguistic RI is that we provide long-term data availability together with high quality metadata for a wide range of usages. While for the data consumer side (research) we provide flexible frameworks populated with tools and usage recipes directed at specific research use-cases.
|Work Package 3 (EN)|
Werkpakket 3 (WP3) concentreert zich op taalkunde en tekstuele data. De focus ligt hierbij op:
- Search applications and services for searching data and software via metadata
- Search applications and services for searching relevant linguistic phenomena in rich annotated corpora and lexicons
- Linguistic enrichements of corpora
- Interoperability (syntactic and semantic) of data and software in WP3 and of data and software from other work packages
Om de zoveel tijd zal er een update gegeven worden van de status van WP3, de zaken die gedaan zijn en de activiteiten in de (nabije) toekomst. Deze updates zullen hieronder geplaatst worden.
|NL||20-11-2015||eerste voortgangsverslag van WP3|