The Linguistic Teaching resources Hub
Image © Paul Watson, Licence CC BY-NC-SA 2.0

From Print and Manuscript to Electronic Version: Text Digitization and Annotation

Axel Herold, Henriette Ast

Keywords:

https://hdl.handle.net/11022/0000-0007-C2BD-9

In this course we will provide an introduction to current methods for the creation and annotation of linguistic (text) corpora. This will be done to a large extent through hands-on exercises. We primarily focus on historical prints and manuscripts, though more recent texts will also be taken into account. Participants' own material can be discussed as well. We demonstrate and discuss the steps towards the creation of corpora, focusing on text selection, treatment of metadata, transcription and representation of the material, methods of (linguistic and text-structural) annotation, and finally the presentation and provision of corpora. In this context, standards and best practices for data recognition and annotation will be introduced and applied by example, e.g. the guidelines of the Text Enc


Resource details

Institution: European Summer University of Cultures and Technology 2017
Year of publication: 2017
Language: English
Type: Course
Audience:
Level: ------
Prerequisites:
Media: text/pdf
Objective:
Licence:
Access: open
Creation date: Thursday, 19 October 2017 17:22:04
Last modified: Wednesday, 24 April 2024 21:39:13
BibTeX type: @misc
BibTeX entry:
@misc(TeLeMaCo:390,
author = "Herold, Axel and Ast, Henriette",
title = "{F}rom {P}rint and {M}anuscript to {E}lectronic {V}ersion: {T}ext {D}igitization and {A}nnotation",
year = "2017",
url = ""
)
  
  
  
