UB & CDH Workshop: Transkribus and Optical Character Recognition


During this entry-level demo workshop, Ruben Schalk, Information and Collection Specialist at the University Library, will demonstrate and teach how to use Optical Character Recognition software and Transkribus on example texts or material from your own research.

Thanks to improvements in AI, printed and handwritten texts can now be read almost perfectly by computers, using software for Handwritten Text Recognition (HTR) and Optical Character Recognition (OCR). This transforms archives, books, and other texts into machine-readable data that is full-text searchable and can be analyzed using, for instance, text mining or Named Entity Recognition.

In this demo and workshop, the presenter will first demonstrate how to use OCR software on printed collections and Transkribus (HTR software) on handwritten material – such as the Special Collections of the Utrecht University Library. Next, there will be a hands-on HTR/OCR session working with example texts, or material from your own research, such as photos of archival material. No prior knowledge of HTR or OCR is required.

Tip: the Centre for Digital Humanities has purchased a Transkribus subscription that gives all UU staff a substantial discount on this service.


Admission is free, but the number of participants is limited, so please register as soon as possible (first come, first served). If you are unable to attend, please cancel your registration by sending an email to CDH@uu.nl, so another participant can take your place.

Please note: the Centre for Digital Humanities aims to promote digital literacy among staff members and therefore compensates attendance at these courses in DCU (22 hours = 1 DCU). This course (including preparation) counts for 4 hours, which equals 0.18 DCU. The DCUs will be settled automatically with your department at the end of the course.