OPERANDI – OCR-D Performance Optimisation and Integration - Project details (OPERANDI)
OPERANDI is one of the implementation projects funded by the DFG initiative OCR-D. The main goal of OCR-D is the conceptual and technical preparation of the full text transformation of the 16th to 18th century prints published in the German language area. The task of automatic full-text recognition will be broken down into its individual process steps, which can be reproduced in the open source OCR-D software. This makes it possible to create optimal workflows for the prints to be processed and thus generate scientifically usable full texts.
The goal of OPERANDI is to develop and build an OCR-D based implementation package for mass full-text capture with improved throughput, while improving the quality of the results. At the same time, the goal is that the implementation package can also be used by other projects and institutions with comparable requirements. Two scenarios were identified during the pilot. In the first scenario, OCR generation is to take place for works that have already been digitized, resulting in mass full-text capture. In the second scenario, OCR generation for new works to be digitized will take place as part of the digitization process.
Affiliates
- Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen
- Göttingen State and University Library
Project board at the SUB Göttingen
Project staff at the SUB Göttingen
- Kay Liewald
- Jörg-Holger Panzer
- Lilja Mareike Sautter
- Kristine Schima-Voigt
- Jana Schmidt-Tesch
- Yasin Onur Yücel
SUB Göttingen departments / units involved in the project
Former project staff of the SUB Göttingen
Manikanth Dindigala, Claudia Malzer, Adnan Siddique