DI4DH: Digitization and Information processing for Digital Humanities
Digitizing content from historical sources is the first fundamental part of Digital Humanities. The automatic data enrichment is as important since it guarantees standardized research. Hence, the digitization and data enrichment are closely linked and form the basis of today’s Digital Humanities. This project is fulfilled in collaboration with the READ-COOP SCE.
Internships:
Several bachelor and master works can be fulfilled within this project. Below is a list of possible projects, but other project ideas are also possible (feel free to contact us):
- Android development
- Document page dewarping
- Handwritten Text Recognition
- Text line detection
- Enhancement of document images
- Named Entity Recognition
We develop a mobile scanning device that enables historians to digitize archival documents on the go. A ScanTent was developed within the framework of the READ project. The tent is ideal for document scanning because it blocks ambient light an functions as mount for any smartphone. An scanning app is developed as part of this DI4DH project, that empowers the ScanTent. DocScan automatically detects documents in video live streams and checks if the camera settings are ideal for document scanning. It further detects page turns and shoots pages automatically thereafter.