TriModal Face Detection Dataset – TMFD Dataset

The TMFD dataset addresses the lack of available depth and thermal face detection benchmarks. Each RGB image in the dataset is accompanied by its corresponding depth and thermal frames. The dataset covers a wide range of variations: the samples include different numbers of people in the scene, different individuals, various types of backgrounds, varying distances of measurement,

HISTORIAN: a large-scale HISTORIcal film dataset with cinematographic ANnotation

Developing automated tools for the sustainable preservation of extensive historical film collections requires an understanding of fundamental cinematographic settings. To investigate new approaches for detecting and classifying these settings, this paper proposes a novel large-scale historical film dataset with cinematographic annotations (HISTORIAN), i.e., shot boundaries, shot types, and camera movements.

MIPT Dataset

Human behavioral analysis applications in the fields of ambient assisted living (AAL) and human security monitoring require continuous video analysis of individuals. Although intelligent systems deployed in these areas are intended to have a positive impact on the persons involved, the resulting continuous monitoring naturally raises ethical concerns and questions about privacy implications. To address these

SALAMI Dataset

Subjective Assessments of Legibility in Ancient Manuscript Images. The dataset consists of 250 images of historic manuscripts, paired with spatial maps of human legibility. These legibility maps were created in a study with 20 experts in philology and paleography. Dataset: Publication:

IPT Dataset

The IPT (Identity Preserved Tracking) dataset consists of 10 sequences of depth data recorded using an Orbbec Astra depth sensor. It features sequences in ten different locations with a high amount of background variation and is designed to be applicable to a wide range of tasks. Its labeling is

SDT Dataset

The Synthetic Depth & Thermal (SDT) dataset consists of 40k synthetic and 8k real depth and thermal stereo images, depicting human behavior in indoor environments. Included samples show uniquely posed lying, sitting and standing persons within four different room types (living room, bedroom, bathroom and kitchen),

PaCaBa – Parking Cars Barcelona Dataset

The PaCaBa (Parking Cars Barcelona) dataset is a WorldView-3 stereo satellite image dataset with labeled parking cars. It consists of three parts: (1) Raw GeoTIFF images with polygon annotations of cars. (2) Image patches of size 540×540 with rotated bounding box annotations of parking cars. This part is suitable for training and testing of a parking cars

MSBin – MultiSpectral Document Binarization

MSBin stands for MultiSpectral Document Binarization. The dataset is dedicated to the binarization of multispectral document images. A description of the dataset is given at The dataset can be downloaded from Zenodo: The dataset is introduced in: Fabian Hollaus, Simon Brenner, Robert Sablatnig: CNN Based Binarization of MultiSpectral