The TMFD dataset addresses the lack of available benchmarks for face detection in depth and thermal images. Each RGB image in the dataset is accompanied by its corresponding depth and thermal frames. The samples cover a wide range of variations: the number of people in the scene, different individuals, background types, measurement distances, illumination conditions, and accessories worn by the individuals, such as hoodies, headphones, hats, glasses, and face masks. The dataset is divided into three groups according to the difficulty of face detection.
The first group represents the easiest detection conditions: a single person in front of a simple background, with the sensor positioned close to the target. This group consists of 781 images for each modality.
The second group represents medium detection conditions. The sensor placement is the same as in the first group, but individuals wear accessories (hoodies, headphones, hats, and glasses) that act as obstacles and make detection more challenging. This group also includes images with multiple people in the scene to test multi-face detection. It contains 965 images for each modality.
The third group represents the most challenging detection conditions. The sensor is placed farther from the target, and the scenes contain various obstacles, such as computer screens, plants, desks, and chairs, with which the individuals interact. Scenes with a single person and scenes with multiple people are both included. A distinct subset of this group consists of images from a previous, easier scene in which the individuals additionally wear face masks, making the prediction task harder. This group contains 1062 images for each modality.
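The group sizes stated above can be tallied with a short sketch. The counts (781, 965, 1062) and the three modalities (RGB, depth, thermal) come from the description; the labels "easy", "medium", and "hard" and the function name are illustrative assumptions, not names used by the dataset itself.

```python
# Per-modality image counts for the three difficulty groups, as stated in the
# description. The group labels are illustrative, not official names.
SUBSET_SIZES = {"easy": 781, "medium": 965, "hard": 1062}

# Each RGB image is accompanied by a depth frame and a thermal frame.
MODALITIES = ("rgb", "depth", "thermal")


def summarize(subset_sizes=SUBSET_SIZES, modalities=MODALITIES):
    """Return (images per modality, frames over all modalities, group shares)."""
    per_modality = sum(subset_sizes.values())
    total_frames = per_modality * len(modalities)
    shares = {name: count / per_modality for name, count in subset_sizes.items()}
    return per_modality, total_frames, shares


if __name__ == "__main__":
    per_modality, total_frames, shares = summarize()
    print(f"images per modality: {per_modality}")
    print(f"frames across all modalities: {total_frames}")
    for name, share in shares.items():
        print(f"{name}: {share:.1%} of the dataset")
```

Under these figures the dataset holds 2808 images per modality, or 8424 frames across the three modalities.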
Download and Use:
This database may be used for non-commercial research purposes only. If you publish material based on this database, we request that you include a reference to:
- Mucha W., Kampel M. (2022) “Depth and Thermal Images in Face Detection – A Detailed Comparison Between Image Modalities”, The 5th International Conference on Machine Vision and Applications (ICMVA 2022), February 18-20, 2022, Singapore https://doi.org/10.1145/3523111.3523114