This lecture covers Deep Learning for automatic image and video analysis, such as classifying images into categories or detecting and distinguishing persons. Deep Learning has lead to breakthroughs in these fields; in certain problems, the performance of current methods based on this technology is similar or even better than that of humans – a novelty in Computer Vision.
The goal of this lecture is to provide an introduction to this exciting branch of machine learning. Due to the ongoing pandemic, the lecture will be held online (details will be announced soon).
You will apply what you’ve learned in the exercise part of this course, which consists of several assignments that must be handed in by each group (two students). You can work on these assignments on your own computer if you have a decent GPU with CUDA support. We will provide remote access to a dedicated GPU server for those who don’t. Alternatively, you can use services like AWS SageMaker or Google AI Platform, which have free tiers.
This is a course for Master’s students, so students are expected to have basic knowledge of mathematics and statistics, image processing, and machine learning. We will briefly recap some basics as part of the first few lectures.
For the exercise part, students must be able to program in Python 3.
There will be a written exam that covers the lecture part (50% of the grade). The exercise part is also worth 50% of the grade.
Due to the ongoing pandemic, exams will be held upon request.