Graduate-level course at ETH Zurich in Autumn Semester 2024

Lecturer: Christos Sakaridis.
6 ECTS. Class size limited to 90 students.

ETH Course Catalogue

Lecture Team


Christos

Luigi

Tim

Siyuan

Lectures

| Date | Time | Room | Topic |
|---|---|---|---|
| 20.09.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Fundamentals of Autonomous Cars |
| 27.09.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Fundamental Computer Vision Architectures and Algorithms for Autonomous Cars |
| 04.10.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Fundamental Computer Vision Architectures and Algorithms for Autonomous Cars (continued) |
| 11.10.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Semantic Segmentation |
| 18.10.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Depth Estimation |
| 25.10.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Object Detection |
| 01.11.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Instance Segmentation and Panoptic Segmentation |
| 08.11.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Unimodal 3D Object Detection |
| 15.11.2024 | | | No lecture - CVPR conference deadline |
| 22.11.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | 3D Reconstruction and Localization (last updated 25.11.2024) |
| 29.11.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Domain Adaptation |
| 06.12.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Multi-modal 2D and 3D Object Detection |
| 13.12.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Visual Grounding, Anomaly Segmentation and Vehicle-to-Vehicle Communication |
| 20.12.2024 | 14:15 - 17:00 | HG D 5.2, Zoom | Multiple Object Tracking and Motion Prediction |

Practical Sessions

| Date | Time | Room | Topic |
|---|---|---|---|
| 20.09.2024 | | | No practical session |
| 27.09.2024 | | | No practical session |
| 04.10.2024 | 10:15 - 12:00 | Online | Getting Started with Python and SLURM |
| 11.10.2024 | 10:15 - 12:00 | Online | Project 1 Introduction: Semantic Segmentation and Depth Estimation |
| 18.10.2024 | 10:15 - 12:00 | Online | Project 1: Attention Mechanisms and Transformers |
| 25.10.2024 | 10:15 - 12:00 | Online | Project 1: Q&A |
| 01.11.2024 | 10:15 - 12:00 | Online | Project 1: Q&A |
| 08.11.2024 | 10:15 - 12:00 | Online | Project 1: Q&A |
| 15.11.2024 | 10:15 - 12:00 | Online | Project 2 Introduction: 3D Detection from Point Clouds |
| 22.11.2024 | 10:15 - 12:00 | Online | Project 2 Introduction: 3D Detection from Point Clouds |
| 29.11.2024 | 10:15 - 12:00 | Online | Project 2 Q&A |
| 06.12.2024 | 10:15 - 12:00 | Online | Project 2 Q&A |
| 13.12.2024 | 10:15 - 12:00 | Online | Project 2 Q&A |
| 20.12.2024 | 10:15 - 12:00 | Online | Project 2 Q&A |

Forum

We use Piazza as the forum for discussion among students and with the teaching team. You can post private messages, but anonymous messages are not allowed.

Abstract

This course introduces the core computer vision techniques and algorithms that autonomous cars use to perceive the semantics and geometry of their driving environment, localize themselves in it, and predict its dynamic evolution. Emphasis is placed on techniques tailored for real-world settings, such as multi-modal fusion, domain-adaptive and outlier-aware architectures, and multi-agent methods.

Objective

Students will learn about the fundamentals of autonomous cars and of the computer vision models and methods these cars use to analyze their environment and navigate within it. Students will be presented with state-of-the-art representations and algorithms for semantic, geometric and temporal visual reasoning in automated driving and will gain hands-on experience in developing computer vision algorithms and architectures for solving such tasks.
After completing this course, students will be able to:

understand the operating principles of visual sensors in autonomous cars,
differentiate between the core architectural paradigms and components of modern visual perception models and describe their logic and the role of their parameters,
systematically categorize the main visual tasks related to automated driving and understand the primary representations and algorithms which are used for solving them,
critically analyze and evaluate current research in the area of computer vision for autonomous cars,
practically reproduce state-of-the-art computer vision methods in automated driving,
independently develop new models for visual perception.

Content

The lectures cover the following topics:

Fundamentals
1. Fundamentals of autonomous cars and their visual sensors
2. Fundamental computer vision architectures and algorithms for autonomous cars
Semantic perception
1. Semantic segmentation
2. Object detection
3. Instance segmentation and panoptic segmentation
Geometric perception and localization
1. Depth estimation
2. 3D reconstruction
3. Visual localization
4. Unimodal visual/lidar 3D object detection
Robust perception: multi-modal, multi-domain and multi-agent methods
1. Multi-modal 2D and 3D object detection
2. Visual grounding and verbo-visual fusion
3. Domain-adaptive and outlier-aware semantic perception
4. Vehicle-to-vehicle communication for perception
Temporal perception
1. Multiple object tracking
2. Motion prediction

Projects

The practical projects involve implementing complex computer vision architectures and algorithms and applying them to real-world, multi-modal driving datasets. In particular, students will develop models and algorithms for:

Semantic segmentation and depth estimation,
3D object detection using LiDARs.

Prerequisites

Students are expected to have a solid basic knowledge of linear algebra, multivariate calculus, and probability theory, and a basic background in computer vision and machine learning. All practical projects require a solid background in programming and are based on Python and Python libraries such as PyTorch, scikit-learn and scikit-image.

Exam

Examiners: Christos Sakaridis

A session examination is offered. The mode of the exam is written and its duration is 120 minutes. The language of examination is English. The performance assessment is only offered in the session after the course unit. Repetition is only possible after re-enrolling.
The final grade will be calculated from the session examination grade and the overall projects grade, with each of the two elements weighing 50%. The projects are an integral part of the course: they are group-based and their completion is compulsory. Receiving a failing overall projects grade results in a failing final grade for the course. Students who do not pass the projects are required to de-register from the exam.
Written aids for the final exam: one A4 sheet of paper and a simple non-programmable calculator.
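
As an illustration only, the grading rule described above can be sketched in a few lines of Python. The names `compute_final_grade`, `CourseResult` and `PASS_MARK` are hypothetical. The sketch assumes the usual ETH scale from 1 to 6 with 4.0 as the passing mark, which this page does not state, and it applies no rounding because no rounding rule is given here. Since the numeric value of a final grade failed through the projects is not specified, the sketch leaves it unset.

```python
from typing import NamedTuple, Optional

# Assumption: grades on a 1-6 scale with 4.0 as the passing mark.
PASS_MARK = 4.0


class CourseResult(NamedTuple):
    passed: bool
    final_grade: Optional[float]  # None when no numeric value is defined


def compute_final_grade(exam_grade: Optional[float],
                        projects_grade: float) -> CourseResult:
    """Combine exam and projects grades with equal 50% weights."""
    # A failing overall projects grade fails the course outright;
    # such students de-register from the exam, so exam_grade may be None.
    if projects_grade < PASS_MARK:
        return CourseResult(passed=False, final_grade=None)
    if exam_grade is None:
        raise ValueError("an exam grade is required once the projects are passed")
    # Equal weighting of the two elements; no rounding is applied here.
    final = 0.5 * exam_grade + 0.5 * projects_grade
    return CourseResult(passed=final >= PASS_MARK, final_grade=final)
```

For example, under these assumptions an exam grade of 5.0 and a projects grade of 4.5 combine to 4.75, while a projects grade of 3.5 fails the course regardless of the exam.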

A short mock exam with representative sample multiple-choice and true-false questions is available below, without and with solutions, for practice. This mock exam is shorter than the actual exam, and its length is not representative of it. Questions on the solutions of the mock exam will be discussed in the lecture of 06.12.2024.

Mock Exam without Solutions

Mock Exam with Solutions

Computer Vision and Artificial Intelligence for Autonomous Cars 2024
