Skip to content

CS479: Machine Learning for 3D Data

Minhyuk Sung, KAIST, Fall 2023


Teaser

Time & Location

Time: Mon/Wed 10:30am - 11:45am (KST)
Location: Online via Zoom

Zoom Link

Description

3D Data (both 3D scans captured by depth sensors and 3D models created by designers) are widely used in many applications in computer vision, computer graphics, and robotic, such as autonomous driving, AI-assisted 3D object/scene design, augmented reality, and physical robot interaction. Along with the recent increasing demands on processing and analyzing such 3D data, there has been tremendous progress in developing novel technologies, especially based on deep learning. In this course, we will cover the recent advances in machine learning techniques for 3D data and also discuss the remaining challenges. Most of the course material will be less-than-5-year-old research papers in CVPR/ICCV/ECCV (Vision), SIGGRAPH/SIGGRAPH Asia (Graphics), and NeurIPS/ICML (Machine Learning). The course will be project-oriented (no exam, no paper-and-pencil homework, but easy programming assignment), and it will combine pedagogical lectures and seminar-style reading group presentations (followed by interactive discussions).

Prerequisites

This course is intended for undergraduate/master students who have a basic background in deep learning and experience with PyTorch.

Course Staff

Instructor: Minhyuk Sung (mhsung@kaist.ac.kr)

Course Assistants:

Office hours (Offline): Mon 7:00pm (KST). N1 Rm 601.

Past Years

Grading

  • Programming Assignments: 30%
  • Project: 50%
  • Paper/Project Reviews: 10%
  • In-Class Participation: 10%

AI Coding Assistant Tool Policy

You are allowed (and even encouraged) to utilize AI coding assistant tools, such as ChatGPT, Copilot, Codex, and Code Intelligence, for your programming assignments and projects. Utilizing AI coding assistant tools will not be deemed as plagiarism. However, it is still strictly prohibited to directly copy code from the Internet or from someone else. Doing so will lead to a score of zero and a report to the university.

Important Dates

ALL ASSIGNMENTS ARE DUE 23:59 KST. (Subject to Change)

  • Project Sign-Up: Due Sep 8 (Fri)
  • 1st Programming Assignment: Due Sep 17 (Sun)
  • Project Proposal: Due Sep 24 (Sun)
  • 2nd Programming Assignment: Due Oct 9 (Mon)
  • Project Mockup: Due Oct 9 (Mon)
  • Project Pitch Video: Due Oct 20 (Fri)
  • Project Interim Report 1: Due Nov 5 (Sun)
  • Paper Review Questions: Due Nov 12 (Sun)
  • 3rd Programming Assignment: Due Nov 12 (Sun)
  • Project Interim Report 2: Due Nov 19 (Sun)
  • Paper Review Answers: Due Nov 26 (Sun)
  • Project Poster: Due Nov 29 (Wed)
  • Project Report/Code: Due Dec 3 (Sun)
  • Project Review: Due Dec 9 (Sat)
  • Project Rebuttal: Due Dec 13 (Wed)

Paper List

Paper List Link

Schedule

(Subject to Change)

Week Mon Topic Wed Topic
1 Aug 28 Course Introduction
Slides
Recording
Aug 30 3D Representations 1
Slides
Recording
2 Sep 04 3D Representations 2
Slides
Recording
Sep 06 3D Representations 3
Slides
Recording
3 Sep 11 Point Cloud Encoders
Slides
Recording
Sep 13 Point Cloud Generation
Slides
Recording
4 Sep 18 Implicit Neural Representations
Slides
Recording
Sep 20 Structure from Motion 1
Slides
Recording
5 Sep 25 Structure from Motion 2 /
Neural Rendering 1

Slides
Recording
Sep 27 Neural Rendering 2
Recording
6 Oct 02 No Class (Substitute Holiday) Oct 04 No Class (Conference Trip)
7 Oct 09 No Class (Hangul Day) Oct 10 Guest Lecture 1
Niloy J. Mitra
Professor at UCL
Oct 10 (Tue) 4:00 p.m.
Offline (E3-1, Rm 1101)
8 Oct 16 No Class (Midterm Week) Oct 18 No Class (Midterm Week)
9 Oct 23 Project Pitches
Video Compilation
Oct 25 Hybrid Representations 1
Slides
Recording
10 Oct 30 Hybrid Representations 2
Recording
Nov 01 Diffusion Models 1
Slides
Recording
12 Nov 06 Diffusion Models 2
Slides
Recording
Nov 08 Conditional Generation
Slides
Recording
11 Nov 13 3D Generation
Slides
Recording
Nov 15 No Class (Break)
13 Nov 20 3D Detection/Segmentation
Slides
Recording
Nov 22 Guest Lecture 2
Jun Gao
Research Scientist at
NVIDIA Toronto AI Lab
Nov 22 (Wed) 10:30 a.m.
Online (Zoom)
14 Nov 27 Rotation Invariance/Equivariance
Slides
Recording
Nov 29 No Class (Undergraduate Admission Interviews)
15 Dec 04 Poster Presentations 1 Dec 06 Poster Presentations 2
16 Dec 11 No Class (Final Week) Dec 13 No Class (Final Week)