Schedule
Date | Lecture | Topics | ||
---|---|---|---|---|
8/30 |
Lecture 1.1: Course introduction [ slides | video ] |
Multimodal core challenges |
||
9/1 |
Lecture 1.2: Multimodal applications and datasets [ slides | video ] |
Research tasks and datasets |
||
9/6 |
Lecture 2.1: Basic concepts: neural networks [ slides | video ] |
Gradient and optimization |
||
9/8 |
Lecture 2.2: Unimodal representations [ slides | video ] |
Dimensions of heterogeneity |
||
9/13 |
Lecture 3.1: Unimodal representations [ slides | video ] |
Language representations |
||
9/15 |
Lecture 3.2: Multimodal representations [ slides | video ] |
Cross-modal interactions |
||
9/20 |
Lecture 4.1: Multimodal representations [ slides | video ] |
Coordinated representations |
||
9/22 |
Lecture 4.2: Multimodal alignment and grounding [ slides | video ] |
Explicit alignment |
||
9/27 | Lecture 5.1: Project Hours (live working sessions instead of lectures) | |||
9/29 |
Lecture 5.2: Aligned representations [ slides | video ] |
Self-attention transformer models |
||
10/4 |
Lecture 6.1: Multimodal aligned representations [ slides | video ] |
Multimodal transformers |
||
10/6 |
Lecture 6.2: Multimodal Reasoning [ slides | video ] |
Structured and hierarchical models |
||
10/11 |
Lecture 7.1: Multimodal Reasoning [ slides | video ] |
Reinforcement learning |
||
10/13 |
Lecture 7.2: Multimodal Reasoning [ slides | video ] |
Logical and causal inference |
||
10/18 | Lecture 8.1: Fall Break – No lectures |
|||
10/20 | Lecture 8.2: Fall Break – No lectures |
|||
10/25 |
Lecture 9.1: Generation [ slides | video ] |
Translation, summarization, creation |
||
10/27 |
Lecture 9.2: Generation [ slides | video ] |
GANs and diffusion models |
||
11/1 |
Lecture 10.1: Project Hours [ slides | video ] |
|||
11/3 |
Lecture 10.2: Project Hours [ slides | video ] |
|||
11/8 |
Lecture 11.1: Transference [ slides | video ] |
Modality transfer |
||
11/10 |
Lecture 11.2: Quantification [ slides | video ] |
Heterogeneity and interactions |
||
11/15 |
Lecture 12.1: Project Hours [ slides | video ] |
Project Hours |
||
11/17 |
Lecture 12.2: New research directions [ slides | video ] |
Recent approaches in multimodal ML |
||
11/22 |
Lecture 13.1: Thanksgiving Week – No Class [ slides | video ] |
|||
11/24 |
Lecture 13.2: Thanksgiving Week – No Class [ slides | video ] |
|||
11/30 |
Lecture 14.1: Language, Vision, and Actions [ slides | video ] |
Motion and navigation |
||
12/2 |
Lecture 14.2: Multimodal Applications [ slides | video ] |
Healthcare and affective computing |
||
12/6 | Lecture 15.1: Final project assignment (live working sessions instead of lectures) | |||
12/8 | Lecture 15.2: Final project assignment (live working sessions instead of lectures) |