Schedule Fall 2024
Speech2text
In the first block of the semester, we will focus on automatic speech recognition systems (ASR). In other words, systems that can map speech in form an audio signal to text.
- W38 Waw2vec (FAIR / 2019): Unsupervised Pretraining for Speech Recognition
- W39 Conformer (Google / 2020): Convolution-augmented Transformer for Speech Recognition
- W40 Whisper (OpenAI / 2022): Robust Speech Recognition via Large-Scale Weak Supervision
Transformers
Before the Autumn break, we will make a quick stop at the transformers. We will go through the original paper and discuss the key ideas behind the transformer architecture which is omnipresent across all ML fields nowadays.
- W41 Transformers (Google / 2017): Attention is All You Need
Graph Neural Networks
In this block, we venture into graph neural networks and their application in football tactics.
- W43 GNNs Intro (Deepmind / 2023): Everything is Connected
- W44 GANs (UofCambridge / 2018): Graph Attention Networks
- W45 Tactic AI (Deepmind / 2023): AI assistant for football tactics
3D reconstruction
This block covers Neural Radiance Fields (NeRF), a major breakthrough in 3D reconstruction. In the first session, we will review the original paper, and in the second, we will explore training and implementation details.
- W46 NeRF (UC Berkley / 2020): Representing Scenes as Neural Radiance Fields for View Synthesis
- W47 PyTorch3D (Meta / 2021): Going over the implementation and training details of NeRF
Industry talks
-
W48 Gustav Hansen (ML Researcher at Veo Technologies)
Gustav will present results of his master thesis at DTU titled Representation Learning Techniques for Sequence Data in Football Game Dynamics which he has done in collaboration with Veo. Being able to efficiently encode and represent football game dynamics is crucial for many downstream tasks such as action recognition, event detection, and tactical analysis.
-
W49 Frederik Warburg (Head of AI at Teton)
Frederik will talk about Teton's AI-powered caregiving system that supports nurses in their daily work by monitoring patients and providing real-time insights.