dla
A course on deep learning for audio, offered in autumn 2024 at the HSE CS Faculty, with weekly lectures, seminars, and self-study material. It covers core areas such as digital signal processing, speech recognition, and audio-visual fusion, and examines methods such as CTC, RNN-T, and self-supervised learning in detail. Practical projects, including training speech recognition models and audio-visual separation, reinforce the material.