Course Staff

Debdeep Sanyal
Instructor
Arka Mukherjee
Instructor

Content

What is this course about?

Language models serve as the cornerstone of modern natural language processing (NLP) applications and open up a new paradigm of having a single general purpose system address a range of downstream tasks. As the field of artificial intelligence (AI), machine learning (ML), and NLP continues to grow, possessing a deep understanding of language models and generative ai in general becomes essential for scientists and engineers alike. This course is designed to provide students with a comprehensive understanding of language models by walking them through the entire process of how a language model is developed and optimized. Unlike most of the courses, this course will include a lot of coding the concepts along you learn along the way, which includes developing an entire language model and from scratch and optimizing it in various aspects.

Prerequisites

Note that this is not a graded class. The purpose of this class is to introduce the mechanistics and system level optimizations of large generative models to students which are not available as distinct courses on the internet.


Coursework

Assignments

Coding assignments, related to each topic, will be provided when required and discussed in class. While ungraded, these assignments are crucial for solidifying your understanding. For any assistance, students are welcome to contact any instructor. All of the codes discussed in the classes will be available on github. Here's a list of questions you can try out before joining the class. Feel free to refer to ChatGPT or any AI to help solve and understand the questions.

Topics to be covered


Announcements 📢

The second lecture is out! Slides for the second class will be made available soon.

Schedule

# Date Description Course Materials Exercises Video Link
1 19th March 2025 Introduction to Language Models: Scope and Overview Introduction to Language Models - Scope and Overview Assignment 1 LLM | Introduction to Language Models: Scope and Overview | Lecture 01
2 23rd March 2025 Introduction Neural Networks - Part 1 TBD TBD LLM | Introduction to Neural Networks: Part 1 | Lecture 02
3 30th March 2025 Advanced Language Modeling with Recurrent Nets Advanced Language Modeling with Recurrent Networks Resources on RNNs and LSTMs NA