Course Content
- Intro (0:45)
- The RNN Encoder-Decoder vs Attention mechanism (4:08)
- The Attention Layer (6:25)
- The Bahdanau Attention (4:40)
- The Luong Attention (3:13)
- Implementing in PyTorch (2:18)
- Implementing the Bahdanau attention (9:32)
- Implementing the Luong attention (7:42)
- Implementing the Decoder (10:51)
- Putting everything together (2:32)
- Outro (0:37)
- Intro (1:01)
- The Overall Architecture (5:10)
- The Position Embedding (13:07)
- The Encoder (5:23)
- The Decoder (6:36)
- Implementing the Position Embedding (4:16)
- Implementing the Position-Wise Feed-Forward Network (1:54)
- Implementing the Encoder Block (2:44)
- Implementing the Encoder (2:58)
- Implementing the Decoder Block (3:56)
- Implementing the Decoder (3:12)
- Implementing the Transformer (2:35)
- Testing the code (8:05)
- Outro (0:48)
Damien Benveniste, PhD
Requirements
- Intermediate knowledge of Python programming
- Knowledge of PyTorch can be helpful
- Some knowledge of Machine Learning can be helpful
Description
Welcome to Introduction to Transformers for Large Language Models. Very recently, we saw a revolution with the advent of Large Language Models. It is rare that something changes the world of Machine Learning that much, and the hype around LLMs is real! That's something very few experts predicted, and it's essential to be prepared for the future.
This course is for Machine Learning enthusiasts who want to understand the inner workings of the Transformer architecture. We are going to trace the models that led to that breakthrough back in 2017: from the RNN Encoder-Decoder architecture, through the Bahdanau and Luong attention mechanisms, up to the self-attention mechanism. We are also going to dive into the strategies used to parse text into tokens before feeding it to LLMs, and into how LLMs can be tuned to generate text.
Each section is divided into a conceptual part and a coding part. I recommend digging into both, but feel free to focus on the concepts or the coding, whichever matters more to you; I made sure to keep the two separate for learning flexibility. In the coding part, we are going to see how the different models are implemented in PyTorch, and we are going to explore some of the capabilities of the Transformers Python package by Hugging Face. However, this is not a PyTorch course, and I will not dive into the details of the framework.
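To give you a flavor of the coding part, here is a minimal, simplified sketch in PyTorch of scaled dot-product attention, the operation at the heart of the self-attention mechanism covered in the course. It is not the course's exact implementation, just an illustration of the level we work at:

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(query, key, value):
    """Core operation behind the Transformer's self-attention."""
    d_k = query.size(-1)
    # Similarity between every query and every key, scaled by sqrt(d_k)
    scores = query @ key.transpose(-2, -1) / d_k ** 0.5
    # Turn the scores into weights that sum to 1 for each query position
    weights = F.softmax(scores, dim=-1)
    # Each output is a weighted average of the value vectors
    return weights @ value

# Toy input: a batch of 1 sequence, 4 tokens, embedding dimension 8
x = torch.randn(1, 4, 8)
out = scaled_dot_product_attention(x, x, x)  # self-attention: Q, K, V all come from x
print(out.shape)  # torch.Size([1, 4, 8])
```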
Topics covered in this course:
- The RNN Encoder-Decoder Architecture
- The Attention Mechanism Before Transformers
- The Self-Attention Mechanism
- Understanding the Transformer Architecture
- How Tokens Are Created from Words
- How LLMs Generate Text (see the short sketch after this list)
- Transformer Applications Beyond LLMs
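As a small taste of the last two topics, here is a hedged sketch of generating text with a pretrained model through the Hugging Face Transformers package. GPT-2 is used purely as an example model; the course's own examples may use a different model:

```python
from transformers import pipeline

# Load a pretrained model for text generation (GPT-2 here, purely as an example)
generator = pipeline("text-generation", model="gpt2")

# The tokenizer turns the prompt into tokens; the model then generates a continuation
result = generator("Transformers changed machine learning because", max_new_tokens=30)
print(result[0]["generated_text"])
```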
Who this course is for:
- Machine Learning enthusiasts who want to improve their knowledge of Large Language Models
- Intermediate Python developers curious to learn the ins and outs of the Transformer architecture