AI & Graphics | AI & Graphics

Learn AI

vLLM Optimization Deep Dive

Master vLLM's optimization internals through Triton kernel implementations.

Gurwinder May 2026 · 6 Modules · 2-3 Hours

GPU Optimization

GPU Optimization: 10 vLLM PRs

Learn optimization techniques from actual merged vLLM PRs with verified performance numbers.

Gurwinder May 2026 · 10 Modules · 2-3 Hours

PyTorch Optimization

PyTorch Optimization: 10 PRs

Learn optimization techniques from actual merged PyTorch PRs with verified performance numbers.

Gurwinder May 2026 · 10 Modules · 2-3 Hours

Featured

How PyTorch Sees Your Triton Kernel: Using ReLU Kernel in Model with Dynamo and AOT Autograd Backend

How PyTorch Sees Your Triton Kernel: Using ReLU Kernel in Model with Dynamo and AOT Autograd Backend

How to write Triton Kernel, wire it into model with full gradient support, and then trace the entire …

Gurwinder Apr 24, 2026 · 10 min read

Understanding Triton Kernels from First Principles

Understanding Triton Kernels from First Principles

A deep dive into how Triton kernels work, explained from absolute basics to complete understanding. …

Gurwinder Mar 19, 2026 · 7 min read

All Stories

A thumbnail image

Under the Hood: How PyTorch Chooses Attention Kernels and Why It Matters for Performance

A deep dive into PyTorch’s attention kernel selection and what each choice means for your …

Gurwinder Sep 20, 2025 · 8 min read

A thumbnail image

Breaking Down Vision Transformers: A Code-Driven Explanation

In this article, I’ll break down the layers of a ViT step by step with code snippets, and a …

Gurwinder Nov 25, 2024 · 4 min read

A thumbnail image

Game Development

Turn 3D Gaussian Splat Files into Stunning Assets in Unity 6

This guide walks you through the process of loading splat files in Unity 6 using the Gaussian …

Gurwinder Nov 11, 2024 · 3 min read

A thumbnail image

Game Development

Intel GPU Scheduling: Exploring Matrix Addition with SYCL and PyTorch

If you’ve ever worked with GPUs, you know how crucial it is to understand how they manage workloads. …

Gurwinder Oct 20, 2024 · 4 min read

A thumbnail image

Game Development

HLSL Ray Tracing: Crafting Realistic Scenes in Unity, One Ray at a Time

Instead of just slapping textures on polygons, ray tracing lets us simulate how light interacts with …

Gurwinder Oct 11, 2024 · 6 min read

A thumbnail image

Harnessing Local Llama to Process Complete Projects: How I use AI for code suggestions and refactoring my Projects

We’ll walk through a Python script that leverages the LangChain framework to process a codebase, …

Gurwinder Oct 10, 2024 · 6 min read