Build LLM from Scratch, Chapter 2 — Working with Text Data
An Elixir/Nx walkthrough of preparing text for LLM training: tokenization, token IDs, BPE, sliding windows, token embeddings, and positional embeddings.
https://karlosmid.com/2026/01/build-llm-from-scratch-chapter-2-working-with-text-data/
