A step-by-step guide to building your own Large Language Model (LLM) from scratch! Whether you're an AI enthusiast, a developer curious about LLMs, or someone who wants to dive deeper into how GPT-like models work, this repository will take you on a hands-on journey through the fundamentals of LLM development.
- Beginners curious about AI/ML.
- Developers looking to demystify large language models.
- Researchers interested in experimenting with LLM architectures.
This project is a work in progress, and new modules are added regularly. Stay tuned for updates and feel free to contribute!
This repository includes Python code, Jupyter notebooks, and detailed explanations for every step.