@marc.kaz — DY7SU5LujWZ

Open on Instagram ↗

@marc.kaz · Marc Kaz

Saved 2026-06-03 · Posted 2026-05-29 · Status: New

This repo proves you don’t need a datacenter or millions of dollars to train real LLMs.

Ships with:
• Full GPT-style model training from scratch
• Tokenization, architecture & training loop
• Advanced techniques that run on consumer hardware
• Distributed training tricks optimized for single GPU

Everything is open source and explained step-by-step.

👉 https://github.com/FareedKhan-dev/train-llm-from-scratch

Who’s training their own billion-parameter model today? Drop a 🔥

Comments (15)

@mr_mabulous ♥ 174 2026-05-29

Yeah and the output it generates is complete garbage - worse than gpt 2. Reminds me of markov-chain gibberish from 15 years ago
@phanto90 ♥ 133 2026-05-29

Why people who does not know shit talks about AI and LLMs
@_kaustav97 ♥ 108 2026-05-29

Been training ml models since 2017, let me tell you something, scripts don't matter, if your hardware and dataset is limited. Once you solve those two, than you can decide what scripts are the best.
@joephilipson ♥ 18 2026-05-29

Idgi, I've been doing this in Claude.
@liquiditysweeps ♥ 13 2026-05-29

Kinda wanna touch into this , isn’t new been around since 2017-2019, just his own version of the transformer. Ur still heavily limited based on hardware this isn’t a hardware workaround, your not gonna be able to train a good billion parameter model on ur 3090 or 4090 (keyword good) so kinda overhyped with ur wording but still impressive for the dude
@krish_chilvery ♥ 5 2026-05-29

Takes 300+ days
@binhusmachado ♥ 2 2026-05-29

🔥
@m1.r0gue ♥ 1 2026-05-31

But.......
@faheem_.10 ♥ 1 2026-05-30

Bro don’t yap nonsense. This repo is about a small GPT-style transformer built to explain the fundamentals not to train any 1-2b params model. I've made a similar project repo on github, ive made it from scratch and it covers the architecture, tokenization, self-attention, next-token prediction, training, and theory behind how GPT-like models work. If anyone wants to learn the internals rather than just use APIs, feel free to check it out.
@choonyong.tan.33 ♥ 1 2026-05-30

You still need like a 3090
@josh_einstein ♥ 1 2026-05-29

🔥
@siddhesh_c_chaudhari ♥ 1 2026-05-29

🔥
@ajinkya_8086 ♥ 1 2026-05-29

🔥
@theyreallactors44_ ♥ 1 2026-05-29

🔥
@singhsdoingtings 2026-05-30

Is this ragebait? Please be ragebait

Content ideas (0)

No ideas generated yet. Run /instagram-sync ideate from Claude Code to create some.