← Puter stuff
Open on Instagram ↗

Comments (15)

  • @mr_mabulous 2026-05-29

    Yeah and the output it generates is complete garbage - worse than gpt 2. Reminds me of markov-chain gibberish from 15 years ago

  • @phanto90 2026-05-29

    Why people who does not know shit talks about AI and LLMs

  • @_kaustav97 2026-05-29

    Been training ml models since 2017, let me tell you something, scripts don't matter, if your hardware and dataset is limited. Once you solve those two, than you can decide what scripts are the best.

  • @joephilipson 2026-05-29

    Idgi, I've been doing this in Claude.

  • @liquiditysweeps 2026-05-29

    Kinda wanna touch into this , isn’t new been around since 2017-2019, just his own version of the transformer. Ur still heavily limited based on hardware this isn’t a hardware workaround, your not gonna be able to train a good billion parameter model on ur 3090 or 4090 (keyword good) so kinda overhyped with ur wording but still impressive for the dude

  • @krish_chilvery 2026-05-29

    Takes 300+ days

  • @binhusmachado 2026-05-29

    🔥

  • @m1.r0gue 2026-05-31

    But.......

  • @faheem_.10 2026-05-30

    Bro don’t yap nonsense. This repo is about a small GPT-style transformer built to explain the fundamentals not to train any 1-2b params model. I've made a similar project repo on github, ive made it from scratch and it covers the architecture, tokenization, self-attention, next-token prediction, training, and theory behind how GPT-like models work. If anyone wants to learn the internals rather than just use APIs, feel free to check it out.

  • @choonyong.tan.33 2026-05-30

    You still need like a 3090

  • @josh_einstein 2026-05-29

    🔥

  • @siddhesh_c_chaudhari 2026-05-29

    🔥

  • @ajinkya_8086 2026-05-29

    🔥

  • @theyreallactors44_ 2026-05-29

    🔥

  • @singhsdoingtings 2026-05-30

    Is this ragebait? Please be ragebait

Content ideas (0)

No ideas generated yet. Run /instagram-sync ideate from Claude Code to create some.