Exploring Karpathy's New NanoChat Project

Roy Osherove

13 Oct 2025

OK. This is Big. If you don't know who Andrej Karpathy is, please get to know him and all of his amazing writing and projects.

💡

If you're a hands-on developer, I recommend Karpathy's "Technical Track" set of YouTube videos, where he basically builds an LLM from absolute scratch, including tokenization. Just be warned - it's about as deep as a rabbit hole goes.

Recently, Andrej released a new repository called NanoChat:

"A minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone."

You can read all about it in his own words:

Excited to release new repo: nanochat!
(it's among the most unhinged I've written).

Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,… pic.twitter.com/LLhbLCoZFt
— Andrej Karpathy (@karpathy) October 13, 2025

Now What?

My background is in software development and engineering rather than research, and I've been slowly getting into the "deep" side of LLMs and how models work under the covers. So I'll be walking through the code, running the training myself, and documenting any insights or findings here on this blog.

I'll be starting from this page, and as we speak I'm running the training on lambda.ai. But my goal is to understand all the steps in a deep way. Today I understand them only at a surface level, and I have been working to change that. I will report back as I progress.

