A First With Claude Code

I’ve been downloading public-domain books for a project, and many of them are kind of bad OCR scans.

A lot of the work I was doing was in Claude Code. I created a skill where it would look online for the text I’m seeking and download it into a sources folder with a certain naming convention.

At one point, I asked, “What options do I have to clean up files downloaded from Project Gutenberg OCR scans into nicely formatted markdown files? Ideally, tools I can download that won’t burn through tokens,” and I was surprised by its suggestion.

It actually recommended I use Ollama (a local LLM runtime) to go through and clean up the text files. I’m not sure whether it suggested Ollama because it could see it was already installed, or whether it would have suggested it either way, but this is the first time it has done that.

Claude Code ended up writing a Python script to work with Ollama and a bash script to bootstrap the process. Fourteen hours later, I had three massive OCR scans cleaned up, and it didn’t use any of my Claude Code quota. It recommended I use the “qwen2.5:14b” model, which seemed to do a good job and ran fine on my M4 Mac Mini with 24 GB of RAM.
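The generated script isn’t reproduced here, but a minimal sketch of the general approach might look something like this. It assumes Ollama is running locally and reachable through its HTTP API at the default address (`http://localhost:11434/api/generate`). The prompt wording, the chunk size, and the function names (`chunk_paragraphs`, `clean_chunk`, `clean_file`) are all hypothetical, chosen for illustration only.

```python
import json
import sys
import urllib.request

# Assumption: Ollama's default local endpoint for one-shot generation.
OLLAMA_URL = "http://localhost:11434/api/generate"

# Hypothetical prompt; a real cleanup prompt would need tuning per source.
PROMPT = (
    "Clean up this OCR text. Fix broken words and stray characters, "
    "keep the wording intact, and return Markdown only.\n\n"
)


def chunk_paragraphs(text, max_chars=4000):
    """Pack blank-line-separated paragraphs into chunks of at most max_chars.

    A single paragraph longer than max_chars becomes its own oversized chunk.
    """
    chunks, current = [], ""
    for para in text.split("\n\n"):
        para = para.strip()
        if not para:
            continue
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) <= max_chars:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks


def clean_chunk(chunk, model="qwen2.5:14b", url=OLLAMA_URL):
    """Send one chunk to the local model and return the cleaned text."""
    payload = json.dumps(
        {"model": model, "prompt": PROMPT + chunk, "stream": False}
    ).encode("utf-8")
    request = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=600) as response:
        return json.loads(response.read())["response"].strip()


def clean_file(src_path, dst_path):
    """Clean a whole text file chunk by chunk and write a Markdown file."""
    with open(src_path, encoding="utf-8") as src:
        chunks = chunk_paragraphs(src.read())
    with open(dst_path, "w", encoding="utf-8") as dst:
        for index, chunk in enumerate(chunks, start=1):
            print(f"chunk {index}/{len(chunks)}", file=sys.stderr)
            dst.write(clean_chunk(chunk) + "\n\n")


if __name__ == "__main__":
    clean_file(sys.argv[1], sys.argv[2])
```

Splitting on paragraph boundaries keeps each request small enough for a local model’s context window, which is also why a long book can take hours: every chunk is a separate round trip to the model.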

Published May 19, 2026

One comment on “A First With Claude Code”

Dave Winer

May 19, 2026 at 12:32 pm

Glad to see you writing about Claude Code. I hope we can exchange ideas and tools for building stuff in this new space.
