Soul Player C64: Real Transformer Runs on a 1 MHz Commodore 64
Source: HN
A small team of hobbyist developers has pushed the limits of retro hardware by getting a genuine transformer‑style language model to run on a 1 MHz Commodore 64. The project, dubbed **Soul Player C64**, ships as a 25 k‑parameter transformer that can be loaded from a .d64 disk image and executed either in the VICE emulator or on a real C64 equipped with a 1541 floppy drive. The code relies on aggressive quantisation, 8‑bit integer arithmetic and hand‑optimised 6502 assembly loops to fit inference within the machine's meagre 64 KB of RAM and its 1 MHz clock budget.
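The project's actual kernels are hand-written 6502 assembly, but the core idea behind 8-bit integer inference can be illustrated in a few lines. The sketch below shows generic symmetric int8 quantisation with integer accumulation, the standard technique the article describes; it is not the Soul Player C64 code, and the function names are illustrative only.

```python
import numpy as np

def quantize_int8(w):
    # Symmetric per-tensor quantisation: map floats onto [-127, 127]
    scale = np.max(np.abs(w)) / 127.0
    q = np.clip(np.round(w / scale), -128, 127).astype(np.int8)
    return q, scale

def int8_matvec(q_w, scale_w, q_x, scale_x):
    # Multiply-accumulate in 32-bit integers (on a 6502 this would be
    # multi-byte adds), then rescale back to the float domain
    acc = q_w.astype(np.int32) @ q_x.astype(np.int32)
    return acc * (scale_w * scale_x)

rng = np.random.default_rng(0)
W = rng.standard_normal((4, 8)).astype(np.float32)   # toy weight matrix
x = rng.standard_normal(8).astype(np.float32)        # toy activation vector

qW, sW = quantize_int8(W)
qx, sx = quantize_int8(x)
approx = int8_matvec(qW, sW, qx, sx)
exact = W @ x
# The int8 result tracks the float result within a small quantisation error
```

On hardware with no floating-point unit, the float rescaling step would itself be replaced by a fixed-point shift-and-multiply, which is presumably what the hand-optimised assembly loops do.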
The significance goes beyond novelty. As we reported on 20 April in “The Trouble with Transformers”, the energy and compute appetite of modern LLMs is a growing concern. Soul Player C64 shows that, with extreme model pruning and hardware‑aware design, neural inference can run on devices that consume a tiny fraction of the power of today’s GPUs. It also supports the claim from the same day’s “Local LLMs are actually good now” blog post that small, locally‑run models can be practical, opening a pathway for ultra‑low‑power AI in embedded or off‑grid scenarios.
The demonstration is a proof‑of‑concept rather than a production‑ready tool, but it raises several questions for the community. Will the approach scale to larger vocabularies or multimodal tasks, or is 25 k the practical ceiling for a 6502‑class CPU? Can similar tricks be applied to other vintage platforms, turning them into educational sandboxes for AI fundamentals? The developers plan to publish a benchmark suite comparing inference latency on the C64, a modern laptop and a low‑end ARM board, and they have opened the source repository for contributors to experiment with pruning strategies and custom kernels. The next few weeks should reveal whether this retro‑AI stunt sparks a broader movement toward “micro‑transformers” on ultra‑constrained hardware.