GitHub - arman-bd/guppylm: A ~9M parameter LLM that talks like a small fish.
Source: Mastodon
A GitHub repository released on Monday introduces GuppyLM, a 9-million-parameter language model that "talks like a small fish." The project, authored by arman-bd and highlighted on Hacker News with a score of 103, ships a ready-to-run Colab notebook that downloads a 60,000-entry fish-conversation dataset from Hugging Face, fine-tunes the model, and exports it for local inference. The code is deliberately minimal, exposing every training step so hobbyists and students can follow a complete LLM training pipeline on a free GPU.
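The repository's actual hyperparameters aren't given in the article, but a back-of-envelope sketch shows how a GPT-style decoder lands near the 9-million-parameter mark. The configuration values below (vocabulary size, width, depth) are illustrative assumptions, not GuppyLM's real settings:

```python
def transformer_param_count(vocab: int, d_model: int, n_layers: int,
                            tied_embeddings: bool = True) -> int:
    """Rough parameter count for a GPT-style decoder (biases and norms omitted)."""
    emb = vocab * d_model                     # token embedding matrix
    attn = 4 * d_model * d_model              # Q, K, V, and output projections
    mlp = 2 * d_model * (4 * d_model)         # up- and down-projection, 4x expansion
    per_layer = attn + mlp
    head = 0 if tied_embeddings else vocab * d_model  # LM head reuses embeddings if tied
    return emb + n_layers * per_layer + head

# Hypothetical config in the right ballpark: ~8.4M parameters
total = transformer_param_count(vocab=8192, d_model=256, n_layers=8)
print(f"{total / 1e6:.1f}M parameters")  # → 8.4M parameters
```

At this scale the embedding matrix alone is a quarter of the model, which is why tiny LLMs usually tie the input embeddings and the output head.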
The release matters because it pushes the frontier of ultra-lightweight models that can be trained and served on consumer-grade hardware. At roughly 30 MB of storage and under 2 GB of VRAM during generation, GuppyLM fits comfortably on a laptop or a Raspberry Pi, opening the door to on-device experimentation without cloud costs. Its open-source nature also makes it a concrete teaching aid for the community, echoing the "tiny LLM" showcase we covered earlier this week in "Show HN: I built a tiny LLM to demystify how language models work" [2026-04-06]. Together, these projects illustrate a growing appetite for transparent, low-resource AI that can be inspected, modified, and deployed by anyone.
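The ~30 MB storage figure is consistent with 9 million weights at full or mixed precision; a quick sketch of the weight footprint at common dtypes (the precisions are assumptions, since the article does not state the export format):

```python
N_PARAMS = 9_000_000  # GuppyLM's reported parameter count

BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}

for dtype, nbytes in BYTES_PER_PARAM.items():
    mb = N_PARAMS * nbytes / 1e6
    print(f"{dtype}: {mb:.0f} MB of weights")
# fp32: 36 MB, fp16: 18 MB, int8: 9 MB
```

Weights alone stay well under the quoted 2 GB of VRAM; the rest of that budget covers activations, the KV cache, and framework overhead during generation.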
What to watch next is whether GuppyLM gains traction beyond its novelty appeal. Early adopters may integrate it with Ollama or other local-LLM runtimes, benchmark its speed and quality against larger open models, or extend the fish-dialogue corpus to other niche domains. A follow-up fork that adds tool use or multimodal capabilities would signal that the community sees genuine utility in sub-10M-parameter models, potentially sparking a wave of edge-focused AI applications across the Nordic startup scene.