Sudip, Y Combinator Alum, Takes to X

benchmarks funding

2026-05-03 | Source: Mastodon

Sudip optimizes tiny microGPT on MacBook. Reaches 6.7M tokens/sec.

Sudip, founder of Lamina Labs (YC P26), ran Karpathy's tiny microGPT on an Apple M5 Pro MacBook using optimized C/NEON and reached 6.71 million tokens per second. With multiple independent streams combined, aggregate throughput rose to approximately 86 million tokens per second, a figure that rivals FPGA-based systems.

The benchmark is aimed at developers and shows how much AI throughput consumer-grade hardware can deliver. It matters because it points to the growing accessibility of AI technology: developers can build and test models without massive funding or specialized equipment. As Sudip noted in a LinkedIn post, building deep tech does not necessarily require a large budget or a large team, and this result supports that view.

With demand for efficient AI processing increasing, results like this could influence how AI is developed and deployed. It will be worth watching how far developers like Sudip can push consumer-grade hardware.
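To put the two reported throughput figures in perspective, the short Python sketch below converts the single-stream rate into time per token and computes the ratio between the combined and single-stream rates. Only the two numbers (6.71 million and roughly 86 million tokens per second) come from the article. The function names are illustrative, and the article does not state how many streams were used, so the ratio is an implied aggregate speedup rather than a stream count.

```python
# Back-of-the-envelope arithmetic on the figures quoted in the article.
# Only the two throughput numbers are from the source; everything else
# (names, structure) is illustrative.

SINGLE_STREAM_TPS = 6.71e6  # tokens/sec, single stream (reported)
MULTI_STREAM_TPS = 86e6     # tokens/sec, combined streams (reported as approximate)


def ns_per_token(tokens_per_sec: float) -> float:
    """Average wall-clock time per token, in nanoseconds."""
    return 1e9 / tokens_per_sec


def aggregate_speedup(multi_tps: float, single_tps: float) -> float:
    """Ratio of combined throughput to single-stream throughput."""
    return multi_tps / single_tps


if __name__ == "__main__":
    print(f"Single stream: {ns_per_token(SINGLE_STREAM_TPS):.1f} ns per token")
    print(f"Combined vs. single: {aggregate_speedup(MULTI_STREAM_TPS, SINGLE_STREAM_TPS):.1f}x")
```

On these numbers, a single stream averages about 149 ns per token, and the combined figure is about 12.8 times the single-stream rate. Because the 86 million figure is approximate, the ratio should be read as approximate too.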
