Tech giants like to boast about trillion-parameter AI models that require massive and expensive GPU clusters. But Fastino is taking a different approach.
The Palo Alto-based startup says it has invented a new kind of AI model architecture that’s intentionally small and task-specific. The models are so small they’re trained with low-end gaming GPUs worth less than $100,000 in total, Fastino says.
The method is attracting attention. Fastino has secured $17.5 million in seed funding led by Khosla Ventures, famously OpenAI’s first venture investor, the company exclusively tells TechCrunch.
This brings the startup’s total funding to nearly $25 million. It raised $7 million last November in a pre-seed round led by Microsoft’s VC arm M12 and Insight Partners.
“Our models are faster, more accurate, and cost a fraction to train while outperforming flagship models on specific tasks,” says Ash Lewis, Fastino’s CEO and co-founder.
Fastino has built a suite of small models that it sells to enterprise customers. Each model focuses on a specific task a company might need, like redacting sensitive data or summarizing corporate documents.
Fastino isn’t yet disclosing early metrics or naming its users, but says its performance is wowing them. For example, because its models are so small, they can deliver an entire response in a single token, Lewis told TechCrunch, showing off the tech giving a detailed answer all at once, in milliseconds.
It’s still a bit early to tell if Fastino’s approach will catch on. The enterprise AI space is crowded, with companies like Cohere and Databricks also touting AI that excels at certain tasks. And the enterprise-focused SOTA model makers, including Anthropic and Mistral, also offer small models. It’s no secret, either, that the future of generative AI for enterprise is likely in smaller, more focused language models.
Time may tell, but an early vote of confidence from Khosla certainly doesn’t hurt. For now, Fastino says it’s focused on building a cutting-edge AI team. It’s targeting researchers at top AI labs who aren’t obsessed with building the biggest model or beating the benchmarks.
“Our hiring strategy is very much focused on researchers that maybe have a contrarian thought process to how language models are being built right now,” Lewis says.
Charles Rollet
Senior Reporter
Charles Rollet is a senior reporter at TechCrunch. His investigative reporting has led to U.S. government sanctions against four tech companies, including China’s largest AI firm. Prior to joining TechCrunch, Charles covered the surveillance industry for IPVM. Charles is based in San Francisco, where he enjoys hiking with his dogs. You can contact Charles securely on Signal at charlesrollet.12 or +1-628-282-2811.