Sarvam AI Challenges ChatGPT and Google Gemini With Strong India-Focused AI Performance
Homegrown AI startup Sarvam AI is making global waves by outperforming ChatGPT and Google Gemini on key India-centric tasks.

Homegrown AI startup Sarvam AI is making global waves by outperforming ChatGPT and Google Gemini on key India-centric tasks.
In a powerful showing for Indian innovation, Sarvam AI, a Bengaluru-based startup, has captured global attention by outperforming world-leading artificial intelligence models like OpenAI’s ChatGPT and Google’s Gemini on specialised benchmarks tailored to Indian languages and document understanding. This achievement highlights how locally built solutions can outshine generic global models on context-specific tasks that matter most in India.
What Is Sarvam AI and Why It Matters
Sarvam AI is an Indian artificial intelligence company with a clear mission: build state-of-the-art AI models designed specifically for Indian contexts, languages, and real world challenges. Founded in 2023 by Dr Pratyush Kumar and Dr Vivek Raghavan, the company aims to reduce reliance on foreign AI systems by building sovereign, efficient, and context-aware AI models tuned to the country’s linguistic diversity.
Unlike many global AI frameworks that were primarily trained on English or generic data sets, Sarvam AI focuses on improving performance for Indian languages, voice interfaces, and document processing. This specialized focus has helped it make notable gains in niche but important areas where large generic models often struggle.
Landmark Performance on Key Benchmarks
Sarvam AI’s biggest breakthroughs have come from its two core AI tools:
Sarvam Vision– Advanced Document Understanding
Sarvam Vision is a powerful vision language model trained to excel at tasks like Optical Character Recognition (OCR), scene text extraction, image captioning, and complex table interpretation.
In head-to-head comparisons:
- It achieved an 84.3% accuracy score on the olmOCR-Bench, putting it ahead of Google Gemini 3 Pro and OpenAI’s ChatGPT in recognizing text and documents.
- On a broader benchmark called OmniDocBench v1.5, Sarvam Vision scored 93.28%, showcasing exceptional understanding of complex layouts, technical tables, and mixed content a task that traditional AI systems often find challenging.
These results demonstrate that a focused, India-tailored model can outperform larger, more generic models on tasks closely tied to India’s language and document diversity.
Bulbul V3– Next-Generation Voice AI
Beyond text and vision tasks, Sarvam AI’s Bulbul V3 model is making strides in text-to-speech generation, especially for Indian languages. Unlike many global voice AIs designed primarily for Western audiences, Bulbul supports dozens of voices across India’s linguistic spectrum. It is reported to handle at least 35 distinct voices and natural speech patterns, with plans to expand support to all 22 scheduled Indian languages.
This ability to interpret and generate regional language voice output brings Sarvam AI closer to real-world deployment in education, government services, enterprise support systems, and more.
The Indian Advantage: Context and Use Cases
Why does Sarvam AI outshine giants like ChatGPT or Gemini on these benchmarks?
1. Localization Matters
Global AI giants train on worldwide data, which excels at generic tasks but sometimes falls short on regional nuances especially in languages like Hindi, Telugu, Tamil, Bengali and others. Sarvam AI fills this gap by training on India-specific data, giving it an edge in language understanding and context-sensitive tasks.
2. Focused Model Design
Rather than building everything into one massive model, Sarvam AI develops task-specific tools optimized for problems like document processing, OCR, and conversational voice models areas where real users encounter friction daily.
3. Efficiency and Affordability
Sarvam AI’s models are designed to run on a range of infrastructure setups from cloud to phones making them more accessible for developers, enterprises, and government services within India’s digital ecosystem.
Together, these elements help explain why Sarvam AI is gaining real traction in a space long dominated by global players with far more resources.
Not a Blanket Victory — But a Tactical One
While Sarvam AI’s recent benchmark success is notable, experts caution that this does not mean it has universally surpassed ChatGPT or Gemini in all categories. Its strengths lie in task-specific performance, particularly those related to local languages and document processing, rather than broad general intelligence across every possible use case.
Senior analysts note that general purpose AI systems remain more capable in wide ranging tasks like creative writing, abstract reasoning, and multimodal conversation. Sarvam AI’s true value is in how well it complements and enhances these systems for Indian applications rather than replacing them outright.
Leadership Behind Sarvam AI
At the helm is Dr Pratyush Kumar, an IIT Bombay and ETH Zurich alum with deep experience in AI research and applications. Kumar co-founded Sarvam AI with a vision to create India’s sovereign AI stack robust, culturally sensitive, and built from the ground up with local needs in mind. He previously co-founded AI4Bharat, a leading Indian AI research lab, and was instrumental in boosting language-centric model development.
Under Kumar’s leadership, Sarvam AI has assembled a team focused on solving real-world problems from literacy and governance to digital customer support and enterprise automation rather than chasing benchmarks alone.
What This Means for India’s AI Future
Sarvam AI’s achievements mark more than just a few benchmark wins. They signal that sovereign AI systems built for purpose and not just scale can compete with global giants where it matters most.
For India, this has several implications:
- Language empowerment: Better tools for Indian languages boost inclusivity and impact.
- Local governance: Governments can deploy efficient OCR and voice systems for public services.
- Enterprise adoption: Businesses can build domestic AI solutions tailored to regional markets.
- Reduced dependence: A vibrant local AI ecosystem reduces reliance on foreign platforms.