Sarvam AI Launches Powerful Open-Source LLM: A 24 Billion Parameter Model for Advanced AI Tasks.
  • 343 views
  • 2 min read

Sarvam AI, a Bengaluru-based artificial intelligence startup, has recently launched its flagship large language model (LLM), Sarvam-M, designed with a focus on Indian languages and reasoning capabilities. This open-source, 24-billion-parameter model is built upon Mistral Small and represents a significant stride towards building a sovereign AI ecosystem in India. Sarvam-M is engineered to power a diverse range of applications, including conversational agents, machine translation, and educational tools, specifically tailored to the Indian context.

Sarvam-M distinguishes itself through its unique training process, which involves three key steps: Supervised Fine-Tuning (SFT), Reinforcement Learning with Verifiable Rewards (RLVR), and inference optimization. During SFT, the model is trained using carefully crafted prompts to enhance its capabilities in general dialogue and complex reasoning. RLVR further refines its instruction-following and mathematical skills through custom reward engineering and curated datasets. Finally, inference is optimized using FP8 post-training quantization and techniques like lookahead decoding, ensuring efficient and accurate responses, especially in real-time applications.

The model has demonstrated strong performance in multilingual and reasoning benchmarks. It achieved an impressive 86% gain on a romanized Indian language version of the GSM-8K math dataset. Moreover, it showcased average performance boosts of 20% on Indian language benchmarks, 21.6% on math tasks, and 17.6% on programming tasks. When compared to other models, Sarvam-M outperforms Llama-4 Scout and is comparable to larger models like Llama-3.3 70B and Gemma 3 27B. However, it slightly lags in English benchmarks such as MMLU, indicating a trade-off for its enhanced multilingual and reasoning strengths.

Sarvam-M's architecture is designed for versatility, supporting a wide array of applications. Its accessibility is facilitated through Sarvam's API, a dedicated playground, and its availability for download on Hugging Face, enabling developers and researchers to experiment and integrate the model into various projects. The model supports 10 Indian languages, including Hindi, Bengali, Gujarati, Kannada, and Malayalam.

The launch of Sarvam-M is part of Sarvam AI's broader vision to create a sovereign AI ecosystem in India. This initiative is aligned with the Indian government's IndiaAI Mission, which aims to strengthen the country's domestic AI capabilities. Sarvam AI was selected by the Indian government to build a sovereign LLM under this mission.

Despite the model's capabilities, its initial reception has been mixed. While it has been praised for its focus on Indian languages, mathematics, and programming tasks, some critics have pointed out that it is not "good enough" compared to more established models. Some experts argue that there are cheaper and better models available from Google and other companies. The debate extends to whether India should focus on building AI for local needs or benchmark against Silicon Valley.

In response to the criticism, Sarvam AI has emphasized that Sarvam-M is a research model and a stepping stone towards building a comprehensive sovereign AI. The company plans to release models regularly and share detailed technical findings to foster collaboration and innovation.


Writer - Neha Gupta
Neha Gupta is a seasoned tech news writer with a deep understanding of the global tech landscape. She's renowned for her ability to distill complex technological advancements into accessible narratives, offering readers a comprehensive understanding of the latest trends, innovations, and their real-world impact. Her insights consistently provide a clear lens through which to view the ever-evolving world of tech.
Advertisement

Latest Post


Infosys is strategically leveraging its "poly-AI" or hybrid AI architecture to deliver significant manpower savings, potentially up to 35%, for its clients across various industries. This approach involves seamlessly integrating various AI solutions,...
  • 426 views
  • 3 min

Indian startups have displayed significant growth in funding, securing $338 million, marking a substantial 65% year-over-year increase. This surge reflects renewed investor confidence in the Indian startup ecosystem and its potential for sustainable...
  • 225 views
  • 3 min

Cohere, a Canadian AI start-up, has reached a valuation of $6. 8 billion after securing $500 million in a recent funding round. This investment will help Cohere accelerate its agentic AI offerings. The funding round was led by Radical Ventures and In...
  • 320 views
  • 2 min

The Indian Institute of Technology Hyderabad (IIT-H) has made significant strides in autonomous vehicle technology, developing a driverless vehicle system through its Technology Innovation Hub on Autonomous Navigation (TiHAN). This initiative marks ...
  • 377 views
  • 2 min

Advertisement

About   •   Terms   •   Privacy
© 2025 TechScoop360