India’s Sovereign AI Platform is an initiative by the Government of India to build and manage its own AI infrastructure instead of relying fully on foreign tech companies. The goal is to keep data within the country, strengthen digital independence and develop AI systems tailored to India’s needs.

When you open Sarvam AI, this is how the user interface looks. After that, click on the Experience Sarvam button. It will ask you to log in, and once you log in, you will see the dashboard like in the image below. Initially, they provide 1000 credits for free.

Free users can do:
- 33 hours of audio transcription
- 7 hours of voice generation
- 500K characters of translation
- Unlimited AI chat (free)
Table of Contents
Quick summary table for Text-to-Speech
| Feature | What It Does |
|---|---|
| Convert Text to Voice | Turns written text into spoken audio |
| Voice Styles | Choose from multiple voices to match tone and style |
| Speed & Pitch | Adjust how fast or slow, high or low the voice sounds |
| Audio Quality | Select the output quality of the audio |
| Download Audio | Save audio files for offline use |
| Generate Videos | Create videos with synced voiceover |
| API Integration | Connect text-to-speech directly to apps or websites |
Sarvam AI – Text to Speech Explained Clearly
What Text to Speech Does
Sarvam AI’s Text to Speech converts written text into audio. You use it when you need voice output instead of text.
For example:
- YouTube videos
- Voice assistants
- IVR systems
- Learning apps
- Accessibility support
Instead of recording manually every time, you just paste text and generate voice instantly.
It is free until the end of February 2026 and supports 10+ Indian languages.

Important: How Language Selection Works
If you type text in Tamil, it will read in Tamil – even if Hindi is selected in the dropdown.
The language dropdown does not translate. It mainly controls pronunciation style and voice model. So whatever language you type, it reads that language in the selected voice style.
These languages are in dropdown (English, Tamil, Hindi, Bengali, Gujarati, Kannada, Malayalam, Marathi, Odia, Punjabi, Telugu.)
Models Available
Sarvam AI provides two models:
| Model | What It Means |
|---|---|
| Bulbul V3 | Newest model, more natural and expressive |
| Bulbul V2 | Older model, simpler voice options but realistic |
Bulbul V3
Bulbul V3 is more advanced. Voices are divided into categories to make selection easier.
| Category | Voice Name | Gender | Best Used For / Description |
|---|---|---|---|
| Conversational | Shubh | Male | Friendly default voice for IVR and support |
| Conversational | Ritu | Female | Soft, approachable voice for customer interactions |
| Conversational | Amit | Male | Formal voice for business communications |
| Conversational | Sumit | Male | Balanced warmth with professionalism |
| Conversational | Pooja | Female | Encouraging voice for assistance flows |
| Conversational | Manan | Male | Consistent voice for automated systems |
| Conversational | Simran | Female | Warm voice for conversational interfaces |
| Conversational | Rahul | Male | Composed voice that builds trust |
| Conversational | Kavya | Female | Everyday conversational tone |
| Conversational | Ratan | Male | Sharp articulation for clarity |
| Conversational | Priya | Female | Upbeat voice with personality |
| Conversational | Ishita | Female | Polished voice for enterprise use |
| Conversational | Shreya | Female | Precise pronunciation and enunciation |
| Conversational | Shruti | Female | Sweet and melodious voice |
| Audiobooks | Aditya | Male | Captivating voice for stories and audiobooks |
| Audiobooks | Ashutosh | Male | Traditional Hindi narration style |
| Audiobooks | Advait | Male | Contemporary storytelling voice |
| Audiobooks | Roopa | Female | Gentle voice perfect for audiobooks |
| Audiobooks | Tanya | Female | Friendly and modern voice |
| Audiobooks | Gokul | Male | Trustworthy and dependable voice |
| Audiobooks | Suhani | Female | Pleasant and soothing voice |
| Audiobooks | Kavitha | Female | Graceful and articulate voice |
| Entertainment | Kabir | Male | Rich voice for film-style narration |
| Entertainment | Varun | Male | Tension-building voice for horror and drama |
| Entertainment | Neha | Female | Bold voice for shows and promos |
| Entertainment | Mani | Male | Calm and composed voice |
| Entertainment | Mohit | Male | Versatile and adaptable voice |
| Entertainment | Rupali | Female | Elegant and refined voice |
| Sales | Rohan | Male | Clear, confident voice for collections |
| Sales | Dev | Male | Authoritative tone for recovery calls |
| Sales | Sunny | Male | Cheerful and upbeat voice |
| News | Aayan | Male | Professional voice for news and documentaries |
| News | Anand | Male | Warm and reassuring voice for support |
| News | Tarun | Male | Professional and clear voice |
| News | Vijay | Male | Confident and authoritative voice |
| News | Rehan | Male | Youthful and energetic voice |
| News | Soham | Male | Balanced and natural voice |
You don’t need to open each one. Just read the description and choose smartly. Voice switching takes a few seconds.
You can easily Increase or decrease the audio speed & Download it easily
Bulbul V2 – Voice Options
Bulbul V2 only has Conversational category.
| Category | Voice Name | Gender | Best Used For / Description |
|---|---|---|---|
| Conversational | Anushka | Female | Default voice for general use |
| Conversational | Manisha | Female | Approachable female voice |
| Conversational | Arya | Female | Energetic female voice |
| Conversational | Abhilash | Male | Articulate male voice |
| Conversational | Karun | Male | Strong, authoritative presence |
| Conversational | Hitesh | Male | Adaptable male voice |
| Conversational | Vidya | Female | Composed, reliable tone |
This model feels simple and natural, like someone talking casually. You can easily adjust audio speed and pitch.
Audio Quality Options
You can select output quality.
| Option | Usage |
|---|---|
| Standard (22.05 kHz) | Balanced quality |
| Telephony (8 kHz) | Phone calls & IVR systems |
| High Quality (48 kHz) | Best clarity |
Choose based on what you actually need. If you’re setting it up for IVR, go with Telephony. If it’s for content creation, choose Standard or High Quality.
Get Code Option
The Get Code option gives you an API snippet that lets developers connect Sarvam AI directly to their app, website, chatbot, IVR system, or SaaS product, so text can be automatically converted to speech instead of doing it manually through the dashboard.
Share Option (Video Generation)
When you click Share, it lets you convert your text + voice into a video. You can choose a background style.
| Styles Available | Warm Sunset | Midnight | Deep Ocean | Soft Light | Ember |
After selecting a style, click Generate Video. Within a few seconds, it creates a video automatically. You can download it easily.
Output
Convert (Text to Speech) into video
I found an issue in the video section.
There’s a problem when converting to video. The full text doesn’t display properly. Only part of it shows and when the next sentence plays, the visual text mostly stays static instead of updating. Fixing this would make the feature much better.









0 Comments