ABCDEF
1
2
Data TypeLanguagesDescriptionSamples
3
Natural Conversational Voice DatasetHindi and Indian English10,885+ Hours of Hindi and 10,392+ of Indian English Conversation Voice Dataset. Natural two - person conversations recorded in each language. Each sample captures genuine interactions covering a range of topics and emotions typical in everyday conversations. With 1,000+ hours transcribed for each.Hindi AudioHindi Sample Transcript
4
Indian English AudioEnglish Sample Transcript
5
Natural Conversational Voice Dataset Telugu, Malayalam, Bengali, Punjabi, Tamil, Marathi10,000+ Hours of Telugu, Malayalam, Bengali, Punjabi, Tamil and Marathi. Natural two - person conversations recorded in each language. Each sample captures genuine interactions covering a range of topics and emotions typical in everyday conversations. Transcription not readily available.Telugu AudioTelugu Sample Transcript
6
Malayalam AudioMalayalam Sample Transcript
7
Bengali AudioBengali Sample Transcript
8
Punjabi AudioPunjabi Sample Transcript
9
Natural Conversational Voice Dataset - Low Resource LanguagesBodo, Maithili, Bhojpuri and Haryanvi10,000+ Hours for each language. Natural two-person conversations recorded in each language. Each sample captures genuine interactions covering a range of topics and emotions typical in everyday conversations. Transcription not readily available.Bodo AudioBodo Sample Transcript
10
Maithili AudioMaithili Sample Transcript
11
Bhojpuri AudioBhojpuri Sample Transcript
12
Haryanvi AudioHaryanvi Sample Transcript
13
Read Speech HindiHindi This dataset comprises 1.2 million recordings totaling 10,374 hours of Hindi content, based on 124,000 unique paragraphs. With contributions from over 40,000 speakers across India, each 20-30 second recording captures linguistic diversity. (also available in English, Tamil, Telugu, Marathi, Bangla)AudioTranscript
14
Natural conversations - 4 personHindi, Indian English550+ hours: Natural, four-person conversational audio recordings where participants were prompted to discuss various topics. The conversations include a mix of Hindi and Indian English. (also available in Tamil, Telugu, Marathi, Bangla)AudioTranscript
15
High Mother Tounge Influence, High Emotion Conversations Hindi 1069 hours:Indian English. Speakers with medium and high degree of mother tongue influence were selected. Speakers were given an imaginary situation and asked to be overly emotional. (also available in English, Tamil, Telugu, Marathi, Bangla)Audio
16
Real World Conversation Indian English (uncontrolled environment)Indian English20,542 hours: Natural conversations between 2 people learning to speak fluently in English with 1,000 hours transcribed. The conversations took place in an uncontrolled environment, including adverse conditions and may have background noise as is the case in real world environments. Transcription not readily available.Audio
17
Unique Environment High Emotion Utterances Hindi 1099 hours: Conversational data in Indian English and Hindi. Each recording has high degree of emotion. Users were given prompts of situations such as getting a promotion for a job and instructed to be highly emotional. Each utterance is a short conversation ranging from 30 sec to 3 min long. A total of 10 emotions are covered across all conversations. The users were instructed to be in certain specific environments like on a busy road and 8 other environments. (also available in English, Tamil, Telugu, Marathi, Bangla)Audio (Environment: washroom with echo, Emotion: excited
18
Code Switching Conversation Hindi - EnglishHindi- English1000+ hours: 1259 unique speakers have 2-person unscripted conversations. Each recording includes conversational audio where speakers fluidly switch between Hindi and English, capturing the natural bilingual speech patterns common in multilingual communities. (also available in other languages paired with English)Audio
19
Josh Talks YouTube Videos Speech DatasetHindi 1534 hours of studio recorded spontaneous speech covering 8 languages. This audio data is a part of the library of Josh Talks content that is shared on YouTube. (also available in English, Tamil, Telugu, Marathi, Bangla, Malayalam)AudioTranscript
20
Voice Assistant Prompts with Phonetic TranscriptionsHindi This dataset comprises 100,000 utterances recorded by 200 unique speakers from 68 districts across India each speaking 500 phonetically rich sentences. (also available in English, Tamil, Telugu, Marathi, Bangla)AudioPhonetic Transcript