The SSNCE Database of Tamil Dysarthric Speech
|Item Name:||The SSNCE Database of Tamil Dysarthric Speech|
|Author(s):||P. Vijayalakshmi, T. A. Mariya Celin, T. Nagarajan|
|LDC Catalog No.:||LDC2021S04|
|Release Date:||May 17, 2021|
|DCMI Type(s):||Sound, Text|
|Data Source(s):||microphone speech|
|Application(s):||speech recognition, subjectivity analysis, temporal analysis|
The SSNCE Database of Tamil Dysarthric Speech Agreement
|Online Documentation:||LDC2021S04 Documents|
|Licensing Instructions:||Subscription & Standard Members, and Non-Members|
|Citation:||Vijayalakshmi, P., T. A. Mariya Celin, and T. Nagarajan. The SSNCE Database of Tamil Dysarthric Speech LDC2021S04. Web Download. Philadelphia: Linguistic Data Consortium, 2021.|
The SSNCE Database of Tamil Dysarthric Speech was developed by the Speech Lab, SSN College of Engineering, India, in collaboration with the Indian National Institute of Empowerment of Persons with Multiple Disabilities (NIEPMD) and contains approximately eight hours of Tamil speech data, time-aligned transcripts and metadata collected from 30 speakers (20 dysarthric speakers and 10 non-dysarthric speakers).
Dysarthria is a speech disorder caused by muscle weakness which can result in slowed and slurred speech that is difficult to understand. Common causes of dysarthria include nervous system disorders and conditions that cause facial paralysis or tongue or throat muscle weakness.
The non-dysarthric speakers consisted of five female and five male subjects. The dysarthric speakers (7 female, 13 male) reported a diagnosis of cerebral palsy and ranged in age from 12 years old to 37 years old.
The speech data was collected between 2015 and 2017 in two sessions at NIEPMD. In total, each speaker recorded 365 utterances consisting of single words and of sentences that included a combination of common and uncommon Tamil phrases.
The corpus includes time-aligned phonetic transcripts for all collected speech data. Additional documentation includes phoneme mappings and speaker metadata.
Audio data is presented as 16-bit 16kHz FLAC compressed linear pcm wav. Transcripts are presented as UTF-8 encoded plain text.
Please view the following samples:
None at this time.