ST-CMDS (Free ST Chinese Mandarin Corpus)
Free Chinese Mandarin speech dataset with ~100 hours from 855 speakers recorded on mobile phones in quiet indoor environments
Duration
100 hours
Languages
1
Sample Rate
16 kHz
Published
2017-01
Description
1855 speakers with 102,600 utterances total
2120 utterances per speaker
3Recorded on mobile phones in quiet indoor environments
4All utterances manually transcribed and proofread
5This dataset is a subset of a larger corpus
6CC BY-NC-ND 4.0 license
Language Details
| Language | Duration |
|---|---|
| Mandarin Chinese | 100 hours |
Publisher
Resources