Description

1Accent-free Mandarin speech recorded by 50 speakers in a quiet environment

2Training set: 10,000 utterances (30 speakers), validation set: ~900 utterances, test set: 2,495 utterances (10 speakers)

3Includes language model, pronunciation dictionary, and Kaldi-based baseline system

4Completely free for academic use

5Recorded in 2000-2001, publicly released in 2015

Language Details

Language	Duration
Mandarin Chinese	30 hours

Publisher

Center for Speech and Language Technologies (CSLT)Tsinghua University

Resources

THCHS-30