Name: AISHELL-5
Creator: ASLP, Northwestern Polytechnical University; Beijing AISHELL Technology Co., Ltd.; Li Auto Inc.
Published: 2025-05
License: CC BY-SA 4.0

Description

1Recorded in a hybrid vehicle with far-field microphones placed at the front and each speaker wearing a high-fidelity close-talk microphone

2165 speakers participated, without noticeable accents

32-4 speakers randomly seated at four positions in the car, engaging in unrestricted free conversations

4Over 100 hours total: 94h training, 3.3h validation, two test sets

5Far-field audio contains 4 channels; training set additionally includes close-talk audio

6Also provides a large-scale noise dataset for speech simulation research

7CC BY-SA 4.0 license

Language Details

Language	Duration
Mandarin Chinese	100 hours

Publisher

ASLPNorthwestern Polytechnical University; Beijing AISHELL Technology Co.Ltd.; Li Auto Inc.

License & Commercial Use

Resources

Paperhttps://arxiv.org/abs/2505.23036 OpenSLRhttps://www.openslr.org/159/Official Pagehttps://www.aishelltech.com/aishell_5