Overview
This Open Source Mandarin Speech Corpus, AISHELL-ASR0009-OS1, is 178 hours long. It is a part of AISHELL-ASR0009, of which utterance contains 11 domains, including smart home, autonomous driving, and industrial production. The whole recording was put in quiet indoor environment, using 3 different devices at the same time: high fidelity microphone (44.1kHz, 16-bit,); Android-system mobile phone (16kHz, 16-bit), iOS-system mobile phone (16kHz, 16-bit). Audios in high fidelity were re-sampled to 16kHz to build AISHELL- ASR0009-OS1.400 speakers from different accent areas in China were invited to participate in the recording. The manual transcription accuracy rate is above 95%,through professional speech annotation and strict quality inspection. The corpus is divided into training, development and testing sets.
Data Format
/readme.txt
/SPEECHDATA
+—— /S0252
+—— /S0252_mic #高保真数据
+—— BAC009S0252W0001.wav
+—— BAC009S0252W0001.txt
Citation
Please use the following citation when referencing the dataset:
@article{DBLP:journals/corr/abs-1709-05522,
author = {Hui Bu and
Jiayu Du and
Xingyu Na and
Bengu Wu and
Hao Zheng},
title = {{AISHELL-1:} An Open-Source Mandarin Speech Corpus and {A} Speech
Recognition Baseline},
journal = {CoRR},
volume = {abs/1709.05522},
year = {2017},
url = {http://arxiv.org/abs/1709.05522},
archivePrefix = {arXiv},
eprint = {1709.05522},
timestamp = {Mon, 13 Aug 2018 16:46:31 +0200},
biburl = {https://dblp.org/rec/journals/corr/abs-1709-05522.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}