graviti
Products
Resources
About us
AISHELL1
Audio
NLP
|...
License: Apache-2.0

Overview

This Open Source Mandarin Speech Corpus, AISHELL-ASR0009-OS1, is 178 hours long. It is
a part of AISHELL-ASR0009, of which utterance contains 11 domains, including smart home, autonomous
driving, and industrial production. The whole recording was put in quiet indoor environment,
using 3 different devices at the same time: high fidelity microphone (44.1kHz, 16-bit,);
Android-system mobile phone (16kHz, 16-bit), iOS-system mobile phone (16kHz, 16-bit).
Audios in high fidelity were re-sampled to 16kHz to build AISHELL- ASR0009-OS1.400 speakers from
different accent areas in China were invited to participate in the recording. The manual
transcription accuracy rate is above 95%,through professional speech annotation and strict quality
inspection. The corpus is divided into training, development and testing sets.

Data Format

/readme.txt

/SPEECHDATA

​ +—— /S0252

​ +—— /S0252_mic #高保真数据

​ +—— BAC009S0252W0001.wav

​ +—— BAC009S0252W0001.txt

Citation

Please use the following citation when referencing the dataset:

@article{DBLP:journals/corr/abs-1709-05522,
  author    = {Hui Bu and
               Jiayu Du and
               Xingyu Na and
               Bengu Wu and
               Hao Zheng},
  title     = {{AISHELL-1:} An Open-Source Mandarin Speech Corpus and {A} Speech
               Recognition Baseline},
  journal   = {CoRR},
  volume    = {abs/1709.05522},
  year      = {2017},
  url       = {http://arxiv.org/abs/1709.05522},
  archivePrefix = {arXiv},
  eprint    = {1709.05522},
  timestamp = {Mon, 13 Aug 2018 16:46:31 +0200},
  biburl    = {https://dblp.org/rec/journals/corr/abs-1709-05522.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

License

Apache-2.0

Data Summary
Type
Audio,
Amount
--
Size
14.51GB
Provided by
AISHELL
Aishell is an innovative company focusing on artificial intelligence, big data and technical services.
| Amount -- | Size 14.51GB
AISHELL1
Audio
NLP
License: Apache-2.0

Overview

This Open Source Mandarin Speech Corpus, AISHELL-ASR0009-OS1, is 178 hours long. It is
a part of AISHELL-ASR0009, of which utterance contains 11 domains, including smart home, autonomous
driving, and industrial production. The whole recording was put in quiet indoor environment,
using 3 different devices at the same time: high fidelity microphone (44.1kHz, 16-bit,);
Android-system mobile phone (16kHz, 16-bit), iOS-system mobile phone (16kHz, 16-bit).
Audios in high fidelity were re-sampled to 16kHz to build AISHELL- ASR0009-OS1.400 speakers from
different accent areas in China were invited to participate in the recording. The manual
transcription accuracy rate is above 95%,through professional speech annotation and strict quality
inspection. The corpus is divided into training, development and testing sets.

Data Format

/readme.txt

/SPEECHDATA

​ +—— /S0252

​ +—— /S0252_mic #高保真数据

​ +—— BAC009S0252W0001.wav

​ +—— BAC009S0252W0001.txt

Citation

Please use the following citation when referencing the dataset:

@article{DBLP:journals/corr/abs-1709-05522,
  author    = {Hui Bu and
               Jiayu Du and
               Xingyu Na and
               Bengu Wu and
               Hao Zheng},
  title     = {{AISHELL-1:} An Open-Source Mandarin Speech Corpus and {A} Speech
               Recognition Baseline},
  journal   = {CoRR},
  volume    = {abs/1709.05522},
  year      = {2017},
  url       = {http://arxiv.org/abs/1709.05522},
  archivePrefix = {arXiv},
  eprint    = {1709.05522},
  timestamp = {Mon, 13 Aug 2018 16:46:31 +0200},
  biburl    = {https://dblp.org/rec/journals/corr/abs-1709-05522.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

License

Apache-2.0

0
Start building your AI now
graviti
wechat-QR
Long pressing the QR code to follow wechat official account

Copyright@Graviti