Japanese Speech Data - Scripted Monologue
Domain The business or industry verticals that characterize this dataset
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
Locale The language(s) and country(s) applicable to the speakers in the dataset.
Age Age range of the speakers in the dataset.
78% | 22% | 0%
Female | Male | Unspecified Percentage of female and male speakers in the dataset.
Mar 22, 2021
Published date Date the dataset was published.
Seller Name Entity or individual making the dataset available for sale.
Data source Source entity providing the dataset
License type Agreement defining the data being licensed, including the usage, manner, and frequency in which data will be provided or updated.
Acoustic Modelling, ASR Testing, Benchmarking
A zip file containing metadata files in tsv format and a folder with all the audio files
This dataset contains 102 hours of Japanese Scripted Monologue data, recorded from speakers in Japan.
|Domain The business or industry verticals that characterize this dataset||Generic|
|Total recordings Total audio recordings in the dataset.||88271|
|File size Dataset file size.||11.04GB|
|Hours Total number of hours in the dataset.||102|
|Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.||0.3%|
|Total prompts Total number of prompts read by speakers in the dataset.||88271|
|Unique prompts Total number of unique prompts read by speakers in the dataset.||46277|
|Average amount of recordings per speaker Average amount of audio recordings per speaker in the dataset.||388.86|
|Number of speakers Total number of unique speakers in the dataset.||227|
|Locale The language(s) and country(s) applicable to the speakers in the dataset.||ja-jp|
|Language Language(s) spoken in the audio files included in the dataset.||Japanese|
|Country The country where the speakers reside and are represented in the audio.||Japan|
|Female | Male | Unspecified Percentage of female and male speakers in the dataset.||78.41% | 21.59% | 0.00%|
|Accent(s) Accents spoken by speakers in the dataset.||Aichi, Chiba, Fukuoka, Fukushima, Gifu, Hiroshima, Hokkaido, Hyogo, Ishikawa, Kochi, Kyoto, Oita, Osaka, Shizuoka, Tokushima, Tokyo, Yamaguchi, Yamanashi, Akita, Aomori, Ehime, Ibaraki, Iwate, Kagawa, Kanagawa, Kumamoto, Mie, Miyagi, Miyazaki, Nagano, Nagasaki, Nara, Niigata, Okayama, Okinawa, Saga, Saitama, Toyama|
Audio details Additional audio information.
Recording environment Acoustic environment in which the recordings were made.silent
Audio format File format of audio in the dataset.WAV
Bits per sample Number of bits used to represent the audio data for each sample of playback.16
Device type Type of device the audio was recorded on.mobile
Communication band Channel bandwidth used during the recording of the audio.broadband
Sample rate Number of samples per second for audio data.16kHz
Sample demo Audio clips from the dataset that you can listen to.Download sample