Catalog

English Scripted Monologue data - 200 hours

Overview

Banking, Insurance, Retail, Telecommunication
Domain The business or industry verticals that characterize this dataset
0.2%
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
en-gb
Locale The language(s) and country(s) applicable to the speakers in the dataset.
18-77
Age Age range of the speakers in the dataset.
67% | 33% | 0% | 0%
Female | Male | Unspecified Percentage of female and male speakers in the dataset.
$30,000.00
Version Number
01
Hours
200
Purchase Options
Sender
Invitee
Apr 8, 2021
Published date Date the dataset was published.
DefinedData
Seller Name Entity or individual making the dataset available for sale.
DefinedData
Data source Source entity providing the dataset
License type Agreement defining the data being licensed, including the usage, manner, and frequency in which data will be provided or updated.

Use case(s)

mobile speech

Model Applications

Acoustic Modelling, ASR Testing, Benchmarking

Packaging description

A zip file containing metadata files in tsv format and a folder with all the audio files
This dataset contains 200 hours of English Scripted Monologue data, recorded from speakers in Great Britain.

Dataset details

About

Domain The business or industry verticals that characterize this dataset Banking, Insurance, Retail, Telecommunication
Total recordings Total audio recordings in the dataset. 99849
File size Dataset file size. 21.55GB
Hours Total number of hours in the dataset. 200
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced. 0.2%
Total prompts Total number of prompts read by speakers in the dataset. 99849
Unique prompts Total number of unique prompts read by speakers in the dataset. 68035
Average amount of recordings per speaker Average amount of audio recordings per speaker in the dataset. 178.94

Demographic

Number of speakers Total number of unique speakers in the dataset. 558
Locale The language(s) and country(s) applicable to the speakers in the dataset. en-gb
Language Language(s) spoken in the audio files included in the dataset. English
Country The country where the speakers reside and are represented in the audio. Great Britain
Female | Male | Unspecified Percentage of female and male speakers in the dataset. 67.2% | 32.62% | 0.18% | 0.00%
Accent(s) Accents spoken by speakers in the dataset. English - East and Central Midlands (Cambridge, Leicester, Nottingham), English - East Anglia (Norfolk, Ipswich), English - Geordie (Newcastle, Sunderland, Northumberland), English - Hampshire/Wiltshire, English - London and Greater London/Surrey, English - Mancunian (Manchester and Greater Manchester), English - Scouse/Northwestern (Liverpool, Lancashire, Blackpool), English - Southwestern (Devon, Cornwall), English - Sussex (East/West), English - West Country (Bristol, Gloucester, Somerset), English - West Midlands (Birmingham, Coventry), English - Yorkshire (Sheffield, Leeds, Middlesbrough), Irish - Belfast/East Ulster, Irish - Derry/West Ulster, Irish - Western (Limerick, Galway), Scottish - Aberdeen/Northern Lowlands, Scottish - Edinburgh-Dundee, Scottish - Glasgow-Stirling, Scottish - Highlands/Orkneys, Welsh - Northern (Wrexham, Anglesey), Welsh - Southern (Cardiff, Newport), Welsh - Western (Swansea, Pembrokeshire)

Phonetic Distribution

Phonetic Distribution
Phonetic Distribution

Age Distribution

Age Distribution
Age Distribution

Gender distribution

Gender distribution
Gender distribution

Accent Distribution

Accent Distribution
Accent Distribution

Audio details Additional audio information.

  • Words
    1742623
  • Recording environment Acoustic environment in which the recordings were made.
    noisy, silent
  • Audio format File format of audio in the dataset.
    WAV
  • Bits per sample Number of bits used to represent the audio data for each sample of playback.
    16
  • Device type Type of device the audio was recorded on.
    mobile
  • Communication band Channel bandwidth used during the recording of the audio.
    broadband
  • Sample rate Number of samples per second for audio data.
    16kHz

Sample

Sample demo Audio clips from the dataset that you can listen to.
Name Duration Age Gender Native Language
1 Audio sample 1 00:00:07.636 35 Female en-gb