100 Audio files (.wav) with spoken numbers (containing zero) of 100 different speakers

Question

I have been working on a machine learning project for speaker recognition. I need more audio files from different speakers to improve the accuracy of my algorithm*.

I'm using the files to recognize who is speaking. It's not a speech-to-text algorithm.

I'm using just digits (initially only zero) to simplify my task, because it's still a proof of concept.

*I tested my machine learning model with just 10 speakers and it gave me 55% accuracy. I want to add more samples to get a higher percentage.

Unfortunately, most numbers won't be replicated enough times to constitute a data set. — Sheakspear Zitouni, Apr 03 '18 at 09:49
I'm working on speaker recognition, but I'm using digits to simplify my task. I tested my machine learning model with just 10 speakers and it gave me 55% accuracy. I want to add more samples to get a higher percentage. — Sheakspear Zitouni, Apr 03 '18 at 10:42

score 0 · Answer 1 · answered Jun 08 '23 at 19:08


There are multiple datasets of spoken digits.

AudioMNIST

30,000 audio samples of spoken digits (0-9) from 60 different speakers. Download. Paper

Free Spoken Digit Dataset (FSDD)

5 speakers, 2,500 recordings (50 of each digit per speaker). Download
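Since the question is about recognizing the speaker from recordings of the digit zero, here is a minimal sketch of how such files could be filtered and grouped by speaker. It assumes the files are named `<digit>_<speaker>_<index>.wav` (for example `0_jackson_12.wav`), which is my understanding of the naming in these datasets; check the README of whichever dataset you download. The helper names `parse_name` and `files_by_speaker` are made up for this illustration.

```python
from collections import defaultdict
from pathlib import Path


def parse_name(path):
    """Split '<digit>_<speaker>_<index>.wav' into (digit, speaker, index).

    The naming pattern is an assumption about the dataset layout.
    """
    stem = Path(path).stem
    digit, rest = stem.split("_", 1)        # leading field: spoken digit
    speaker, index = rest.rsplit("_", 1)    # trailing field: recording index
    return int(digit), speaker, int(index)


def files_by_speaker(paths, digit=0):
    """Group the recordings of one digit by speaker label."""
    groups = defaultdict(list)
    for p in paths:
        d, speaker, _ = parse_name(p)
        if d == digit:
            groups[speaker].append(str(p))
    return dict(groups)
```

With a download unpacked into a folder (the folder name `recordings` here is only a placeholder), `files_by_speaker(Path("recordings").rglob("*.wav"))` returns a dict that maps each speaker label to that speaker's recordings of "zero", which can then serve as the class labels for a speaker classifier.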


Jon Nordby
