Is a labeled dataset of spoken digits, that means of people saying "zero", "one", "two", three", "four", "five", "six", "seven", "eight", or "nine" available?
I would also be interested in such a dataset in another language.
I want to try some speech recognition algorithms. This means the dataset should be audio files which were created by recording humans saying those digits.