
How does the Microsoft speech recognition API compare with the Google Cloud Speech API in terms of recognition accuracy (e.g., word error rate (WER), character error rate (CER), or sentence error rate (SER))?

I'm also interested in other online speech recognition APIs.
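
To make the comparison concrete, here is a minimal sketch of how the three metrics could be computed from reference transcripts and the transcripts returned by an API. It assumes whitespace tokenization and no text normalization (casing, punctuation), which real evaluations usually add; the function names are only illustrative and do not come from any particular toolkit.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or token lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(references, hypotheses):
    """Word error rate: word-level edits divided by the number of reference words."""
    errors = sum(edit_distance(r.split(), h.split())
                 for r, h in zip(references, hypotheses))
    total = sum(len(r.split()) for r in references)
    return errors / total


def cer(references, hypotheses):
    """Character error rate: character-level edits divided by reference characters."""
    errors = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total = sum(len(r) for r in references)
    return errors / total


def ser(references, hypotheses):
    """Sentence error rate: fraction of sentences with at least one word error."""
    wrong = sum(r.split() != h.split() for r, h in zip(references, hypotheses))
    return wrong / len(references)


if __name__ == "__main__":
    # Toy data, made up for illustration only (not real API output).
    refs = ["the cat sat on the mat", "hello world"]
    hyps = ["the cat sat on mat", "hello world"]
    print("WER:", wer(refs, hyps))
    print("CER:", cer(refs, hyps))
    print("SER:", ser(refs, hyps))
```

Running the same references through each API and timestamping the results would give comparable numbers, with the caveat that the scores depend heavily on the test set and on the normalization applied.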


I have crossposted the question at:

Franck Dernoncourt
  • This is clearly a "rapidly changing event", so any answer would become obsolete every week or so. :-) – Be Brave Be Like Ukraine Oct 28 '17 at 00:17
  • @bytebuster we could timestamp the evaluations :) – Franck Dernoncourt Oct 28 '17 at 00:20
  • What about those built into Chrome and Android? FWIW I bet the difference will be in speed, price, languages, and things like robustness to background noise and other unlanguage that a benchmark may not capture. – Adam Bittlingmayer Oct 28 '17 at 21:39
  • @A.M.Bittlingmayer I'm also interested in those, but to a lesser extent, as they cannot be used in arbitrary applications, unlike online APIs or open-source ASR systems. Some other factors: speaker accents and the ability to take context into account. It's hard to benchmark indeed :) – Franck Dernoncourt Oct 28 '17 at 22:30
  • @FranckDernoncourt I have one data pipeline that invokes Chrome instances programmatically for some other Chrome functionality. It has bad latency but good enough throughput, i.e., good for preprocessing and data generation but not viable for live requests. – Adam Bittlingmayer Oct 29 '17 at 07:18

0 Answers