https://m.facebook.com/groups/107107546348803?view=permalink&id=413934485666106

As a lot of you read, Baidu has released their paper on speech-to-text called DeepSpeech. As written in paper, their end-to-end architecture offers 7x speed-up over previous architectures. And as I understand - sets the new state-of-the-art.

Papers are fun, but without data and code their hard to implement for a lot of individuals.

Great thing is that mozilla open-sourced DeepSpeech model that is implemented in Tensorflow. So now all of us can use it, tweak it, train it.

Gihub: https://github.com/mozilla/DeepSpeech
Blog: https://svail.github.io/mandarin/
Paper: http://jmlr.org/proceedings/papers/v48/amodei16.pdf
Posted by uniqueone
,