Javascript 谷歌的语音识别技术有API吗?
声明:本页面是StackOverFlow热门问题的中英对照翻译,遵循CC BY-SA 4.0协议,如果您需要使用它,必须同样遵循CC BY-SA许可,注明原文地址和作者信息,同时你必须将它归于原作者(不是我):StackOverFlow
原文地址: http://stackoverflow.com/questions/8830203/
Warning: these are provided under cc-by-sa 4.0 license. You are free to use/share it, But you must attribute it to the original authors (not me):
StackOverFlow
Is there an API for Google's speech recognition technology?
提问by Leo Jiang
I want to try creating a jQuery slideshow using simple voice commands like "next" or "previous". Is there a way to use Google's voice recognition? I know about Chrome's x-webkit-speech, but I have to click a button to use it.
我想尝试使用简单的语音命令(如“下一个”或“上一个”)创建 jQuery 幻灯片。有没有办法使用谷歌的语音识别?我知道 Chrome 的 x-webkit-speech,但我必须单击一个按钮才能使用它。
I tried MIT's WAMI, but I found it slower and less accurate than Google's speech cognition.
我尝试了 MIT 的 WAMI,但我发现它比 Google 的语音认知更慢,更不准确。
采纳答案by Tim Perry
As of today this now exists, for Chrome: http://chrome.blogspot.co.uk/2013/01/hello-browser.html
截至今天,Chrome 浏览器已存在:http: //chrome.blogspot.co.uk/2013/01/hello-browser.html
(api doc)
( API 文档)
回答by Tal Ater
回答by Michael Levy
The issue is what will capture your user's speech and send it some speech server for processing. x-webkit-speech is built into the browser and the browser uses the PC's sound card to capture speech. Javascript/Jquery in a browser alone cannot access the hardware devices. I do not know if Chrome exposes any x-webkit-speech objects in the DOM so you can do capture through Javascript.
问题是什么将捕获用户的语音并将其发送到一些语音服务器进行处理。x-webkit-speech 内置于浏览器中,浏览器使用 PC 的声卡来捕获语音。单靠浏览器中的 Javascript/Jquery 无法访问硬件设备。我不知道 Chrome 是否会在 DOM 中公开任何 x-webkit-speech 对象,以便您可以通过 Javascript 进行捕获。
That said, people have reverse engineered the Google speech api and used it from code. See http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/for an example.
也就是说,人们已经对 Google 语音 api 进行了逆向工程,并从代码中使用了它。有关示例,请参见http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/。
回答by Roger
I found this, I have not personally used it do I don't know if it is entirely what you are looking for but take a look.
我找到了这个,我没有亲自使用它我不知道它是否完全是你要找的,但看看吧。
http://captico.com/introducing-speech-2-text-api-by-google/2011/03
http://captico.com/introducing-speech-2-text-api-by-google/2011/03