Question
I'm playing with speech recognition. Is it possible to split speech into individual words?
If so, please recommend a library that supports splitting speech into words.
Thanks.
Recommended answer
If you know what the speaker said, you can perform forced alignment to generate word (or phoneme) time alignments. Toolkits such as CMU Sphinx, HTK, and Kaldi can do this. If you don't know what the speaker said, you can run standard speech recognition and use the timing information in its output to obtain the word boundaries, although the recognition output may contain errors.
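Once you have word timings from either approach, cutting the audio itself is simple. Here is a minimal sketch in plain Python, not tied to any of the toolkits above. The `WordTiming` type, the `split_by_word_timings` function, and the example timings are all hypothetical; a real aligner or recognizer will report timings in its own format, which you would need to convert first.

```python
from typing import List, NamedTuple, Sequence, Tuple


class WordTiming(NamedTuple):
    """One aligned word; times are in seconds from the start of the audio.

    Hypothetical container: real toolkits report timings in their own formats.
    """
    word: str
    start: float
    end: float


def split_by_word_timings(
    samples: Sequence[int],
    sample_rate: int,
    timings: Sequence[WordTiming],
) -> List[Tuple[str, Sequence[int]]]:
    """Cut a mono sample sequence into one chunk per aligned word."""
    chunks = []
    total = len(samples)
    for t in timings:
        # Convert seconds to sample indices and clamp to the audio length.
        first = max(0, min(total, int(round(t.start * sample_rate))))
        last = max(first, min(total, int(round(t.end * sample_rate))))
        chunks.append((t.word, samples[first:last]))
    return chunks


# Made-up timings standing in for real aligner or recognizer output.
example_timings = [
    WordTiming("hello", 0.00, 0.50),
    WordTiming("world", 0.60, 1.00),
]
# One second of silent 16 kHz mono audio as placeholder data.
example_samples = [0] * 16000
example_chunks = split_by_word_timings(example_samples, 16000, example_timings)
```

Each entry in `example_chunks` pairs a word with its slice of samples. If the timings come from recognition rather than forced alignment, any misrecognized word will carry a wrong label or boundary, so it may be worth checking the chunks by ear.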