从我的演讲中识别出一个特定的单词

Question

我想实现一个可以在人后重复某个单词的功能。例如：“说娃娃。”可以使用任何其他词来代替娃娃。重点是助理要准确地重复“doll”这个词

我想用 vosk 模块来实现这个。我该怎么做？

Answer 1

import vosk

model = vosk.Model("path_to_your_model")  # Specify the correct path to your Vosk model

def recognize_and_repeat(target_word, audio_input):
    rec = vosk.KaldiRecognizer(model, 16000)
    
    while True:
        if rec.AcceptWaveform(audio_input):
            result = rec.Result()
            recognized_text = result.split('"text" : ')[-1].split('"')[1]  # Extracting the recognized text
            if recognized_text == target_word:
                print(f"You said: '{target_word}'")
                break

在“audio_input”中，插入麦克风录音中的实际音频，就像我在上面的代码中所做的那样。在 target_word 位置输入您要重复的单词。该方法“rec。 AcceptWaveform(audio_input)”准备audio_input，并确保它在代码中按照您的预期被接受和处理。确保音频输入的格式和采样率正确，建议采样率为 16000 Hz。

我希望这能很好地回答你的问题。

从我的演讲中识别出一个特定的单词

问题描述投票：0回答：1

1个回答

最新问题

从我的演讲中识别出一个特定的单词

问题描述 投票：0回答：1

1个回答

最新问题

问题描述投票：0回答：1