当前位置：首页 > 文章列表 > Golang > Go问答 > 使用 Go 从麦克风中提取 WAV 音频的方法

使用 Go 从麦克风中提取 WAV 音频的方法

来源：stackoverflow 2024-02-09 12:45:24 0浏览收藏

从现在开始，努力学习吧！本文《使用 Go 从麦克风中提取 WAV 音频的方法》主要讲解了等等相关知识点，我会在golang学习网中持续更新相关的系列文章，欢迎大家关注并积极留言建议。下面就先一起来看一下本篇正文内容吧，希望能帮到你！

问题内容

我的程序使用 Vosk 语音识别库的 Go 绑定，它将音频作为 WAV 单声道音频的字节切片进行接收。我的程序当前使用外部命令 arecord 从麦克风获取 WAV 音频，但我更喜欢在 Go 中正确执行此操作，并且最好没有任何共享库依赖项。

我尝试使用 malgo 包，但卡在如何将麦克风中的原始音频转换为 WAV 上。我发现 WAV 编码包只能写入文件 (io.WriteSeeker)，但我需要转换来自麦克风的连续流以进行实时语音识别。

至少是Linux

正确答案

我最终也使用了 malgo ，以及 malgo.formats16。

在此回调中生成字节：

// https://github.com/gen2brain/malgo/blob/master/_examples/capture/capture.go
    onrecvframes := func(psample2, psample []byte, framecount uint32) {
        // empirically, len(psample) is 480, so for sample rate 44100 it's triggered about every 10ms.
        // samplecount := framecount * deviceconfig.capture.channels * sizeinbytes
        psampledata = append(psampledata, psample...)
    }

我可以将其转换为 int （为此使用 gpt-4）：

func twobytedatatointslice(audiodata []byte) []int {
    intdata := make([]int, len(audiodata)/2)
    for i := 0; i < len(audiodata); i += 2 {
        // convert the pcapturedsamples byte slice to int16 slice for formats16 as we go
        value := int(binary.littleendian.uint16(audiodata[i : i+2]))
        intdata[i/2] = value
    }
    return intdata
}

然后使用 "github.com/go-audio/wav" 生成内存中的 wav 字节（gpt-4 再次创建了内存中文件系统 hack 来克服 io.writeseeker 要求）

// Create an in-memory file to support io.WriteSeeker needed for NewEncoder which is needed for finalizing headers.
    inMemoryFilename := "in-memory-output.wav"
    inMemoryFile, err := fs.Create(inMemoryFilename)
    dbg(err)
    // We will call Close ourselves.

    // Convert audio data to IntBuffer
    inputBuffer := &audio.IntBuffer{Data: intData, Format: &audio.Format{SampleRate: iSampleRate, NumChannels: iNumChannels}}

    // Create a new WAV wavEncoder
    bitDepth := 16
    audioFormat := 1
    wavEncoder := wav.NewEncoder(inMemoryFile, iSampleRate, bitDepth, iNumChannels, audioFormat)

我在尝试将您想要的东西组合在一起时开发了这些片段 - 流式语音助手 [wip] https://github.com/petrzlen/vocode-golang

以上就是本文的全部内容了，是否有顺利帮助你解决问题？若是能给你带来学习上的帮助，请大家多多支持golang学习网！更多关于Golang的相关知识，也可关注golang学习网公众号。

版本声明

本文转载于：stackoverflow 如有侵犯，请联系study_golang@163.com删除