c++ - 使用 PortAudio 和 sndfile 播放立体声 .wav 文件,输出模糊且变慢/变慢
问题描述
我一直在编写一些代码来使用 PortAudio 和 sndfile 在 C++ 中播放立体声 .wav 文件,但是输出声音是模糊的并且向下倾斜(向下倾斜对我来说不是问题,但它可能是问题的一部分) . 看起来它似乎在播放部分垃圾数据,但我不相信我正在使用任何变量,这可能会发生在我没有先清除数据的情况下。我尝试过一些带有立体声播放的 PortAudios 示例,但由于输入来自 .wav 文件而不是生成的,我无法完美地遵循它。我还编译并运行了一些 PortAudio 示例(使用立体声)并且工作正常。我不确定问题出在哪里。
音频.h
struct AudioFile {
SNDFILE* file = nullptr;
SF_INFO info;
int buffer_size = 512;
int readHead = 0;
sf_count_t count = 1;
};
/*
Class for handling basic audio functions
*/
class Audio {
protected:
public:
/// Constructor
Audio();
/// Destructor
~Audio();
/// Load an audio file
AudioFile loadFile(const char* path);
/// Play an audio file
void playFile(AudioFile* file);
};
音频.cpp
/// Audio constructor
Audio::Audio() {
PaError err = Pa_Initialize();
if (err != paNoError) std::cerr << "PAError: " << err << std::endl;
#ifdef DEBUG
std::cout << "Initialising PortAudio" << std::endl;
std::cout << "----------------------" << std::endl;
std::cout << "Version: " << Pa_GetVersion << std::endl;
std::cout << "Devices:" << std::endl;
std::cout << "----------------------" << std::endl;
int numDevices = Pa_GetDeviceCount();
for (int i=0; i < numDevices; i++) {
auto deviceInfo = Pa_GetDeviceInfo(i);
std::cout << "Name: " << deviceInfo->name << std::endl;
std::cout << "HostApi: " << deviceInfo->hostApi << std::endl;
std::cout << "SampleRate: " << deviceInfo->defaultSampleRate << std::endl;
std::cout << "InputChannels: " << deviceInfo->maxInputChannels << std::endl;
std::cout << "OutputChannels: " << deviceInfo->maxOutputChannels << std::endl;
std::cout << "----------------------" << std::endl;
}
#endif
}
Audio::~Audio() {
PaError err = Pa_Terminate();
if (err != paNoError) std::cerr << "PAError: " << err << std::endl;
}
/* Loads an audiofile */
AudioFile Audio::loadFile(const char* path) {
AudioFile file;
::memset(&file.info, 0, sizeof(file.info));
file.file = sf_open(path, SFM_READ, &file.info);
return file;
}
static int patestCallback(const void* inputBuffer, void* outputBuffer,
unsigned long framesPerBuffer,
const PaStreamCallbackTimeInfo* timeInfo,
PaStreamCallbackFlags statusFlags, void* userData) {
/// Prevent warnings
(void)inputBuffer;
(void)timeInfo;
(void)statusFlags;
/// an AudioFile gets passed as userData
AudioFile* file = (AudioFile*)userData;
float* out = (float*)outputBuffer;
sf_seek(file->file, file->readHead, SF_SEEK_SET);
auto data = std::make_unique<float[]>(framesPerBuffer);
file->count = sf_read_float(file->file, data.get(), framesPerBuffer);
for (int i = 0; i < framesPerBuffer; i++) {
*out++ = data[i];
}
file->readHead += file->buffer_size;
if (file->count > 0) return paContinue;
else return paComplete;
}
void Audio::playFile(AudioFile* file) {
PaStream* stream = nullptr;
PaStreamParameters params;
params.device = Pa_GetDefaultOutputDevice();
params.channelCount = file->info.channels;
params.sampleFormat = paFloat32;
params.suggestedLatency =
Pa_GetDeviceInfo(params.device)->defaultLowOutputLatency;
params.hostApiSpecificStreamInfo = nullptr;
/// Check if params work
PaError err = Pa_IsFormatSupported(nullptr, ¶ms, file->info.samplerate);
if (err != paFormatIsSupported) {
std::cerr << "PAError: " << Pa_GetErrorText(err) << std::endl;
return;
}
err = Pa_OpenStream(&stream, nullptr, ¶ms, file->info.samplerate,
file->buffer_size * params.channelCount, paClipOff,
&patestCallback, file);
if (err != paNoError) std::cerr << "PAError: " << Pa_GetErrorText(err) << std::endl;
err = Pa_StartStream(stream);
if (err != paNoError)
std::cerr << "PAError: " << Pa_GetErrorText(err) << std::endl;
/// wait until file finishes playing
while (file->count > 0) {}
err = Pa_StopStream(stream);
if (err != paNoError)
std::cerr << "PAError: " << Pa_GetErrorText(err) << std::endl;
err = Pa_CloseStream(stream);
if (err != paNoError)
std::cerr << "PAError: " << Pa_GetErrorText(err) << std::endl;
}
我也尝试过不使用data
指针(使用它似乎会产生更清晰但仍然模糊的声音)并将音频文件按值传递给playFile
函数。任何帮助表示赞赏。
解决方案
最终弄清楚了,我有一个主要问题,在这里:
err = Pa_OpenStream(&stream, nullptr, ¶ms, file->info.samplerate,
file->buffer_size * params.channelCount, paClipOff,
&patestCallback, file);
我给了 Pa_OpenStream 缓冲区大小 * 通道数,但是我应该只是给它缓冲区大小,并直接在回调函数中对 framesPerBuffer 进行通道调整:
static int patestCallback(const void* inputBuffer, void* outputBuffer,
unsigned long framesPerBuffer,
const PaStreamCallbackTimeInfo* timeInfo,
PaStreamCallbackFlags statusFlags, void* userData) {
/// Prevent warnings
(void)inputBuffer;
(void)timeInfo;
(void)statusFlags;
/// an AudioFile gets passed as userData
velox::AudioFile* file = (velox::AudioFile*)userData;
float* out = (float*)outputBuffer;
sf_seek(file->file, file->readHead, SF_SEEK_SET);
auto data = std::make_unique<float[]>(framesPerBuffer * file->info.channels);
file->count = sf_read_float(file->file, data.get(),
framesPerBuffer * file->info.channels);
for (int i = 0; i < framesPerBuffer * file->info.channels; i++) {
*out++ = data[i];
}
file->readHead += file->buffer_size;
if (file->count > 0) return paContinue;
else return paComplete;
}
这种变化固定了音高和模糊,事后看来是有道理的。
推荐阅读
- programming-languages - Coursera 编程语言 hw2
- ios - 'NSInvalidArgumentException',原因:'+[FIRInstanceIDCheckinPreferences preferencesFromKeychainContents:]:'无法识别的选择器发送到类'
- python - 错误:找不到满足要求 tensorflow 的版本(来自版本:无)
- python - 在蛋白质序列中找到已识别模式的位置
- youtube-api - Youtube api 未找到视频
- rust - 在异步 Rust (Tokio) 中包装阻塞 mpsc
- django - Django ForeignKey 值没有对应的值
- node.js - 打字稿错误:Blob 不是构造函数
- python - 使用 huggingface fill-mask 管道来获得它没有建议的结果的“分数”
- android - 如何使用形状材质组件制作完整的圆形编辑文本?