torchaudio.functional.detect_pitch_frequency

torchaudio.functional.detect_pitch_frequency(waveform: Tensor, sample_rate: int, frame_time: float = 0.01, win_length: int = 30, freq_low: int = 85, freq_high: int = 3400) → Tensor

Detect pitch frequency.

It is implemented using the normalized cross-correlation function (NCCF) and median smoothing.

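Parameters:

waveform (Tensor) – Tensor of audio of dimension (…, freq, time)
sample_rate (int) – The sample rate of the waveform (Hz)
frame_time (float, optional) – Duration of a frame in seconds (Default: 10 ** (-2)).
win_length (int, optional) – The window length for median smoothing, in number of frames (Default: 30).
freq_low (int, optional) – Lowest frequency that can be detected (Hz) (Default: 85).
freq_high (int, optional) – Highest frequency that can be detected (Hz) (Default: 3400).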

Returns:

Tensor of frequencies of dimension (…, frame)

Return type:

Tensor

Tutorials using detect_pitch_frequency: Audio Feature Extractions
