torchaudio.functional.rtf_power¶

torchaudio.functional.rtf_power(psd_s: Tensor, psd_n: Tensor, reference_channel: Union[int, Tensor], n_iter: int = 3, diagonal_loading: bool = True, diag_eps: float = 1e-07) → Tensor[原始碼]¶

使用冪方法估計相對傳遞函式 (RTF) 或導向向量。

引數：

psd_s (torch.Tensor) – 目標語音的復值功率譜密度 (PSD) 矩陣。維度為 (…, freq, channel, channel) 的張量。
psd_n (torch.Tensor) – 噪聲的復值功率譜密度 (PSD) 矩陣。維度為 (…, freq, channel, channel) 的張量。
reference_channel (int or torch.Tensor) – 指定參考通道。如果 dtype 為 int，則表示參考通道索引。如果 dtype 為 torch.Tensor，其形狀為 (…, channel)，其中 channel 維度為 one-hot。
diagonal_loading (bool, optional) – 如果為 True，則對 psd_n 應用對角載入。 (預設值: True)
diag_eps (float, optional) – 用於對角載入的與單位矩陣相乘的係數。僅當 diagonal_loading 設定為 True 時有效。 (預設值: 1e-7)

返回：

估計的目標語音的復值 RTF。維度為 (…, freq, channel) 的張量。

返回型別：

torch.Tensor

文件