torchaudio.models.hubert_base¶
- torchaudio.models.hubert_base(encoder_projection_dropout: float = 0.1, encoder_attention_dropout: float = 0.1, encoder_ff_interm_dropout: float = 0.0, encoder_dropout: float = 0.1, encoder_layer_drop: float = 0.05, aux_num_out: Optional[int] = None) Wav2Vec2Model[source]¶
根據 HuBERT 構建“base”
HuBERT模型 [Hsu 等, 2021]- 引數:
encoder_projection_dropout (float) – 參見
wav2vec2_model()。encoder_attention_dropout (float) – 參見
wav2vec2_model()。encoder_ff_interm_dropout (float) – 參見
wav2vec2_model()。encoder_dropout (float) – 參見
wav2vec2_model()。encoder_layer_drop (float) – 參見
wav2vec2_model()。aux_num_out (int 或 None, 可選) – 參見
wav2vec2_model()。
- 返回:
結果模型。
- 返回型別: