1 Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
1Self-SupervisedTrainingofSpeakerEncoderwithMulti-ModalDiversePositivePairsRuijieTao,StudentMember,IEEE,,KongAikLee,SeniorMember,IEEE,RohanKumarDas,SeniorMember,IEEE,VilleHautam¨aki,Member,IEEE,andHaizhouLi,Fellow,IEEEAbstractWestudyanovelneuralarchitectureanditstrainingstrategiesofspeakerencoderfo...
2025-04-30
1.9MB 13 页 0
0
10玖币