Exploring Mode Connectivity for Pre-trained Language Models Yujia Qin1 Cheng Qian1 Jing Yi1 Weize Chen1 Yankai Lin23y Xu Han1 Zhiyuan Liu145yMaosong Sun145y Jie Zhou6
ExploringModeConnectivityforPre-trainedLanguageModelsYujiaQin1,ChengQian1,JingYi1,WeizeChen1,YankaiLin2;3y,XuHan1,ZhiyuanLiu1;4;5y,MaosongSun1;4;5y,JieZhou61NLPGroup,DCST,IAI,BNRIST,TsinghuaUniversity,Beijing2GaolingSchoolofArticialIntelligence,RenminUniversityofChina,Beijing3BeijingKeyLaborator...
2025-05-06
2.24MB 21 页 0
0
10玖币