Improving Policy Learning via Language Dynamics Distillation
Victor Zhong1,2, Jesse Mu3, Luke Zettlemoyer1,2, Edward Grefenstette4,5 and Tim Rocktäschel4
1 University of Washington, 2 Meta AI Research, 3 Stanford University, 4 University College London, 5 Cohere
Abstract
Recent work has shown that augmenting environments with language descriptions...
2025-05-08