Towards a Theoretical Foundation of Policy Optimization for Learning
TowardsaTheoreticalFoundationofPolicyOptimizationforLearningControlPoliciesBinHu1,KaiqingZhang2,NaLi3,MehranMesbahi4,MaryamFazel5,andTamerBasar61CSL&ECE,UniversityofIllinoisatUrbana-Champaign,IL,USA,61801;email:binhu7@illinois.edu2LIDS&CSAIL,MassachusettsInstituteofTechnology,Cambridge,MA,USA,02139...
2025-05-06
2.55MB 35 页 0
0
10玖币