Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions Haanvid Lee1 Jongmin Lee2 Yunseon Choi1 Wonseok Jeony
LocalMetricLearningforOff-PolicyEvaluationinContextualBanditswithContinuousActionsHaanvidLee1,JongminLee2,YunseonChoi1,WonseokJeony,Byung-JunLee3;4,Yung-KyunNoh5;6,Kee-EungKim11KAIST,2UCBerkeley,3KoreaUniv.,4GaussLabsInc.,5HanyangUniv.,6KIAShaanvid@kaist.ac.kr,jongmin.lee@berkeley.edu,cys9506@kaist....
2025-05-02
1.03MB 29 页 0
0
10玖币