Reward Imputation with Sketching for Contextual Batched Bandits

Xiao Zhang1,2, Ninglu Shao1,2,∗, Zihua Si1,2,∗, Jun Xu1,2,†, Wenhan Wang3, Hanjing Su3, Ji-Rong Wen1,2

1 Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China
2 Beijing Key Laboratory of Big Data Management and Analysis Methods, Beijing, China
3 Tencen...
2025-04-24