Feedback Information on Cumulative Payoff in a Bandit Experiment: Meaningful Learning in Weighted Voting

Kazuhito Ogawa, Naoki Watanabe. Feedback Information on Cumulative Payoff in a Bandit Experiment: Meaningful Learning in Weighted Voting. In Shusaku Tsumoto, Yukio Ohsawa, Lei Chen 0002, Dirk Van den Poel, Xiaohua Hu 0001, Yoichi Motomura, Takuya Takagi, Lingfei Wu, Ying Xie, Akihiro Abe, Vijay Raghavan 0001, editors, IEEE International Conference on Big Data, Big Data 2022, Osaka, Japan, December 17-20, 2022. pages 3295-3299, IEEE, 2022. [doi]

Abstract

Abstract is missing.