This paper studies the problem of online combinatorial prediction problem with input from strategic experts, where the learner not only tries to maximize a submodular utility function under bandit or semi-bandit feedback, but also try to make the experts truthfully reporting their true predictions in the sense that being honest maximizes the probability of being chosen.

When only bandit feedback on the overall utility is available, the algorithm ensures an -regret of via an Online Mirror Descent with -Tsallis entropy regularizer using the standard one-point gradient estimator.
When semi-bandit feedback on the losses of those chosen experts are available, the -regret slightly improved -- although the dependency on is still , some of the terms replaced with or .