A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Using Contextual Bandits with Behavioral Constraints for Constrained Online Movie Recommendation
2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
AI systems that learn through reward feedback about the actions they take are increasingly deployed in domains that have significant impact on our daily life. In many cases the rewards should not be the only guiding criteria, as there are additional constraints and/or priorities imposed by regulations, values, preferences, or ethical principles. We detail a novel online system, based on an extension of the contextual bandits framework, that learns a set of behavioral constraints by observation
doi:10.24963/ijcai.2018/843
dblp:conf/ijcai/BalakrishnanBMR18
fatcat:chmji4iilrchdkzqknxhfij5yi