RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising, 2018.
Regret bounds and regimes of optimality for user-user and item-item collaborative filtering, 2018 Information Theory and Applications Workshop, 2018.
A fast bandit algorithm for recommendations to users with heterogeneous tastes, Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, ser. AAAI'13, pp.1135-1141, 2013.
Bandits and recommender systems, Revised Selected Papers of the First International Workshop on Machine Learning, vol.9432, pp.325-336, 2015.
URL: https://hal.archives-ouvertes.fr/hal-01256033
Adaptive ε-greedy exploration in reinforcement learning based on value differences, Proceedings of the 33rd Annual German Conference on Advances in Artificial Intelligence, ser. KI'10, pp.203-210, 2010.
Finite-time analysis of the multiarmed bandit problem, Machine Learning, 2002.
The Nonstochastic Multiarmed Bandit Problem, SIAM Journal on Computing, 2003.
On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems, 2008.
URL: https://hal.archives-ouvertes.fr/hal-00281392
OpenAI Gym, 2016.
Multi-armed bandit, dynamic environments and meta-bandits, Environments, pp.1-14, 2006.
URL: https://hal.archives-ouvertes.fr/hal-00113668
Comparing accuracy of cosine-based similarity and correlation-based similarity algorithms in tourism recommender systems, 4th IEEE International Conference on Management of Innovation and Technology, pp.469-474, 2008.
Mining of massive datasets, 2014.
Recommender systems, 2016.