Shie Mannor
Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research
Verified email at technion.ac.il - Homepage
Title
Cited by
Year
A Tutorial on the Cross-Entropy Method
PT De Boer, DP Kroese, S Mannor, RY Rubinstein
Annals of Operations Research 134 (1), 19-67, 2005
Cited by: 3257* | Year: 2005
The kernel recursive least-squares algorithm
Y Engel, S Mannor, R Meir
IEEE Transactions on signal processing 52 (8), 2275-2285, 2004
Cited by: 1233 | Year: 2004
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems.
E Even-Dar, S Mannor, Y Mansour, S Mahadevan
Journal of machine learning research 7 (6), 2006
Cited by: 733 | Year: 2006
Robustness and Regularization of Support Vector Machines.
H Xu, C Caramanis, S Mannor
Journal of machine learning research 10 (7), 2009
Cited by: 582 | Year: 2009
Bayesian reinforcement learning: A survey
M Ghavamzadeh, S Mannor, J Pineau, A Tamar
Foundations and Trends® in Machine Learning 8 (5-6), 359-483, 2015
Cited by: 536 | Year: 2015
Reward constrained policy optimization
C Tessler, DJ Mankowitz, S Mannor
arXiv preprint arXiv:1805.11074, 2018
Cited by: 519 | Year: 2018
PAC bounds for multi-armed bandit and Markov decision processes
E Even-Dar, S Mannor, Y Mansour
Computational Learning Theory: 15th Annual Conference on Computational …, 2002
Cited by: 517 | Year: 2002
Reinforcement learning with Gaussian processes
Y Engel, S Mannor, R Meir
ICML, 201-208, 2005
Cited by: 501 | Year: 2005
Robustness and generalization
H Xu, S Mannor
Machine learning 86, 391-423, 2012
Cited by: 493 | Year: 2012
The sample complexity of exploration in the multi-armed bandit problem
S Mannor, JN Tsitsiklis
Journal of Machine Learning Research 5 (Jun), 623-648, 2004
Cited by: 477 | Year: 2004
A deep hierarchical approach to lifelong learning in Minecraft
C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
Cited by: 435 | Year: 2017
Robust regression and lasso
H Xu, C Caramanis, S Mannor
Advances in neural information processing systems 21, 2008
Cited by: 368 | Year: 2008
Q-cut—dynamic discovery of sub-goals in reinforcement learning
I Menache, S Mannor, N Shimkin
Machine Learning: ECML 2002: 13th European Conference on Machine Learning …, 2002
Cited by: 368 | Year: 2002
Policy gradients with variance related risk criteria
A Tamar, D Di Castro, S Mannor
Proceedings of the twenty-ninth international conference on machine learning …, 2012
Cited by: 365 | Year: 2012
Risk-sensitive and robust decision-making: a CVaR optimization approach
Y Chow, A Tamar, S Mannor, M Pavone
Advances in neural information processing systems 28, 2015
Cited by: 358 | Year: 2015
The cross entropy method for classification
S Mannor, D Peleg, R Rubinstein
Proceedings of the 22nd international conference on Machine learning, 561-568, 2005
Cited by: 357 | Year: 2005
Dynamic abstraction in reinforcement learning via clustering
S Mannor, I Menache, A Hoze, U Klein
Proceedings of the twenty-first international conference on Machine learning, 71, 2004
Cited by: 330 | Year: 2004
Graying the black box: Understanding DQNs
T Zahavy, N Ben-Zrihem, S Mannor
International conference on machine learning, 1899-1908, 2016
Cited by: 328 | Year: 2016
Percentile optimization for Markov decision processes with parameter uncertainty
E Delage, S Mannor
Operations research 58 (1), 203-213, 2010
Cited by: 328* | Year: 2010
Bayes meets Bellman: The Gaussian process approach to temporal difference learning
Y Engel, S Mannor, R Meir
Proceedings of the 20th International Conference on Machine Learning (ICML …, 2003
Cited by: 303 | Year: 2003