Benjamin Van Roy

Cited by

	Total	Since 2019
Citations	19161	10117
h-index	59	44
i10-index	129	93

Citations per year

Year	Citations
1997	48
1998	50
1999	68
2000	71
2001	109
2002	169
2003	160
2004	208
2005	308
2006	339
2007	424
2008	445
2009	552
2010	577
2011	561
2012	611
2013	546
2014	631
2015	603
2016	637
2017	750
2018	995
2019	1287
2020	1662
2021	1834
2022	1989
2023	1992
2024	1349

Public access

5 articles available, 0 articles not available (based on funding mandates)

Co-authors

Ian Osband, OpenAI (verified email at openai.com)
John Tsitsiklis, Professor of Electrical Engineering, MIT (verified email at mit.edu)
Zheng Wen, Google DeepMind (verified email at google.com)
Daniel Russo, Columbia University (verified email at gsb.columbia.edu)
Gabriel Y Weintraub, Stanford GSB (verified email at stanford.edu)
Ciamac Moallemi, Professor, Graduate School of Business, Columbia University (verified email at gsb.columbia.edu)
Morteza Ibrahimi, Stanford University (verified email at stanford.edu)
Paat Rusmevichientong, Professor, Marshall School of Business, University of Southern California (verified email at marshall.usc.edu)
Vivek Farias, Massachusetts Institute of Technology (verified email at mit.edu)
Abbas Kazerouni, Stanford University (verified email at stanford.edu)
Anant Sahai, EECS, University of California, Berkeley (verified email at eecs.berkeley.edu)
Alexander Pritzel, Deepmind (verified email at google.com)
Charles Blundell, Research Scientist at DeepMind (verified email at google.com)
Tsachy Weissman, Professor of Electrical Engineering at Stanford University (verified email at stanford.edu)
Yi-Hao Kao, PhD Candidate, Electrical Engineering, Stanford University (verified email at stanford.edu)
Hui Zhang, Carnegie Mellon University, Conviva (verified email at andrew.cmu.edu)
Richard Zeckhauser, Harvard University (verified email at harvard.edu)
Per Enge, Professor, Stanford University (verified email at stanford.edu)
Ramesh Govindan, Professor of Computer Science, University of Southern California (verified email at usc.edu)
Ashish Goel, Professor of Management Science and Engineering, and by courtesy, Computer Science, Stanford University (verified email at stanford.edu)

Benjamin Van Roy

Stanford University

Verified email at stanford.edu

reinforcement learning, operations research, information theory


Title	Cited by	Year
Analysis of temporal-difference learning with function approximation. J Tsitsiklis, B Van Roy. Advances in Neural Information Processing Systems 9, 1996	2200	1996
Deep exploration via bootstrapped DQN. I Osband, C Blundell, A Pritzel, B Van Roy. Advances in Neural Information Processing Systems 29, 2016	1468	2016
A tutorial on Thompson sampling. D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen. Foundations and Trends in Machine Learning 11 (1), pp. 1-96, 2018	1133	2018
The linear programming approach to approximate dynamic programming. DP De Farias, B Van Roy. Operations Research 51 (6), 850-865, 2003	965	2003
Regression methods for pricing complex American-style options. JN Tsitsiklis, B Van Roy. IEEE Transactions on Neural Networks 12 (4), 694-703, 2001	856	2001
Learning to optimize via posterior sampling. D Russo, B Van Roy. Mathematics of Operations Research 39 (4), 1221-1243, 2014	745	2014
Feature-based methods for large scale dynamic programming. JN Tsitsiklis, B Van Roy. Machine Learning 22 (1), 59-94, 1996	716	1996
Markov perfect industry dynamics with many firms. G Weintraub, CL Benkard, B Van Roy. Econometrica 76 (6), 1375-1411, 2008	561	2008
On constraint sampling in the linear programming approach to approximate dynamic programming. DP De Farias, B Van Roy. Mathematics of Operations Research 29 (3), 462-478, 2004	494	2004
Optimal stopping of Markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives. JN Tsitsiklis, B Van Roy. IEEE Transactions on Automatic Control 44 (10), 1840-1851, 1999	483	1999
An information-theoretic analysis of Thompson sampling. D Russo, B Van Roy. Journal of Machine Learning Research 17 (68), 1-30, 2016	425	2016
Deep exploration via randomized value functions. I Osband, B Van Roy, DJ Russo, Z Wen. The Journal of Machine Learning Research 20 (124), 1-62, 2019	332	2019
Generalization and exploration via randomized value functions. I Osband, B Van Roy, Z Wen. International Conference on Machine Learning, 2377-2386, 2016	331	2016
Consensus propagation. CC Moallemi, B Van Roy. IEEE Transactions on Information Theory 52 (11), 4753-4766, 2006	301	2006
Why is posterior sampling better than optimism for reinforcement learning? I Osband, B Van Roy. International Conference on Machine Learning, 2701-2710, 2017	269	2017
Dynamic pricing with a prior on market response. VF Farias, B Van Roy. Operations Research 58 (1), 16-29, 2010	269	2010
Solving data mining problems through pattern recognition. RL Kennedy, Y Lee, B Van Roy, CD Reed, RP Lippman. Upper Saddle River, NJ: Prentice Hall PTR, 2011	268*	2011
Eluder dimension and the sample complexity of optimistic exploration. D Russo, B Van Roy. Advances in Neural Information Processing Systems 26, 2013	264	2013
Learning to optimize via information-directed sampling. D Russo, B Van Roy. Advances in Neural Information Processing Systems 27, 2014	239	2014
A neuro-dynamic programming approach to retailer inventory management. B Van Roy, DP Bertsekas, Y Lee, JN Tsitsiklis. Proceedings of the 36th IEEE Conference on Decision and Control 4, 4052-4057, 1997	239	1997

Articles 1–20
