Efficient reinforcement learning for high dimensional linear quadratic systems M Ibrahimi, A Javanmard, B Van Roy Advances in Neural Information Processing Systems 25, 2012 | 107 | 2012 |
Epistemic neural networks I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ... Advances in Neural Information Processing Systems 36, 2795-2823, 2023 | 101 | 2023 |
Learning networks of stochastic differential equations J Pereira, M Ibrahimi, A Montanari Advances in Neural Information Processing Systems 23, 2010 | 89 | 2010 |
Reinforcement learning, bit by bit X Lu, B Van Roy, V Dwaracherla, M Ibrahimi, I Osband, Z Wen Foundations and Trends® in Machine Learning 16 (6), 733-865, 2023 | 75 | 2023 |
The set of solutions of random XORSAT formulae M Ibrahimi, Y Kanoria, M Kraning, A Montanari Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete …, 2012 | 49 | 2012 |
Hypermodels for exploration V Dwaracherla, X Lu, M Ibrahimi, I Osband, Z Wen, B Van Roy arXiv preprint arXiv:2006.07464, 2020 | 47 | 2020 |
On efficiency in hierarchical reinforcement learning Z Wen, D Precup, M Ibrahimi, A Barreto, B Van Roy, S Singh Advances in Neural Information Processing Systems 33, 6708-6718, 2020 | 47 | 2020 |
A packet-based photonic label switching router for a multirate all-optical CDMA-based GMPLS switch F Farnoud, M Ibrahimi, JA Salehi IEEE Journal of Selected Topics in Quantum Electronics 13 (5), 1522-1530, 2007 | 20 | 2007 |
The neural testbed: Evaluating joint predictions I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ... Advances in Neural Information Processing Systems 35, 12554-12565, 2022 | 18 | 2022 |
Efficient power allocation in cooperative OFDM system with channel variation M Ibrahimi, B Liang 2008 IEEE International Conference on Communications, 3022-3028, 2008 | 17 | 2008 |
Approximate Thompson sampling via epistemic neural networks I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ... Uncertainty in Artificial Intelligence, 1586-1595, 2023 | 15 | 2023 |
A renewable, modular, and time-responsive DNA circuit A Goel, M Ibrahimi Natural Computing 10, 467-485, 2011 | 14 | 2011 |
Robust max-product belief propagation M Ibrahimi, A Javanmard, Y Kanoria, A Montanari 2011 Conference Record of the Forty Fifth Asilomar Conference on Signals …, 2011 | 12 | 2011 |
From predictions to decisions: The importance of joint predictive distributions Z Wen, I Osband, C Qin, X Lu, M Ibrahimi, V Dwaracherla, M Asghari, ... arXiv preprint arXiv:2107.09224, 2021 | 11 | 2021 |
Information theoretic limits on learning stochastic differential equations J Bento, M Ibrahimi, A Montanari 2011 IEEE International Symposium on Information Theory Proceedings, 855-859, 2011 | 11 | 2011 |
Evaluating predictive distributions: Does Bayesian deep learning work? I Osband, Z Wen, SM Asghari, X Lu, M Ibrahimi, V Dwaracherla, ... | 9 | 2021 |
Renewable, time-responsive DNA logic gates for scalable digital circuits A Goel, M Ibrahimi International Workshop on DNA-Based Computers, 67-77, 2009 | 9 | 2009 |
Support recovery for the drift coefficient of high-dimensional diffusions JBA Pereira, M Ibrahimi IEEE Transactions on Information Theory 60 (7), 4026-4049, 2014 | 8 | 2014 |
Accelerated time-of-flight mass spectrometry M Ibrahimi, A Montanari, GS Moore IEEE Transactions on Signal Processing 62 (15), 3784-3798, 2014 | 5 | 2014 |
Posterior sampling networks VR Dwaracherla, B Van Roy, M Ibrahimi Reinforcement Learning and Decision Making Conference, 366-370, 2019 | 4 | 2019 |