Mehdi Jafarnia Jahromi
DeepMind
Verified email at google.com
Title
Cited by
Year
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
CY Wei, M Jafarnia-Jahromi, H Luo, H Sharma, R Jain
International Conference On Machine Learning (ICML), 2020
Cited by: 113 | Year: 2020
Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
CY Wei, M Jafarnia-Jahromi, H Luo, R Jain
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Cited by: 65 | Year: 2021
Online Learning for Unknown Partially Observable MDPs
M Jafarnia-Jahromi, R Jain, A Nayyar
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Cited by: 31* | Year: 2022
Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path
L Chen, M Jafarnia-Jahromi, R Jain, H Luo
Neural Information Processing Systems (NeurIPS), 2021
Cited by: 26 | Year: 2021
Online Learning for Stochastic Shortest Path Model via Posterior Sampling
M Jafarnia-Jahromi, L Chen, R Jain, H Luo
arXiv preprint arXiv:2106.05335, 2021
Cited by: 20 | Year: 2021
Approximate Relative Value Learning for Average-reward Continuous State MDPs
H Sharma, M Jafarnia-Jahromi, R Jain
Uncertainty in Artificial Intelligence (UAI), 2019
Cited by: 16 | Year: 2019
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
M Jafarnia-Jahromi, CY Wei, R Jain, H Luo
arXiv preprint arXiv:2006.04354, 2020
Cited by: 10 | Year: 2020
Learning Zero-sum Stochastic Games with Posterior Sampling
M Jafarnia-Jahromi, R Jain, A Nayyar
arXiv preprint arXiv:2109.03396, 2021
Cited by: 9 | Year: 2021
Non-indexability of the Stochastic Appointment Scheduling Problem
M Jafarnia-Jahromi, R Jain
Automatica 118, 109016, 2020
Cited by: 8 | Year: 2020
Online learning for cooperative multi-player multi-armed bandits
W Chang, M Jafarnia-Jahromi, R Jain
2022 IEEE 61st Conference on Decision and Control (CDC), 7248-7253, 2022
Cited by: 7 | Year: 2022
PPD: Permutation Phase Defense Against Adversarial Examples in Deep Learning
M Jafarnia-Jahromi, T Chowdhury, HT Wu, S Mukherjee
18th IEEE International Conference On Machine Learning And Applications (ICMLA), 2019
Cited by: 4 | Year: 2019
Posterior sampling-based online learning for the stochastic shortest path model
M Jafarnia-Jahromi, L Chen, R Jain, H Luo
Uncertainty in Artificial Intelligence, 922-931, 2023
Cited by: 2 | Year: 2023
A Bayesian Learning Algorithm for Unknown Zero-sum Stochastic Games with an Arbitrary Opponent
MJ Jahromi, RA Jain, A Nayyar
International Conference on Artificial Intelligence and Statistics, 3880-3888, 2024
Year: 2024