Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes CY Wei, M Jafarnia-Jahromi, H Luo, H Sharma, R Jain International Conference On Machine Learning (ICML), 2020 | 113 | 2020 |
Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation CY Wei, M Jafarnia-Jahromi, H Luo, R Jain International Conference on Artificial Intelligence and Statistics (AISTATS), 2021 | 65 | 2021 |
Online Learning for Unknown Partially Observable MDPs M Jafarnia-Jahromi, R Jain, A Nayyar International Conference on Artificial Intelligence and Statistics (AISTATS), 2022 | 31* | 2022 |
Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path L Chen, M Jafarnia-Jahromi, R Jain, H Luo Neural Information Processing Systems (NeurIPS), 2021 | 26 | 2021 |
Online Learning for Stochastic Shortest Path Model via Posterior Sampling M Jafarnia-Jahromi, L Chen, R Jain, H Luo arXiv preprint arXiv:2106.05335, 2021 | 20 | 2021 |
Approximate Relative Value Learning for Average-reward Continuous State MDPs H Sharma, M Jafarnia-Jahromi, R Jain Uncertainty in Artificial Intelligence (UAI), 2019 | 16 | 2019 |
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret M Jafarnia-Jahromi, CY Wei, R Jain, H Luo arXiv preprint arXiv:2006.04354, 2020 | 10 | 2020 |
Learning Zero-sum Stochastic Games with Posterior Sampling M Jafarnia-Jahromi, R Jain, A Nayyar arXiv preprint arXiv:2109.03396, 2021 | 9 | 2021 |
Non-indexability of the Stochastic Appointment Scheduling Problem M Jafarnia-Jahromi, R Jain Automatica 118, 109016, 2020 | 8 | 2020 |
Online learning for cooperative multi-player multi-armed bandits W Chang, M Jafarnia-Jahromi, R Jain 2022 IEEE 61st Conference on Decision and Control (CDC), 7248-7253, 2022 | 7 | 2022 |
PPD: Permutation Phase Defense Against Adversarial Examples in Deep Learning M Jafarnia-Jahromi, T Chowdhury, HT Wu, S Mukherjee 18th IEEE International Conference On Machine Learning And Applications (ICMLA), 2019 | 4 | 2019 |
Posterior sampling-based online learning for the stochastic shortest path model M Jafarnia-Jahromi, L Chen, R Jain, H Luo Uncertainty in Artificial Intelligence, 922-931, 2023 | 2 | 2023 |
A Bayesian Learning Algorithm for Unknown Zero-sum Stochastic Games with an Arbitrary Opponent MJ Jahromi, RA Jain, A Nayyar International Conference on Artificial Intelligence and Statistics, 3880-3888, 2024 | | 2024 |