Image as a foreign language: Beit pretraining for vision and vision-language tasks W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 657* | 2023 |
Vlmo: Unified vision-language pre-training with mixture-of-modality-experts H Bao, W Wang, L Dong, Q Liu, OK Mohammed, K Aggarwal, S Som, ... Advances in Neural Information Processing Systems 35, 32897-32912, 2022 | 432 | 2022 |
Language is not all you need: Aligning perception with language models S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... Advances in Neural Information Processing Systems 36, 72096-72109, 2023 | 379 | 2023 |
Omega-3 fatty acids and cardiovascular disease AP Jain, KK Aggarwal, PY Zhang Eur Rev Med Pharmacol Sci 19 (3), 441-5, 2015 | 204 | 2015 |
Image as a foreign language: Beit pretraining for all vision and vision-language tasks W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ... arXiv preprint arXiv:2208.10442, 2022 | 111 | 2022 |
Orca 2: Teaching small language models how to reason A Mitra, L Del Corro, S Mahajan, A Codas, C Simoes, S Agarwal, X Chen, ... arXiv preprint arXiv:2311.11045, 2023 | 83 | 2023 |
Image as a foreign language: Beit pretraining for all vision and vision … W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ... 2022 | 71 | 2022 |
Language is not all you need: Aligning perception with language models S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... arXiv preprint arXiv:2302.14045 1 (2), 3, 2023 | 44 | 2023 |
Language is not all you need: Aligning perception with language models … S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... 2023 | 43 | 2023 |
Vlmo: Unified vision-language pretraining with mixture-of-modality-experts H Bao, W Wang, L Dong, Q Liu, OK Mohammed, K Aggarwal arXiv preprint arXiv:2111.02358 3, 2021 | 34 | 2021 |
Polaris: A Safety-focused LLM Constellation Architecture for Healthcare S Mukherjee, P Gamble, MS Ausin, N Kant, K Aggarwal, N Manjunath, ... arXiv preprint arXiv:2403.13313, 2024 | 3 | 2024 |
Odin: A single model for 2d and 3d perception A Jain, P Katara, N Gkanatsios, AW Harley, G Sarch, K Aggarwal, ... arXiv preprint arXiv:2401.02416, 2024 | 3 | 2024 |
DUBLIN--Document Understanding By Language-Image Network K Aggarwal, A Khandelwal, K Tanmay, OM Khan, Q Liu, M Choudhury, ... arXiv preprint arXiv:2305.14218, 2023 | 2 | 2023 |
Bootstrapping a high quality multilingual multimodal dataset for Bletchley OK Mohammed, K Aggarwal, Q Liu, S Singhal, J Bjorck, S Som Asian Conference on Machine Learning, 738-753, 2023 | 2 | 2023 |
ODIN: A Single Model for 2D and 3D Segmentation A Jain, P Katara, N Gkanatsios, AW Harley, G Sarch, K Aggarwal, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 1 | 2024 |
DUBLIN: Visual Document Understanding By Language-Image Network K Aggarwal, A Khandelwal, K Tanmay, OK Mohammed, Q Liu, ... Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | 1 | 2023 |
sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting S Ahuja, K Tanmay, HH Chauhan, B Patra, K Aggarwal, L Del Corro, ... arXiv preprint arXiv:2407.09879, 2024 | | 2024 |
ODIN: A Single Model for 2D and 3D Segmentation Supplementary Materials A Jain, P Katara, N Gkanatsios, AW Harley, G Sarch, K Aggarwal, ... | | |