Image as a foreign language: Beit pretraining for all vision and vision-language tasks W Wang, H Bao, L Dong, J Bjorck, Z Peng, Q Liu, K Aggarwal, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 806* | 2022 |
Conformer: Local features coupling global representations for recognition and detection Z Peng, Z Guo, W Huang, Y Wang, L Xie, J Jiao, Q Tian, Q Ye IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 694* | 2023 |
Kosmos-2: Grounding multimodal large language models to the world Z Peng, W Wang, L Dong, Y Hao, S Huang, S Ma, Q Ye, F Wei The Twelfth International Conference on Learning Representations, 2023 | 434 | 2023 |
Beit v2: Masked image modeling with vector-quantized visual tokenizers Z Peng, L Dong, H Bao, Q Ye, F Wei arXiv preprint arXiv:2208.06366, 2022 | 236 | 2022 |
Ts-cam: Token semantic coupled attention map for weakly supervised object localization W Gao, F Wan, X Pan, Z Peng, Q Tian, Z Han, B Zhou, Q Ye Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 202 | 2021 |
Foundation transformers H Wang, S Ma, S Huang, L Dong, W Wang, Z Peng, Y Wu, P Bajaj, ... International Conference on Machine Learning, 2022 | 37* | 2022 |
Integral migrating pre-trained transformer encoder-decoders for visual object detection X Zhang, F Liu, Z Peng, Z Guo, F Wan, X Ji, Q Ye arXiv e-prints, arXiv: 2205.09613, 2022 | 24* | 2022 |
Generic-to-Specific Distillation of Masked Autoencoders W Huang, Z Peng, L Dong, F Wei, J Jiao, Q Ye Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 16 | 2023 |
Long-tailed distribution adaptation Z Peng, W Huang, Z Guo, X Zhang, J Jiao, Q Ye Proceedings of the 29th ACM International Conference on Multimedia, 3275-3282, 2021 | 10 | 2021 |
A unified view of masked image modeling Z Peng, L Dong, H Bao, Q Ye, F Wei Transactions on Machine Learning Research, 2023 | 2* | 2023 |
Generating images in context with multimodal large language models X Pan, L Dong, S Huang, Z Peng, W Chen, F Wei The Twelfth International Conference on Learning Representations, 2023 | 2 | 2023 |
Conformer: Local features coupling global representations for recognition and detection Z Peng, Z Guo, W Huang, Y Wang, L Xie, J Jiao, Q Tian, Q Ye IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | | 2023 |
Discriminatively Matched Part Tokens for Pointly Supervised Instance Segmentation Z Guo, M Liao, Z Peng, Y Zhang, P Yuan, Q Ye, F Wan | | |