Multi-view transformer for 3d visual grounding S Huang, Y Chen, J Jia, L Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 105 | 2022 |
Surrogate-assisted evolutionary framework with adaptive knowledge transfer for multi-task optimization S Huang, J Zhong, WJ Yu IEEE transactions on emerging topics in computing 9 (4), 1930-1944, 2019 | 75 | 2019 |
Mp-former: Mask-piloted transformer for image segmentation H Zhang, F Li, H Xu, S Huang, S Liu, LM Ni, L Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 60 | 2023 |
Llava-grounding: Grounded visual chat with large multimodal models H Zhang, H Li, F Li, T Ren, X Zou, S Liu, S Huang, J Gao, L Zhang, C Li, ... arXiv preprint arXiv:2312.02949, 2023 | 45 | 2023 |
Dsgn++: Exploiting visual-spatial relation for stereo-based 3d detectors Y Chen, S Huang, S Liu, B Yu, J Jia IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (4), 4416-4429, 2022 | 27 | 2022 |
Towards learning a generalist model for embodied navigation D Zheng, S Huang, L Zhao, Y Zhong, L Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 20 | 2024 |
DQ-DETR: Dual query detection transformer for phrase extraction and grounding S Liu, S Huang, F Li, H Zhang, Y Liang, H Su, J Zhu, L Zhang Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1728-1736, 2023 | 18 | 2023 |
Cleva: Chinese language models evaluation platform Y Li, J Zhao, D Zheng, ZY Hu, Z Chen, X Su, Y Huang, S Huang, D Lin, ... arXiv preprint arXiv:2308.04813, 2023 | 11 | 2023 |
A unified mutual supervision framework for referring expression segmentation and generation S Huang, F Li, H Zhang, S Liu, L Zhang, L Wang arXiv preprint arXiv:2211.07919, 2022 | 4 | 2022 |
Learning preference model for llms via automatic preference data generation S Huang, J Zhao, Y Li, L Wang Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | 3 | 2023 |