Generating natural language adversarial examples through probability weighted word saliency S Ren, Y Deng, K He, W Che ACL 2019, 2019 | 663 | 2019 |
MIT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning L Li, Y Yin, S Li, L Chen, P Wang, S Ren, M Li, Y Yang, J Xu, X Sun, ... arXiv preprint arXiv:2306.04387, 2023 | 62* | 2023 |
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade L Li, Y Lin, D Chen, S Ren, P Li, J Zhou, X Sun Findings of EMNLP 2021, 2021 | 41* | 2021 |
Dynamic Knowledge Distillation for Pre-trained Language Models L Li, Y Lin, S Ren, P Li, J Zhou, X Sun EMNLP 2021, 2021 | 34 | 2021 |
Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification S Ren, J Zhang, L Li, X Sun, J Zhou EMNLP 2021, 2021 | 27 | 2021 |
Learning Relation Alignment for Calibrated Cross-modal Retrieval S Ren, J Lin, G Zhao, R Men, A Yang, J Zhou, X Sun, H Yang ACL 2021, 2021 | 25 | 2021 |
Delving into the Openness of CLIP S Ren, L Li, X Ren, G Zhao, X Sun Findings of ACL 2023, 2022 | 15* | 2022 |
DCA: Diversified Co-Attention towards Informative Live Video Commenting Z Zhang, Z Yin, S Ren, X Li, S Li NLPCC 2020, 2020 | 14 | 2020 |
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition S Ren, A Zhang, Y Zhu, S Zhang, S Zheng, M Li, A Smola, X Sun NeurIPS 2023, 2023 | 13 | 2023 |
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain L Chen, Y Zhang, S Ren, H Zhao, Z Cai, Y Wang, P Wang, X Meng, T Liu, ... arXiv preprint arXiv:2402.15527, 2024 | 12* | 2024 |
Cuge: A chinese language understanding and generation evaluation benchmark Y Yao, Q Dong, J Guan, B Cao, Z Zhang, C Xiao, X Wang, F Qi, J Bao, ... arXiv preprint arXiv:2112.13610, 2021 | 11 | 2021 |
FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation Y Liu, L Li, S Ren, R Gao, S Li, S Chen, X Sun, L Hou NeurIPS 2023 (Datasets and Benchmarks Track), 2023 | 9 | 2023 |
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding S Ren, L Yao, S Li, X Sun, L Hou CVPR 2024, 2023 | 6 | 2023 |
TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding S Ren, S Chen, S Li, X Sun, L Hou Findings of EMNLP 2023, 2023 | 2 | 2023 |
TempCompass: Do Video LLMs Really Understand Videos? Y Liu, S Li, Y Liu, Y Wang, S Ren, L Li, S Chen, X Sun, L Hou arXiv preprint arXiv:2403.00476, 2024 | 1 | 2024 |
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models S Li, L Li, S Ren, Y Liu, Y Liu, R Gao, X Sun, L Hou arXiv preprint arXiv:2311.17404, 2023 | 1 | 2023 |
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? Y Wang, S Ren, R Gao, L Yao, Q Guo, K An, J Bai, X Sun arXiv preprint arXiv:2404.10763, 2024 | | 2024 |
Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality S Chen, L Li, S Ren, R Gao, Y Liu, X Bi, X Sun, L Hou arXiv preprint arXiv:2403.19221, 2024 | | 2024 |