A survey of safety and trustworthiness of large language models through the lens of verification and validation X Huang, W Ruan, W Huang, G Jin, Y Dong, C Wu, S Bensalem, R Mu, ... arXiv preprint arXiv:2305.11391, 2023 | 38 | 2023 |
Sparse adversarial video attacks with spatial transformations R Mu, W Ruan, LS Marcolino, Q Ni The British Machine Vision Conference (BMVC),2021, 2021 | 18 | 2021 |
Randomized adversarial training via taylor expansion G Jin, X Yi, D Wu, R Mu, X Huang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 15 | 2023 |
3DVerifier: efficient robustness verification for 3D point cloud models R Mu, W Ruan, LS Marcolino, Q Ni Machine Learning, 1-28, 2022 | 12 | 2022 |
Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning R Mu, W Ruan, LS Marcolino, G Jin, Q Ni AAAI 2023, 2022 | 11 | 2022 |
Building Guardrails for Large Language Models Y Dong, R Mu, G Jin, Y Qi, J Hu, X Zhao, J Meng, W Ruan, X Huang arXiv preprint arXiv:2402.01822, 2024 | 2 | 2024 |
Reward Certification for Policy Smoothed Reinforcement Learning R Mu, LS Marcolino, T Zhang, Y Zhang, X Huang, W Ruan AAAI, 2024, 2023 | 2 | 2023 |
Enhancing robustness in video recognition models: Sparse adversarial attacks and beyond R Mu, L Marcolino, Q Ni, W Ruan Neural Networks 171, 127-143, 2024 | 1* | 2024 |
DeepGRE: Global Robustness Evaluation of Deep Neural Networks T Zhang, J Liu, Y Zhang, R Mu, W Ruan ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Towards Fairness-Aware Adversarial Learning Y Zhang, T Zhang, R Mu, X Huang, W Ruan arXiv preprint arXiv:2402.17729, 2024 | | 2024 |
Nrat: towards adversarial training with inherent label noise Z Chen, F Wang, R Mu, P Xu, X Huang, W Ruan Machine Learning, 1-22, 2024 | | 2024 |