Research Theme

Trustworthy AI

Research Interests

  • Trustworthy AI
  • AI Security & Privacy
  • AI-Generated Content Detection
  • AI For Safety

News

Awards and Milestones

  • ๐Ÿ†2025.12I received the 2025 DSA Excellent Research Award.
  • ๐Ÿ†2024.11AdSpectorX received the Best Paper Award at SENSYS-SocialMeta 2024.
  • ๐Ÿ†2024.06I received my firm PhD offer from HKUST(GZ).

Selected Papers

* denotes equal contribution. denotes corresponding author.

  • [IEEE S&P 2025, CCF-A] PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning. Zhen Sun, Tianshuo Cong, Yule Liu, Chenhao Lin, Xinlei He, Rongmao Chen, Xingshuo Han, Xinyi Huang.
  • [ACL 2025, CCF-A] Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media. Zhen Sun*, Zongmin Zhang*, Xinyue Shen, Ziyi Zhang, Yule Liu, Michael Backes, Yang Zhang, Xinlei He.
  • [EMNLP 2025 Findings, CCF-B] FC-Attack: Jailbreaking Large Vision-Language Models via Auto-Generated Flowcharts. Ziyi Zhang*, Zhen Sun*, Zongmin Zhang, Jihui Guo, Xinlei He.
  • [KDD 2025, CCF-A] TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors. Jingyi Zheng, Junfeng Wang, Zhen Sun, Wenhan Dong, Yule Liu, Xinlei He.
  • [AAAI 2026, CCF-A · Oral] 6DAttack: Backdoor Attacks in the 6DoF Pose Estimation. Jihui Guo, Zongmin Zhang, Zhen Sun, Yuhao Yang, Jinlin Wu, Fu Zhang, Xinlei He.
  • [ICLR 2026, CCF-A] JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models. Zifan Peng, Yule Liu, Zhen Sun, Mingchen Li, Zeren Luo, Jingyi Zheng, Wenhan Dong, Xinlei He, Xuechao Wang, Yingjie Xue, Shengmin Xu, Xinyi Huang.
  • [KDD 2025, CCF-A] On the Generalization and Adaptation Ability of Machine-Generated Text Detectors in Academic Writing. Yule Liu, Zhiyuan Zhong, Yifan Liao, Zhen Sun, Jingyi Zheng, Jiaheng Wei, Qingyuan Gong, Fenghua Tong, Yang Chen, Yang Zhang, Xinlei He.
  • [USENIX Security 2025, CCF-A] Unsafe LLM-Based Search: Quantitative Analysis and Mitigation of Safety Risks in AI Web Search. Zeren Luo, Zifan Peng, Yule Liu, Zhen Sun, Mingchen Li, Jingyi Zheng, Xinlei He.
  • [NeurIPS 2025, CCF-A] CHASM: Unveiling Covert Advertisements on Chinese Social Media. Jingyi Zheng, Tianyi Hu, Yule Liu, Zhen Sun, Zongmin Zhang, Zifan Peng, Wenhan Dong, Xinlei He.
  • [SENSYS-SocialMeta 2024, Best Paper] AdSpectorX: A Multimodal Expert Spector for Covert Advertising Detection on Chinese Social Media. Zongmin Zhang, Yujie Han, Zhou Zhang, Yule Liu, Jingyi Zheng, Zhen Sun.

Services

Conference PC / Reviewer

  • The Web Conference 2025 Web4Good Track
  • AAAI
  • ACM MM
  • ICML
  • CVPR
  • ACL
  • EMNLP
  • SaTML
  • EuroS&P
  • AsiaCCS

Journal Reviewer

  • IEEE Transactions on Dependable and Secure Computing (TDSC)
  • IEEE Transactions on Information Forensics and Security (TIFS)
  • ACM Transactions on Privacy and Security (TOPS)
  • International Journal of Human-Computer Interaction (IJHCI)

Honors and Awards

  • 2025, DSA Excellent Research Award
  • Kaggle Competitions Expert (Vincent Sirius)
  • 2020.04, MCM/ICM Meritorious Winner
  • 2019 / 2020 / 2021, Third-class Scholarship of BUPT
  • 2019 / 2020 / 2021, Excellent Student Leader of BUPT

Education

  • 2024.08-present, PhD in Data Science and Analytics, The Hong Kong University of Science and Technology (Guangzhou)
  • 2022.08-2023.10, MSc in Computer Science, City University of Hong Kong
  • 2018.09-2022.07, BSc in Computer Science and Technology, Beijing University of Posts and Telecommunications

Experience

  • Research Assistant, 2023.06-2024.05, Centre for Artificial Intelligence and Robotics (CAIR), Hong Kong Institute of Science & Innovation, Chinese Academy of Sciences (HKISI-CAS). Worked on surgical LLMs and image segmentation. Supervisor: Dr. Jinlin Wu.
  • Project Participant, 2022.09-2023.08, City University of Hong Kong. Worked on financial machine translation. Supervisor: Prof. Linqi Song.