Tony (Lipeng) He

Student, software engineer, founder, and researcher at the University of Waterloo.

Tony teaching an undergraduate CS course

Pages & Links

Pinned

About

I'm a computer science PhD student at UWaterloo. I'm part of Secure Systems Group (SSG), Cryptography, Security, and Privacy (CrySP) Lab, and the Cybersecurity and Privacy Institute (CPI).

I am grateful to be advised by N. Asokan and Yaoliang Yu. I also worked with Jian Liu at Zhejiang University and did applied cryptography research at ABC Lab. Currently, my office is located in the William G. Davis Computer Research Centre, DC 3333.

I'm in pursuit of knowledge, experience, and the various other beautiful things life has to offer. I strive to live deliberately. Before research, I spent some years doing software engineering. In the limit of my life, I am also trying to be a pianist, wri ter, podcaster, designer, and entrepreneur^[1].

^[1] Retrograde Labs is a research-backed startup building the trust layer for agentic AI. We strive to do the kind of research that not only helps us identify failure modes and address theoretical bottlenecks, but can also be turned into something useful in production, and something that is able to withstand the test of the market and real customers. At Retrograde Labs, we build, grow, and scale products that apply research ideas to address issues in real-world workflows.

Research Interests

TL;DR: I study how AI safety and security break in deployed systems, and how to build defenses that survive those conditions.

My research focuses on Trustworthy Machine Learning, with an emphasis on the robustness of LLMs and the security & privacy of agentic AI systems. I study failures that appear when models are fine-tuned, approximated, compressed, connected to tools, exposed to external content, or placed in multi-agent, and other production ML pipeline settings. I care about issues such as data leakage, prompt injection, unsafe tool use, brittle safety behavior, and systems that cannot explain or audit what happened.

I develop effective and efficient adversarial attacks (e.g., automated red-teaming), as well as principled/scalable defenses (e.g., based on reinforcement learning, mechanistic interpretability), drawing on applied cryptography, theoretical machine learning, and systems security to characterize and mitigate emerging threats.

Research Vision

To ensure AI's transformative potential reaches as much of the society as possible in the most impactful way, with safety as the unlock that guarantees the world will benefit from AI.

Selected PublicationsPublications* indicates equal contribution

SoK: Colluding Adversaries in Machine Learning Pipelines

Vasisht Duddu, Lipeng He, Asim Waheed, and N. Asokan

USENIX Security 2026

Paper →

Understanding and Preserving Safety in Fine-Tuned LLMs

Jiawen Zhang, Yangfan Hu, Kejia Chen, Lipeng He, Jiachen Ma, Jian Lou, Dan Li, Jian Liu, Xiaohu Yang, and Ruoxi Jia

CCS 2026

Paper →

Locket: Robust Feature-Locking Technique for Language Models

Lipeng He, Vasisht Duddu, and N. Asokan

ACL 2026

Paper →Poster →Workshop →Code →

Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance

Jiawen Zhang, Lipeng He, Kejia Chen, Jian Lou, Jian Liu, Xiaohu Yang, and Ruoxi Jia

ICLR 2026

Paper →Code →

Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense

Jiawen Zhang*, Kejia Chen*, Lipeng He*, Jian Lou, Dan Li, Zunlei Feng, Mingli Song, Jian Liu, Kui Ren, and Xiaohu Yang

USENIX Security 2025

Paper →Code →Website →

LookAhead: Preventing DeFi Attacks via Unveiling Adversarial Contracts

Shoupeng Ren, Lipeng He, Tianyu Tu, Di Wu, Jian Liu, Kui Ren, and Chun Chen

FSE 2025

Paper →Code →

Secure Transformer Inference Made Non-interactive

Jiawen Zhang, Xinpeng Yang, Lipeng He, Kejia Chen, Wen-jie Lu, Yinghao Wang, Xiaoyang Hou, Jian Liu, Kui Ren and Xiaohu Yang

NDSS 2025

Paper →Code →🏆 Top-cited →

On the Atomicity and Efficiency of Blockchain Payment Channels

Di Wu, Shoupeng Ren, Yuman Bai, Lipeng He, Jian Liu, Wu Wen, Kui Ren, et al.

USENIX Security 2025

Paper →Code →

Revealing and Benchmarking the Safety Risks in Blockchain Agents

Jiawen Zhang, Kejia Chen, Lipeng He, Yechao Zhang, Jian Liu and Xiaohu Yang

BCRA 2026

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Jialin Yang, Dongfu Jiang, Lipeng He, Sherman Siu, Yuxuan Zhang, Disen Liao, Benjamin Schneider, Ping Nie, Wenhu Chen, et al.

Transactions on Machine Learning Research

Paper →Code →Website →🏆 J2C →

FedVLP: Visual-aware Latent Prompt Generation for Multimodal Federated Learning

Hao Pan, Xiaoli Zhao, Yuchen Jiang, Lipeng He, Bingquan Wang, and Yincan Shu

Computer Vision and Image Understanding

Paper →

A Survey of Multimodal Federated Learning: Background, Applications, and Perspectives

Hao Pan, Xiaoli Zhao, Lipeng He, Yicong Shi, and Xiaogang Lin

Multimedia Systems

Paper →Code →

A Comparative Examination of Network and Contract-Based Blockchain Storage Solutions for Decentralized Applications

Lipeng He

DECA 2023

Paper →

PreprintsCitations

SecStep: Guarding Agents Against Prompt Injection via Action Backtracking

Jiawen Zhang, Yechao Zhang, Lipeng He, Kejia Chen, Jiachen Ma, Yangfan Hu, Jian Lou, Jian Liu, Xiaohu Yang, and Tianwei Zhang

Under Submission

Beyond Similarity: Trustworthy Memory Search for Personal AI Agents

Jiawen Zhang, Kejia Chen, Jiachen Ma, Yangfan Hu, Lipeng He, Yechao Zhang, Jian Liu, Xiaohu Yang, Tianwei Zhang, and Ruoxi Jia

Under Submission

Paper →

Defending against Adaptive Prompt Injection Attacks via Reasoning-enabled Task Alignment

Lipeng He, Yihan Wang, Jiawen Zhang, and N. Asokan

Under Submission

Paper →

Backdooring Bias in Large Language Models

Anudeep Das, Prach Chantasantitam, Gurjot Singh, Lipeng He, Mariia Ponomarenko, and Florian Kerschbaum

Under Submission

Paper →

From Detection to Diagnosis: Lightweight Federated Prompt Learning for Interpretable Industrial Anomaly Analysis

Hao Pan, Xiaoli Zhao, Lipeng He, and Xiwu Shang

Under Submission

Token-by-Token Manipulation: Inference-Time Jailbreaking on Production LLMs via Autoregressive Harmful Guidance

Jiawen Zhang, Lipeng He, Kejia Chen, Jian Liu, Zunlei Feng, Mingli Song, Jian Lou, Dan Li, and Xiaohu Yang

Under Submission

Talks

Cybersecurity and Privacy Institute (CPI) Graduate Student Conference 2026

Locket: Robust Feature-Locking Technique for Language Models

Spotlight Talk

Poster →

Cybersecurity and Privacy Institute (CPI) Graduate Student Conference 2025

Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense

Spotlight Talk

Poster →Slides →

Academic Services

Program Committee Member

Conference

USENIX Security Symposium 2027

Program Committee Member

Conference

USENIX Security Symposium 2026

Artifact Evaluation

Program Committee Member

Conference

Privacy Enhancing Technologies Symposium (PoPETs/PETS) 2026

Artifact Evaluation

Program Committee Member

Conference

ACM Conference on Computer and Communications Security (CCS) 2025, 2026

Artifact Evaluation

Invited Reviewer

Journal

IEEE Transactions on Dependable and Secure Computing (TDSC)

Student Member

Membership

Association for Computing Machinery (ACM)

lipenghe@acm.org

Mentoring

Directed Reading Program (DRP) Mentor

University

AI Safety and Security Challenges in LLM-based Autonomous Agents (Spring 2026)

Women in Mathematics (WiM)

Funding

University of Waterloo Graduate Scholarship

University

CAD 4,000

University of Waterloo

AWS Startup Activate Credits (Portfolio)

Industry

USD 25,000

Amazon, Y Combinator

Lambda Research Grant Program

Industry

USD 5,000; Principal Investigator: N. Asokan

λ (Lambda) AI

David R. Cheriton Graduate Scholarship

University

CAD 10,000

University of Waterloo

International Master's Award of Excellence (IMAE)

University

CAD 7,500

University of Waterloo

Teaching

University of Waterloo

Part-time

Teaching Assistant (TA)

Jan 2026 - Present

CS 436 Networks and Distributed Computer Systems

University of Waterloo

Part-time

Instructional Apprentice (IA)

Sept 2025 - Dec 2025

CS 135 Designing Functional Programs

University of Waterloo

Co-op

Instructional Support Assistant (ISA)

Aug 2024 - Dec 2024

CS 135 Designing Functional Programs

Research Experience

University of Waterloo

Research, Part-time

Research Assistant (URA)

Jan 2025 - Present

Cryptography, Security, and Privacy (CrySP) Lab

Zhejiang University

Research, Co-op

Research Assistant

May - Aug 2024

ABC Lab, Institute of Cyberspace Research

Industry ExperienceLinkedIn

Retrograde Labs

Leadership

Co-Founder

May 2026 - Present

Accelerating frontier scientific discovery and commercialization

Bluelet AI

Leadership

Co-Founder & CTO

May 2025 - June 2025

Agentic AI and data platform solutions for talent acquisition and matching

BioRender

SWE, Co-op

Full Stack Software Engineer

Jan - Apr 2023

SaaS, Y Combinator W18

Toronto, ON

Safyre Labs

SWE, Co-op

Back End Software Engineer

May - Aug 2022

E-Commerce Platform, Supply Chain

North York, ON

Bitbuy

SWE, Co-op

Front End Software Engineer

Sep - Dec 2021

Cryptocurrency Exchange, Part of Robinhood, TSX: WNDR

Toronto, ON

Education

University of Waterloo

PhD

Doctor of Philosophy

Sep 2025 - Present

Computer Science

University of Waterloo

BMath

Honours Bachelor's Degree (Co-op)

Sep 2020 - Apr 2025

Mathematics (Minor in Computing)

Nanyang Technological University

Undergrad

Exchange Student (GEM Trailblazer)

Aug 2023 - Dec 2023

Mathematical Sciences

NewsletterPodcast

New Article Everytime I Publish :)