Tony (Lipeng) He

Student, software engineer, founder, and researcher at the University of Waterloo.

I'm pursuing a Master of Mathematics (Research/Thesis) degree in Computer Science at UWaterloo. I am grateful to be advised by N. Asokan.

I'm part of Secure Systems Group (SSG), Cryptography, Security, and Privacy (CrySP) Lab, and the Cybersecurity and Privacy Institute (CPI). I also worked with Jian Liu at ABC Lab, Zhejiang University. Currently, my office is located in the William G. Davis Computer Research Centre, DC 3333B, M3.

I'm in pursuit of knowledge, experience, and the various other beautiful things life has to offer. I strive to live deliberately. Before research, I spent some years doing software engineering. In the limit of my life, I am also trying to be a pianist, writer, podcaster, designer, and entrepreneur[1].

[1] Retrograde Labs is a research-backed startup building the trust layer for agentic AI. I strive to do the kind of research that not only helps us identify failure modes and address problems & bottlenecks, but can also be turned into something useful in production, and something that is able to withstand the test of the market and real customers. At Retrograde Labs, I build, grow, and scale products that apply research ideas in real-world workflows.

TL;DR: I study how AI safety and security break in deployed systems, and how to build defenses that survive those conditions.


My research focuses on Trustworthy Machine Learning, with an emphasis on LLM robustness of and the security & privacy of agentic AI systems. I study failures that appear when models are fine-tuned, approximated, compressed, connected to tools, exposed to external content, or placed in multi-agent, and other production ML pipeline settings. I care about issues such as data leakage, prompt injection, unsafe tool use, brittle safety behavior, and systems that cannot explain or audit what happened.

I develop effective and efficient adversarial attacks, as well as principled defenses, drawing on applied cryptography, theoretical machine learning, and systems security to characterize and mitigate emerging threats.

I am especially interested in agents that retrieve information, call tools, write code, or act on private context. These systems need stronger guarantees around provenance, permissions, policy, and auditability before people can safely delegate important work to them.

To ensure AI's transformative potential reaches as much of the society as possible in the most impactful way, with safety as the unlock that guarantees the world will benefit from AI.

* indicates equal contribution

SoK: Colluding Adversaries in Machine Learning Pipelines

Vasisht Duddu, Lipeng He, Asim Waheed, and N. Asokan

Understanding and Preserving Safety in Fine-Tuned LLMs

Jiawen Zhang, Yangfan Hu, Kejia Chen, Lipeng He, Jiachen Ma, Jian Lou, Dan Li, Jian Liu, Xiaohu Yang, and Ruoxi Jia

Locket: Robust Feature-Locking Technique for Language Models

Lipeng He, Vasisht Duddu, and N. Asokan

Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance

Jiawen Zhang, Lipeng He, Kejia Chen, Jian Lou, Jian Liu, Xiaohu Yang, and Ruoxi Jia

Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense

Jiawen Zhang*, Kejia Chen*, Lipeng He*, Jian Lou, Dan Li, Zunlei Feng, Mingli Song, Jian Liu, Kui Ren, and Xiaohu Yang

LookAhead: Preventing DeFi Attacks via Unveiling Adversarial Contracts

Shoupeng Ren, Lipeng He, Tianyu Tu, Di Wu, Jian Liu, Kui Ren, and Chun Chen

Secure Transformer Inference Made Non-interactive

Jiawen Zhang, Xinpeng Yang, Lipeng He, Kejia Chen, Wen-jie Lu, Yinghao Wang, Xiaoyang Hou, Jian Liu, Kui Ren and Xiaohu Yang

On the Atomicity and Efficiency of Blockchain Payment Channels

Di Wu, Shoupeng Ren, Yuman Bai, Lipeng He, Jian Liu, Wu Wen, Kui Ren, et al.

Revealing and Benchmarking the Safety Risks in Blockchain Agents

Jiawen Zhang, Kejia Chen, Lipeng He, Yechao Zhang, Jian Liu and Xiaohu Yang

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Jialin Yang, Dongfu Jiang, Lipeng He, Sherman Siu, Yuxuan Zhang, Disen Liao, Benjamin Schneider, Ping Nie, Wenhu Chen, et al.

FedVLP: Visual-aware Latent Prompt Generation for Multimodal Federated Learning

Hao Pan, Xiaoli Zhao, Yuchen Jiang, Lipeng He, Bingquan Wang, and Yincan Shu

A Survey of Multimodal Federated Learning: Background, Applications, and Perspectives

Hao Pan, Xiaoli Zhao, Lipeng He, Yicong Shi, and Xiaogang Lin

A Comparative Examination of Network and Contract-Based Blockchain Storage Solutions for Decentralized Applications

Citations

Defending against Adaptive Prompt Injection Attacks via Reasoning-enabled Task Alignment

Lipeng He, Yihan Wang, Jiawen Zhang and N. Asokan
Under Submission

Backdooring Bias in Large Language Models

Anudeep Das, Prach Chantasantitam, Gurjot Singh, Lipeng He, Mariia Ponomarenko, and Florian Kerschbaum
Under Submission

From Detection to Diagnosis: Lightweight Federated Prompt Learning for Interpretable Industrial Anomaly Analysis

Hao Pan, Xiaoli Zhao, Lipeng He, and Xiwu Shang
Under Submission

Token-by-Token Manipulation: Inference-Time Jailbreaking on Production LLMs via Autoregressive Harmful Guidance

Jiawen Zhang, Lipeng He, Kejia Chen, Jian Liu, Zunlei Feng, Mingli Song, Jian Lou, Dan Li, and Xiaohu Yang
Under Submission

Cybersecurity and Privacy Institute (CPI) Graduate Student Conference 2026

Locket: Robust Feature-Locking Technique for Language Models

Cybersecurity and Privacy Institute (CPI) Graduate Student Conference 2025

Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense

Program Committee Member

USENIX Security Symposium 2026

Artifact Evaluation

Program Committee Member

Privacy Enhancing Technologies Symposium (PoPETs/PETS) 2026

Artifact Evaluation

Program Committee Member

ACM Conference on Computer and Communications Security (CCS) 2025, 2026

Artifact Evaluation

Invited Reviewer

IEEE Transactions on Dependable and Secure Computing (TDSC)

Student Member

Association for Computing Machinery (ACM)

lipenghe@acm.org

Directed Reading Program (DRP) Mentor

AI Safety and Security Challenges in LLM-based Autonomous Agents (Spring 2026)

Women in Mathematics (WiM)

University of Waterloo Graduate Scholarship

CAD 4,000

University of Waterloo

AWS Startup Activate Credits (Portfolio)

USD 25,000

Amazon, Y Combinator

Lambda Research Grant Program

USD 5,000; Principal Investigator: N. Asokan

λ (Lambda) AI

David R. Cheriton Graduate Scholarship

CAD 10,000

University of Waterloo

International Master's Award of Excellence (IMAE)

CAD 7,500

University of Waterloo

University of Waterloo logo

University of Waterloo

Teaching Assistant (TA)

Jan 2026 - Present

CS 436 Networks and Distributed Computer Systems

University of Waterloo logo

University of Waterloo

Instructional Apprentice (IA)

Sept 2025 - Dec 2025

CS 135 Designing Functional Programs

University of Waterloo logo

University of Waterloo

Instructional Support Assistant (ISA)

Aug 2024 - Dec 2024

CS 135 Designing Functional Programs

University of Waterloo logo

University of Waterloo

Research Assistant (URA)

Jan 2025 - Present

Cryptography, Security, and Privacy (CrySP) Lab

Zhejiang University logo

Zhejiang University

Research Assistant

May - Aug 2024

ABC Lab, Institute of Cyberspace Research

LinkedIn
Retrograde Labs logo

Retrograde Labs

Co-Founder

May 2026 - Present

Accelerating frontier scientific discovery and commercialization

Bluelet AI logo

Bluelet AI

Co-Founder & CTO

May 2025 - June 2025

Agentic AI and data platform solutions for talent acquisition and matching

BioRender logo

BioRender

Full Stack Software Engineer

Jan - Apr 2023

SaaS, Y Combinator W18

Toronto, ON

Safyre Labs logo

Safyre Labs

Back End Software Engineer

May - Aug 2022

E-Commerce Platform, Supply Chain

North York, ON

Robinhood logo

Robinhood

Front End Software Engineer

Sep - Dec 2021

Bitbuy (Cryptocurrency Exchange), TSX: WNDR

Toronto, ON

University of Waterloo logo

University of Waterloo

Master's Degree (Research/Thesis)

Sep 2025 - Present

Computer Science

University of Waterloo logo

University of Waterloo

Honours Bachelor's Degree (Co-op)

Sep 2020 - Apr 2025

Mathematics (Minor in Computing)

Nanyang Technological University logo

Nanyang Technological University

Exchange Student (GEM Trailblazer)

Aug 2023 - Dec 2023

Mathematical Sciences

Podcast

New Article Everytime I Publish :)