Stone Tong ZHANG

I am a Ph.D. student in Computer Science at Fudan University, working with Prof. Tao Xie and Prof. Wei Yang. Previously, I received my M.S. in Computer Engineering from the University of California, Irvine, and my B.Eng. from Southern University of Science and Technology.

My research focuses on bridging vision and language through structured representation, world simulation, and step-by-step visual reasoning. Feel free to get in touch!

“What magical trick makes us intelligent? The power of intelligence stems from our vast diversity, not from any single, perfect principle.”

— Marvin Minsky, The Society of Mind, p. 308


Selected Projects

Unintended Side Effects of Defense Mechanisms in Large Language Models

LLM defense project figure
Prof. Haizhou Li, CUHK-Shenzhen
PyTorch

Human-Readable SVG Generation with Vision Language Models

SVG generation project figure
Assistant Prof. Haohan Wang, UIUC
PyTorch

One-shot Controllable Head Avatar Creation

Head avatar project figure
Prof. Xiaohui Xie, UCI
PyTorch

Trajectory Prediction and Driving Video Caption

Trajectory prediction project figure
Assistant Prof. Hao Zhao, THU
PyTorch

Professional Experience

Image Editing via Reasoning

vivo AI Research
PyTorch

Lightweight OCR Models for OpenCV

OpenCV @ Google Summer of Code 2022
PyTorch, ONNX, C++

Selected Publications

For a more complete list, please see my CV.

Misc