
Stone Tong ZHANG

I am currently a research intern at vivo AI Research, working on controllable image generation, latent world models, and multimodal reasoning. Previously, I received my M.S. in Computer Engineering from the University of California, Irvine and my B.Eng. in Computer Science and Engineering from Southern University of Science and Technology. My research focuses on bridging vision and language through structured representation, world simulation, and step-by-step visual reasoning. Feel free to get in touch!

Email GitHub LinkedIn CV(EN|CN)

Academic Projects

Unintended Side Effects of Defense Mechanisms in Large Language Models

Prof. Haizhou Li, CUHK-Shenzhen
PyTorch, LaTeX

Human-Readable SVG Generation with Vision Language Models

Assistant Prof. Haohan Wang, UIUC
PyTorch

One-shot Controllable Head Avatar Creation

Prof. Xiaohui Xie, UCI
PyTorch

Trajectory Prediction and Driving Video Captioning

Assistant Prof. Hao Zhao, THU
NumPy, PyTorch

Professional Experience

Image Editing via Reasoning

vivo AI Research
PyTorch, Python

Lightweight OCR Models for OpenCV

OpenCV @ Google Summer of Code
PyTorch, ONNX, C++

Publications

  • Tong Zhang, Yiming Chen, Simin Chen, Zexin Li, Xianghu Yue, Cong Liu, Chenyu You, Wei Yang, Haizhou Li, and Tao Xie, “Unintended Side Effects of Defense Mechanisms in Large Language Models: A Comprehensive Study”, Under Review, 2025.
  • Tong Zhang, Haoyang Liu, Peiyan Zhang, Yuxuan Cheng, and Haohan Wang, “Beyond Pixels: Exploring Human-Readable SVG Generation for Simple Images with Vision Language Models”, Preprint, 2024.
  • Haoyu Ma, Tong Zhang, Shanlin Sun, Xiangyi Yan, Kun Han, and Xiaohui Xie, “CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer”, IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024.
  • Bu Jin, Xinyu Liu, Yupeng Zheng, Pengfei Li, Hao Zhao, Tong Zhang, Yuhang Zheng, Guyue Zhou, and Jingjing Liu, “ADAPT: Action-aware Driving Caption Transformer”, IEEE International Conference on Robotics and Automation (ICRA), 2023.

Facts about Me

  • Hometown: Wuhan
  • Idol: Richard Feynman
  • Dream: To be a great researcher and design influential software
  • I enjoy finding research topics in active discussions and take pride in my creativity. I also write poems and blog posts.