Stone Tong ZHANG

I am currently a research intern at vivo AI Research. Previously, I received my M.S. in Computer Engineering from the University of California, Irvine and my B.Eng. from Southern University of Science and Technology.

My research focuses on bridging vision and language through structured representation, world simulation, and step-by-step visual reasoning. Feel free to get in touch!

Email / GitHub / LinkedIn

“What magical trick makes us intelligent? The power of intelligence stems from our vast diversity, not from any single, perfect principle.”

— Marvin Minsky, The Society of Mind, p. 308

Stone Tong Zhang portrait

Academic Projects

Unintended Side Effects of Defense Mechanisms in Large Language Models

Prof. Haizhou Li, CUHK-Shenzhen
PyTorch

Human-Readable SVG Generation with Vision Language Models

Assistant Prof. Haohan Wang, UIUC
PyTorch

One-shot Controllable Head Avatar Creation

Prof. Xiaohui Xie, UCI
PyTorch

Trajectory Prediction and Driving Video Caption

Assistant Prof. Hao Zhao, THU
PyTorch

Professional Experience

Image Editing via Reasoning

vivo AI Research
PyTorch

Lightweight OCR Models for OpenCV

OpenCV @ Google Summer of Code 2022
PyTorch, ONNX, C++

Publications

Misc

This page has been accessed several times since Jan. 10, 2023.