Shiyao Xu (徐诗瑶)'s Homepage

Shiyao Xu (徐诗瑶)

Email: xsy9915[at]gmail.com/shiyao.xu[at]unitn.it Github CV Google Scholar Twitter Linkedin 知乎

I am Shiyao Xu (徐诗瑶), a 2nd~~1st~~-year ELLIS PhD student at MHUG (University of Trento) and IMAGINE (ENPC), supervised by Prof. Paolo Rota and Prof. Gül Varol (and 🫡Benedetta Liberatori)🙇🏾‍♀️. My research interests lie in human-centric 3D-aware understanding and generation, specifically about vision language models for human🕺🏾 motion👯‍♂️ understanding💃🏽.

I'm fortunate to work with Junlin Han, Lingzhi Li, Dr. Li Shen, Prof. Zhouhui Lian in my pervious experience.

I obtain my M.Sc. degree from Peking University, China, 2023, supervised by Prof. Zhouhui Lian, and bachelor at Dalian University of Technology, China, 2020.

I'm actively looking for internship opportunities for 2026 summer! (or anytime, just an internship🥹!) Please feel free to reach me out!!!

News!🔥

2025.11: Here we go Vancouver!!DEMO is accepted by 3DV 2026!! See you then!

2025.05: We organize the 1st workshop on Interactive Human-centric Foundation Models at ICCV 2025 in Hawaii! come and submit your work!

2025.04: I'll attend ICVSS 2025 this summer, see you in Sicily!!🏝️😎

2024.10: FD-3DGS got rejected after several submissions...😥 Fine, life'll also encounter the same thing.🥲

2024.09: Finally!!! I ended my industry experience at some startup and moved to Trento, Italy, to start my PhD journey!🇮🇹

2023.07-08: Serve as volunteer(Teaching Assistant and Research Assistant) in SGI(Summer Gemoetry Initiative) 2023.

2023.04: FINALLY! Our paper: DeSRF: Deformable Stylized Radiance Field is accepted by CVPR 2023 Workshop: Generative Models for Computer Vision. See you in Vancouver, Canada (if my visa is approved)!

2022.10: Recieve a graduate school scholarship💰!

2022.08: Happy to announce that our paper "Your3dEmoji" is accepted by SIGGRAPH ASIA 2022 Tech. Comm.!🤪 See you in Korea!

Publications and Preprints

	Dense Motion Captioning Shiyao Xu, Benedetta Liberatori, Gül Varol, Paolo Rota 3DV 2026 [Project] [PDF] [Code] [Dataset] We propose dense motion captioning task together with a complex motion dataset CompMo includes 60,000 motion sequences, each composed of multiple actions ranging from at least 2~10, and a model DEMO for that task.
	FD-3DGS: Flexible Disentangled 3DGS for Scenes Understanding and Manipulation Shiyao Xu, Junlin Han, Jie Yang. got rejected by some conference🥲it's ok, both life and research will encounter some rejections🥹. you can see it below. [PDF] We propose FD-3DGS to distill the semantic information into 3D Gaussians and directly manipulate 3D Gaussians using language.
	DeSRF: Deformable Stylized Radiance Field Shiyao Xu, Lingzhi Li, Li Shen, Zhouhui Lian CVPRW 2023, CVPR Workshop on Generative Models for Computer Vision [Project] [PDF] [Code](tbc) [Poster] We propose a more efficient method, DeSRF, to stylize the radiance field, which also transfers style information to the geometry according to the input style.
	Your3dEmoji: Creating Personalized Emojis via One-shot 3D-aware Cartoon Avatar Synthesis Shiyao Xu, Lingzhi Li, Li Shen, Yifang Men, Zhouhui Lian SIGGRAPH ASIA 2022 Technical Communication [Project](tbc) [PDF] [DOI] [Code] We propose a novel 3D generative model to translate a real-world face image into its corresponding 3D avatar with only a single style example provided. Our model is 3D-aware in sense and also able to do attribute editing, such as smile, age, etc directly in the 3D domain.
	Dynamic Texture Transfer using PatchMatch and Transformers Guo Pu, Shiyao Xu, Xixin Cao, Zhouhui Lian [PDF] finally available on arxiv... but this is my first project;-) We propose an automatically method to transfer the dynamic texture of a given video to a still image. Abstract How to automatically transfer the dynamic texture of a given video to the target still image is a challenging and ongoing problem. In this paper, we propose to handle this task via a simple yet effective model that utilizes both PatchMatch and Transformers. The key idea is to decompose the task of dynamic texture transfer into two stages, where the start frame of the target video with the desired dynamic texture is synthesized in the first stage via a distance map guided texture transfer module based on the PatchMatch algorithm. Then, in the second stage, the synthesized image is decomposed into structure-agnostic patches, according to which their corresponding subsequent patches can be predicted by exploiting the powerful capability of Transformers equipped with VQ-VAE for processing long discrete sequences. After getting all those patches, we apply a Gaussian weighted average merging strat- egy to smoothly assemble them into each frame of the target stylized video. Experimental results demonstrate the effectiveness and superiority of the proposed method in dynamic texture transfer compared to the state of the art.

Working Experiences

	2024.07 - 2024.09: 3D Algorithm Engineer at Math Magic.
	2023.07 - 2024.05: Research Scientist at Cybever Inc., Mountain View (remotely).
	2021.08 - 2023.07: Research Intern in DAMO Academy, Alibaba Group. Mentored by Lingzhi Li, Supervised by Dr. Li Shen.
	2021.07 - 2021.08: Machine Learning Intern at Apple Inc., Beijing, China.

Education

	2024.09 - : ELLIS PhD student at University of Trento, Itlay. also in Center for Mind and Brain (CIMeC) Supervised by Prof. Paolo Rota and Prof. Gül Varol(ENPC). Working on 3D human motion understanding.
	2020.09 - 2023.06: M.Sc. in Wangxuan Institute of Computer Techonology(WICT) at Peking University, China. Supervised by Prof. Zhouhui Lian. Worked on 3D-aware Generation, Style Transfer, Neural Rendering. Thesis: 3D-aware Style Transfer based on Neural Radiance Field.
	2016.09 - 2020.06: B.Eng. in School of Software at Dalian University of Technology, China. Major in Big Data and Machine Learning.

Academic Services

Teaching Assistant: Foundation Models for master students at DISI, University of Trento, Fall 2025

Teaching Assistant: Introduction to Computer Programming (Python) for master students at CIMeC, University of Trento, Fall 2025

Teaching Assistant: SGI(Summer Gemoetry Initiative), Summer 2023

Teaching Assistant: Elementary Number Theory for undergraduate students at Peking University, Spring 2021

Reviewer: 3DV 2026, CVPR 2025, WACV 2025

Workshop: 1st-IHFM @ICCV2025

Volunteer: ECCV 2024, SIGGRAPH 2022

Selected Awards

Doctoral Student Scholarship, University of Trento

Graduate School Scholarship at Peking University, 2022

Hackathon PKU Competition rank 2/30 (¥10,000), 2021

Misc

🐟 A DDL chaser (usually failed in most cases).

⛸️ An animation connoisseur (I like hand-painting and independent animations).

🚶🏻‍♀️ A daydreamer who wants to be an athlete 🏃🏻‍♀️⚽️🏊🏻‍♀️. Recently I started my ~~CrossFit training💪🏼 and~~ (sry too expensive🥲) ~~half-~~marathon training🏃🏻‍♀️.(yes! current half-marathon time is 1h56min!💪🏼still tuned!)

⚽️ Was a member of PKU Women Football Club. Also a founder of our college women's football team.(I'm a fan of Arsenal F.C.🔫🔴⚪️)

🏆 We are the champion of Inter-faculty Women's Football Competition in Peking University Cup, 2022-2023!!

Build the bridge between 2D and 3D world. Do some cool research!😎

Last modified: 05/12/2025