
I am currently a software engineer at ByteDance, leading the Volcano Machine Learning Platform (veMLP) team. Before joining ByteDance, I was a principal software engineer at Tencent.
My passion for open-source has led me to initiate and lead several impactful projects, including:
From 2022 to 2023, I gained valuable experience in the startup world. I was a technical partner at LightYearAI, where I built and led a team of over 20 engineers focused on data curation for large language model pre-training. The company was acquired by Meituan just three months after its inception. Prior to that, I was the CTO of HPCAI-Tech for a whole year, a startup dedicated to open-source AI infrastructure at that time.
Earlier in my career, I was a senior engineer at WeChat AI (Tencent), where I focused on improving the efficiency of AI applications through parallel computing. I also contributed to the development of foundational modules in the WeChat app, such as the WeChat Input Method Engine.
I earned my Ph.D. in Computer Science from Tsinghua University in 2019, advised by Prof. Guangwen Yang and Prof. Haohuan Fu. My doctoral research was titled Parallel Deep Learning Training System on Sunway TaihuLight. I was also a visiting scholar at the University of California, Davis, supervised by Cho-Jui Hsieh from 2018 to 2019. I received my B.S. in Computer Science from Beijing University of Posts and Telecommunications in 2014.
Outside of work, I enjoy jogging, football, swimming, and table tennis. You can follow my work and interests on 知乎 and Github.