photo

Jiarui Fang (方佳瑞)

fangjiarui123 AT gmail.com

From Feburary 2022 to March 2023, I was the CTO of a startup company focused on open-sourced AI infrastructures, where I led the project ColossalAI, a training framework for large language models.

Before 2022, I was a senior engineer at Wechat AI, Tencent in Beijing. My work was focused on improving the efficiency of online and offline AI applications with innovative parallel computing techniques. I also took part in the development of some basic modules in the WeChat App, including the WeChat Input Method Engine and the WeChat Translation System. I initialized and authored some open-sourced software, e.g. TurboTransformers, a fast runtime for transformer inference on CPU and GPU, PatrickStar, a parallel training framework for large language models.

I received a Ph.D. in Computer Science from Tsinghua University in 2019. My advisors are Prof. Guangwen Yang and Prof. Haohuan Fu. My research focused on applying High-Performance Computing (HPC) for scientific applications. The title of my doctoral dissertation is Parallel Deep Learning Training System on Sunway TaihuLight. I served as a visiting scholar under the supervision of Cho-Jui Hsieh at the University of California, Davis, from 2018 to 2019.

I enjoy sports. My hobbies include jogging, football, swimming, and table tennis. I am a billiards fan and won the bronze medal in the WXG Beijing League 2020. I was an organizer of a table football league in my office building in 2020.