CV
Education
- 2022 - Now
Ph.D. in Computer Science
Carnegie Mellon University, United States
- Advised by Prof. Tianqi Chen and Prof. Todd C. Mowry.
- Member of Catalyst Group.
- 2018 - 2022
B.Eng. in Computer Science
Shanghai Jiao Tong University, China
- Member of ACM Honors Class, a pilot CS program for top talented students.
- Advised by Prof. Yong Yu.
Experience
- 2022.06 - 2022.08
Research Intern
OctoML
- Working on the Machine Learning Compilation online course, as a teaching assistant.
- 2021.10 - 2021.12
Research Intern
Catalyst Research Group, Carnegie Mellon University
- Worked on automatic tensor program optimization and sparse tensor computing.
- Advised by Prof. Tianqi Chen
Talks
- 2022.01
TensorIR: An Abstraction for Tensorized Program Optimization
- At UW SAMPL Group.
- video
- 2021.11 - 2021.12
SparseTIR: A Unified Abstraction for Sparse Workload Representation and Optimization
Selected Projects
- 2023 Spring
MLC LLM, Web LLM and Web Stable Diffusion
- Web LLM and Web Stable Diffusion bring large language models and stable diffusion models completely to people's web browsers. Everything runs inside the browser accelerated by WebGPU with no server support.
- MLC LLM is a universal solution that allows any language model to be deployed natively on a diverse set of hardware backends and native applications, plus a productive framework for everyone to further optimize model performance for their own use cases. Everything runs locally with no server support and accelerated with local GPUs on your phone and laptop.
- 2021 - 2022
SparseTIR, A Tensor-Level Abstraction for Sparse Operator Optimization in Deep Learning
- In close collaboration with Zihao Ye from University of Washington.
- Code, paper and documentation.
- 2020 Fall - Now
Apache TVM, An End-to-End Machine Learning Compiler Framework
- Committer of the community.
- Author of over 40+ PRs, 13000+ lines of code.
- Reviewer of over 80+ PRs.
- 2020 Spring
Mx-Compiler
- A toy compiler implemented in Java, from Mx* (a C- and Java-like language) to RISC-V assembly code.
- Implemented many effective optimizations. The generated code has performance close to GCC O2.
- More than 15k+ lines of code overall.
- 2019 Summer
Distributed Hash Table
- Implemented two DHT protocols, Chord and Kademlia, in Go Language.
- Implemented an instant chat room system based on the Chord protocol.
Teaching
- 2022 Summer
Machine Learning Compilation
- Teaching assistant of this online course.
- Prepare and release course assignments. Answer questions in the discussion page.
- 2020 Summer
Principle and Practice of Computer Algorithms
- Leader teaching assistant of this SJTU course.
- Advised students to implement a RISC-V simulator.
- Advised students to implement two Distributed Hash Table protocols, Chord and Kademlia, in Go Language.
- 2020 Spring
Data Structure (Honor)
- Leader teaching assistant of this SJTU course.
- Taught advanced data structures which students usually do not learn in class.
- Prepared the course assignments, projects and programming exams.
Honors and Awards
- 2022
- Shanghai Excellent Graduate Award
- 2020
- National Scholarship (Top 0.2% nationwide)
- 2018
- The 4th place of the 2018 ICPC Asia Singapore Regional Contests