|
I'm an undergraduate at Tongji University, exploring the intersection of Machine Learning Systems, LLM training / inference / serving, deep learning compilers, agent infrastructure systems, and operating systems. My recent work focuses on:
I enjoy building low-level systems that make high-level intelligence run faster, cheaper, and more reliably. |
|
Semantics-aware KV cache eviction for long-context LLM inference
|
Fused CUDA kernels for efficient LLM decoding
|
|
Rust-based POSIX-compatible kernel for RISC-V64
|
Chord-based distributed dense retrieval and RAG pipeline
|
|
Global Campus AI Algorithm Challenge |
iGEM |
MLSys Full Stack โโโโโโโโโโโโโโโโโโโโโ 95%
LLM Inference Serving โโโโโโโโโโโโโโโโโโโโโ 90%
CUDA Kernel Design โโโโโโโโโโโโโโโโโโโโโ 80%
DL Compiler โโโโโโโโโโโโโโโโโโโโโ 75%
RISC-V Systems โโโโโโโโโโโโโโโโโโโโโ 75%
Agent Infrastructure โโโโโโโโโโโโโโโโโโโโโ 70%


