Built and maintain a Python CLI tool that keeps shared GPUs active with lightweight keep-alive workloads, plus a matching Python API.
Projects & Open Source
Projects & Competitions
Built LoRA‑fine‑tuned models with pseudo‑labeling and ensembles; earned Silver Medal (39/1,849 teams).
Lightweight Monte‑Carlo Tree Search pipeline for LLM agents; supports OpenAI/DeepSeek APIs with optimized rollout policy.
2.5D narrative adventure; produced teaser and achieved charity certification from Beijing New Sunshine Charity Foundation.
Open Source Contributions
Contributed SDK improvements for model naming compatibility, log tooling, and message/image serialization in Responses workflows.
Pull Requests
Fixed rope scaling override order so Hugging Face config conversions stay correct after legacy overrides.
Extended-context fixes and truncation improvements for RL pipelines (YaRN), with tests across rollout workers.
Long-context evaluation and analysis: reproducible data synthesis, 128k LLaMA3-7B single‑GPU inference, and Phi3 retrieval-head support.