Skip to content
View alphadl's full-sized avatar
🎯
hiring @ alibaba https://liamding.cc/hiring.html
🎯
hiring @ alibaba https://liamding.cc/hiring.html

Highlights

  • Pro

Block or report alphadl

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
alphadl/README.md

Hi there

🙋‍♂️ I am building a deterministic agentic AI ecosystem at Alibaba. I was the chief scientist at a startup (raised more than 50M$), previously worked at JD Explore Academy and Tencent AI Lab, and held an adjunct researcher position at ZJU.

🔭 Working on the whole pipeline of LLM R&D and their human-centric applications, including efficient and sufficient training, alignment, evaluations, compression, multilinguality, multimodality, agentic application, and much more.

💪 I'm keen on bodybuilding (5 years+), marathon (completed first half marathon (126min) in Beijing-2016 and most recent half marathon (86min) in Sydney-2019����. will resume training in 2024💪🏻).

🥗 I (once😅) enjoy cooking.

🐈 I like to spend Sundays with my cats (two from 2020-2023, one from 2023).

🔥 Recent open-source projects — agentic AI (data, evaluation, context) and LLM alignment / policy optimization:

  • 🔄 AgentHER Hindsight relabeling of failed trajectories for training.
  • 🧬 AgentSynth Synthetic agent data from scratch with execution validation.
  • 📏 AdaRubric Dynamic rubric evaluation for trajectory quality.
  • 🗜️ trajectory_tokenization ReAct with compressed history for long-horizon context.
  • 📡 SigFibPO SNR-calibrated trust regions and causal fiber residuals for multi-domain RLVR (research code + verl hook).

Pinned Loading

  1. THUNLP-MT/MT-Reading-List THUNLP-MT/MT-Reading-List Public

    A machine translation reading list maintained by Tsinghua Natural Language Processing Group

    TeX 2.4k 442

  2. lookahead.pytorch lookahead.pytorch Public

    lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch

    Python 336 63

  3. AgentHER AgentHER Public

    AgentHER: Hindsight Experience Replay for LLM Agents

    Python 92 10

  4. AdaRubrics AdaRubrics Public

    AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories

    Python 340 36

  5. darts.pytorch1.1 darts.pytorch1.1 Public

    Implementation with latest PyTorch (v1.1) for multi-gpu differentiable architecture search https://arxiv.org/abs/1806.09055

    Python 83 28

  6. 3d-gen-for-llm-builders 3d-gen-for-llm-builders Public

    A hands-on guide to 3D latent diffusion for LLM/VLM builders

    Shell 27