sastpg
🎯
Focusing

Popular repositories

  1. AI-Exercise Public

    Exercise System Based on Pose Estimation

    Python · 48 stars · 11 forks

  2. RFTT Public

    RFTT: Reasoning with Reinforced Functional Token Tuning

    Python · 29 stars · 1 fork

  3. CoVo Public

    Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning

    Python · 25 stars

  4. bot-on-anything Public

    Forked from zhayujie/bot-on-anything

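    Mainly fixes the problem of newbing replies being blocked by Microsoft's filter; it can now successfully stop Microsoft from intercepting and retracting text messages. The dev-qq branch adds support for sending QQ images. PRs welcome!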

    Python · 10 stars · 3 forks

  5. voyager-llama Public

    Llama implementation of Voyager

    JavaScript · 7 stars · 1 fork

  6. HIR Public

    Official codebase of paper "Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following"

    Python · 6 stars