🐭
Poor little mouse me, I'm doomed again
  • Sun Yat-sen University
  • Guangzhou, China

zsychina/README.md

Hi there 👋

  • 🔭 Education Experience:

    Undergraduate: DLUT School of Automation@20FALL

    Master: SYSU School of Computer Science@24FALL

  • 🌱 Research Focus:

    I'm interested in reinforcement learning, agents, and using RL to strengthen the decision-making ability of LLM agents.

Looking forward to making friends and cooperating with you!

Pinned

  1. PrefTransPPO Public

    Uses a Preference Transformer to learn a reward function from a dataset, then trains an agent with PPO

    Python

  2. ppo-vanilla Public

    Minimal PPO implementation

    Python

  3. ppo-continuous Public

    PPO for continuous action spaces

    Python

  4. sysu-select-course-script Public

    Course selection script for Sun Yat-sen University graduate students

    Python

  5. GA-PID-Optimize Public

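    PID parameter tuning with a genetic algorithm, for the Dalian University of Technology '23 courses "Modern Intelligent Optimization Algorithms" and "Computer Control Technology Course Design"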

    Python 1

  6. ppo-transformer Public

    GPT-2-style transformer for sequential decision making in Gym environments

    Python

zsychina (Siyuan Zhu) · GitHub