Shiyu Huang 黄世宇Researcher, Zhipu AI
No.1 Zhongguancun East Road, Haidian District [OpenRL]     [知乎]     [Google Scholar]     [TARTRL]     [GitHub]     [Linkedin]     [CV] |
![]() |
I am a researcher in Zhipu AI. Before that, I was a research scientist
in 4Paradigm Inc. and the leader of OpenRL Lab. I received my B.E. and Ph. D. degrees
(co-advised by Prof. Jun
Zhu and Prof. Ting
Chen) from
the Department of Computer Science and Technology, Tsinghua University in
July, 2017 and June, 2022.
My researches focus on deep reinforcement learning, multi-agent reinforcement learning, distributed
reinforcement learning,
RL for robotics, LLM as agent, artificial general intelligence (AGI) and generative artificial intelligence
(GAI).
I have also spent time working at
RealAI Inc. ,
Huawei Noah's Ark Lab,
Tencent AI Lab,
Carnegie Mellon University
and Sensetime Inc. . And I am also the founder of the
OpenRL Lab() and TARTRL group.
@misc{chen2024softqmixintegratingmaximumentropy, title={Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization}, author={Wentse Chen and Shiyu Huang and Jeff Schneider}, year={2024}, eprint={2406.13930}, archivePrefix={arXiv}, primaryClass={cs.LG} url={https://arxiv.org/abs/2406.13930}, }
@misc{wang2024lvbench, title={LVBench: An Extreme Long Video Understanding Benchmark}, author={Weihan Wang and Zehai He and Wenyi Hong and Yean Cheng and Xiaohan Zhang and Ji Qi and Shiyu Huang and Bin Xu and Yuxiao Dong and Ming Ding and Jie Tang}, year={2024}, eprint={2406.08035}, archivePrefix={arXiv}, primaryClass={cs.CV} }
@misc{xiong2024mqeunleashingpowerinteraction, title={MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment}, author={Ziyan Xiong and Bo Chen and Shiyu Huang and Wei-Wei Tu and Zhaofeng He and Yang Gao}, year={2024}, eprint={2403.16015}, archivePrefix={arXiv}, primaryClass={cs.RO}, url={https://arxiv.org/abs/2403.16015}, }
@article{huang2023openrl, title={OpenRL: A Unified Reinforcement Learning Framework}, author={Huang, Shiyu and Chen, Wentse and Sun, Yiwen and Bie, Fuqing and Tu, Wei-Wei}, journal={arXiv preprint arXiv:2312.16189}, year={2023} }
2020 Spring, TA in Big Data and Machine Intelligence, instructed by Zhen Chen
2019 Fall, TA in Big Data and Machine Intelligence, instructed by Zhen Chen
2019 Spring, TA in Machine Learning, instructed by Prof. Jun Zhu