Shiyu Huang 黄世宇Researcher, Zhipu AI
No.1 Zhongguancun East Road, Haidian District [OpenRL]     [知乎]     [Google Scholar]     [GitHub]     [TARTRL]     [Linkedin]     [CV] Visitors: 6389 |
|
I am a researcher in Zhipu AI. Before that, I was a research scientist in 4Paradigm Inc. and the leader of OpenRL Lab. I received my B.E. and Ph. D. degrees (co-advised by Prof. Jun Zhu and Prof. Ting Chen) from the Department of Computer Science and Technology, Tsinghua University in July, 2017 and June, 2022. My researches focus on deep reinforcement learning, multi-agent reinforcement learning, distributed reinforcement learning, RL for robotics, LLM as agent, artificial general intelligence (AGI) and generative artificial intelligence (GAI). I have also spent time working at RealAI Inc. , Huawei Noah's Ark Lab, Tencent AI Lab, Carnegie Mellon University and Sensetime Inc. . And I am also the founder of the OpenRL Lab() and TARTRL group.
We are looking for self-motivated interns and full-timers who have a strong background in mathematics/computer science and are eager to get involved in cutting-edge, fundamental AI research. Please feel free to drop me an email if you are interested in collaborating with me.@article{cheng2024dreampolish, title={DreamPolish: Domain Score Distillation With Progressive Geometry Generation}, author={Cheng, Yean and Cai, Ziqi and Ding, Ming and Zheng, Wendi and Huang, Shiyu and Dong, Yuxiao and Tang, Jie and Shi, Boxin}, journal={arXiv preprint arXiv:2411.01602}, year={2024} }
@article{hong2024cogvlm2, title={CogVLM2: Visual Language Models for Image and Video Understanding}, author={Hong, Wenyi and Wang, Weihan and Ding, Ming and Yu, Wenmeng and Lv, Qingsong and Wang, Yan and Cheng, Yean and Huang, Shiyu and Ji, Junhui and Xue, Zhao and others}, journal={arXiv preprint arXiv:2408.16500}, year={2024} }
@article{yang2024cogvideox, title={CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer}, author={Yang, Zhuoyi and Teng, Jiayan and Zheng, Wendi and Ding, Ming and Huang, Shiyu and Xu, Jiazheng and Yang, Yuanming and Hong, Wenyi and Zhang, Xiaohan and Feng, Guanyu and others}, journal={arXiv preprint arXiv:2408.06072}, year={2024} }
@misc{zhang2024surveyselfplaymethodsreinforcement, title={A Survey on Self-play Methods in Reinforcement Learning}, author={Ruize Zhang and Zelai Xu and Chengdong Ma and Chao Yu and Wei-Wei Tu and Shiyu Huang and Deheng Ye and Wenbo Ding and Yaodong Yang and Yu Wang}, year={2024}, eprint={2408.01072}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/abs/2408.01072}, }
@misc{chen2024softqmixintegratingmaximumentropy, title={Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization}, author={Wentse Chen and Shiyu Huang and Jeff Schneider}, year={2024}, eprint={2406.13930}, archivePrefix={arXiv}, primaryClass={cs.LG} url={https://arxiv.org/abs/2406.13930}, }
@misc{wang2024lvbench, title={LVBench: An Extreme Long Video Understanding Benchmark}, author={Weihan Wang and Zehai He and Wenyi Hong and Yean Cheng and Xiaohan Zhang and Ji Qi and Shiyu Huang and Bin Xu and Yuxiao Dong and Ming Ding and Jie Tang}, year={2024}, eprint={2406.08035}, archivePrefix={arXiv}, primaryClass={cs.CV} }
@misc{xiong2024mqeunleashingpowerinteraction, title={MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment}, author={Ziyan Xiong and Bo Chen and Shiyu Huang and Wei-Wei Tu and Zhaofeng He and Yang Gao}, year={2024}, eprint={2403.16015}, archivePrefix={arXiv}, primaryClass={cs.RO}, url={https://arxiv.org/abs/2403.16015}, }
@article{huang2023openrl, title={OpenRL: A Unified Reinforcement Learning Framework}, author={Huang, Shiyu and Chen, Wentse and Sun, Yiwen and Bie, Fuqing and Tu, Wei-Wei}, journal={arXiv preprint arXiv:2312.16189}, year={2023} }
2020 Spring, TA in Big Data and Machine Intelligence, instructed by Zhen Chen
2019 Fall, TA in Big Data and Machine Intelligence, instructed by Zhen Chen
2019 Spring, TA in Machine Learning, instructed by Prof. Jun Zhu