I am a final-year PhD student in the Department of Computer Science and Technology, Tsinghua University. I am fortunate to be supervised by Professors Maosong Sun and Zhiyuan Liu. In the summer of 2019, I visited Mila and conducted research under Professor Jian Tang.

My current passion revolves around building SCALABLE solutions to ASI (AGI is too vague to be a target, and its first version will be achieved soon), meaning solutions that improve simply by adding more compute and data. This includes:

  1. Scaling Pretraining. Ensuring that the growth of language models' abilities is measurable and predictable, which may in turn guide us toward better scaling laws under better paradigms.

  2. Scaling RL. This includes scaling RL training compute, inference compute, data sources, and RL horizon (the o1 paradigm).

  3. Scaling World Models. Unifying modalities and training objectives, then scaling with pretraining and RL.

The above objectives are very broad, so I focus on one point at a time. Currently, I am working on "Better Architecture for RL Scaling".

In fact, I believe we will not have achieved ASI until models can conduct scientific research independently. There is still a long way to go. Keep fighting!

Selected Publications

For other papers, please refer to my Google Scholar profile.

Selected Projects


Awards

Master's & PhD

  • National Natural Science Foundation of China (NSFC) Doctoral Project Leader (~1 in 100 across all subjects and institutions in China)

  • Siebel Scholar, Class of 2023

    1 of 83 worldwide; prize of $30,000

  • National Scholarship, 2021

    One of the highest awards at Tsinghua University.

Bachelor

  • Academic Excellence Award, 2016–2017, 2017–2018, 2018–2019
  • Good Reading Scholarship, 2018–2019
  • Zhuzhou Scholarship, 2017–2018

Earlier

  • Silver Medal, 32nd National Middle School Physics Competition