LLM-in-Sandbox Elicits General Agentic Intelligence

by Daixuan Cheng, Shaohan Huang, Yuxian Gu, Huatong Song, Guoxin Chen, Li Dong, Wayne Xin Zhao, Ji-Rong Wen, Furu Wei

Jan 23, 2026 (duration 08:30)

LLM-in-Sandbox, Agentic Intelligence, Reinforcement Learning (LLM-in-Sandbox-RL), Generalization Across Domains