2025 Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening Andre He, Daniel Fried, and Sean Welleck 2025 EMNLP 2025 (Oral) arXiv 2024 BridgeData V2: A Dataset for Robot Learning at Scale Homer Walke, Kevin Black, Abraham Lee, and 11 more authors 2024 CoRL 2023 arXiv 2023 Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control Vivek Myers, Andre He, Kuan Fang, and 7 more authors 2023 CoRL 2023 arXiv 2022 Understanding Game-Playing Agents with Natural Language Annotations Nicholas Tomlin, Andre He, and Dan Klein 2022 ACL 2022 arXiv Neural Unsupervised Reconstruction of Protolanguage Word Forms Andre He, Nicholas Tomlin, and Dan Klein 2022 ACL 2023 arXiv