Jiahui Zhang

I'm a visiting student at the University of Southern California (USC), advised by Prof. Erdem Biyik. I received my master's degree in Electronic Engineering from USC. Before that, I completed my undergraduate studies in the Fan Gongxiu Honors College at the Beijing University of Technology (BJUT), majoring in Electronic Information Engineering.

I spent two years as a research student at the Cognitive Learning for Vision and Robotics Lab (CLVR), advised by Prof. Joseph J. Lim. I was also a research intern at Horizon Robotics, working with Haonan Yu and Wei Xu.

My research interests lie in robot learning and its application to enabling robots to perform tasks in everyday human life. I am particularly interested in leveraging foundation models to improve robot policy learning and generalization. Additionally, I aim to develop general-purpose policies that allow different robots to perform common tasks in daily life.

Google Scholar / Twitter / LinkedIn / CV

Research

ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations (In Submission)

Jiahui Zhang*, Yusen Luo*, Abrar Anwar*, Sumedh Sontakke, Joseph Lim, Jesse Thomason, Erdem Biyik, Jesse Zhang

[Website]

We introduce an approach for learning reward functions for unseen tasks. Our approach learns a reward model based on outputs from pre-trained vision-language models; this reward model then provides rewards for policy learning.

Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance
Jesse Zhang, Jiahui Zhang, Karl Pertsch, Ziyi Liu, Xiang Ren, Minsuk Chang, Shao-Hua Sun, Joseph J. Lim

Oral presentation (top 6.6%) @ CoRL 2023

Oral presentation @ SoCal Robotics 2023

Spotlight talk @ RSS 2023 Articulate Robots Workshop

[Arxiv] [Website] [Code] [OpenReview]

Our approach BOSS (BOotStrapping your own Skills) learns to accomplish new tasks by performing "skill bootstrapping," where an agent with a set of primitive skills interacts with the environment to practice new skills without receiving reward feedback for tasks outside of the initial skill set. This bootstrapping phase is guided by LLMs that inform the agent of meaningful skills to chain together. Through this process, BOSS builds a wide range of complex and useful behaviors from a basic set of primitive skills.

SPRINT: Scalable Semantic Policy Pretraining via Language Instruction Relabeling
Jesse Zhang, Jiahui Zhang, Karl Pertsch, Joseph J. Lim

Poster @ ICRA 2024

Spotlight talk @ CoRL 2022 LangRob Workshop

Spotlight talk @ CoRL 2022 Pre-training Robot Learning Workshop

[Arxiv] [Website] [Code]

We propose SPRINT, a scalable approach for pre-training robot policies with a rich repertoire of skills while minimizing human annotation effort. Given a dataset of robot trajectories with an initial set of task instructions for offline pre-training, SPRINT expands the pre-training task set without additional human effort via language-model-based instruction relabeling and cross-trajectory skill chaining.

Cross Domain Imitation Learning via MPC (Internship Project)
Jiahui Zhang, Haonan Yu, Wei Xu
[Website]

We introduce CDMPC, an approach for learning new skill combinations from long-horizon skill trajectories. CDMPC learns to chain short-horizon skills from long-horizon demonstrations that span diverse source domains and include various skill combinations, and it integrates these skills with a low-level policy in the target domain. The policy learned by CDMPC adapts to tasks from any source domain and enables the agent to tackle new tasks that require novel skill combinations.

Service

Served as a reviewer

Awards

Presidential Scholarship

Outstanding Research Achievement Award

Template borrowed from Jon Barron's website