Posts

The importance of action chunking in imitation learning

Action chunking is an open-loop control where at every control time step, a policy outputs a chunk (sequence) of actions into the future given the current observation. Usually the action sequence will be fully or partially executed before the next control time step.

Dec 31, 2024 6 min read

The importance of action chunking in imitation learning

Why LLM+low-level control might fail to scale

A current trend of combing LLM (LVM) with robotics is to first use an LLM to decompose a high-level task into several subtasks given the instruction and scene image. For example, given an instruction “Put all toys into the basket.

Nov 10, 2024 4 min read

Why LLM+low-level control might fail to scale

On Sim2Real Transfer in Robotics （Part 3/3)

Vision Sim2Real Gap Two types of robot sensors When diving into the world of robotics, it’s crucial to understand the fundamental role sensors play in guiding a robot’s actions. Broadly speaking, sensors fall into two main categories: proprioceptive and exteroceptive.

May 5, 2024 12 min read

On Sim2Real Transfer in Robotics （Part 3/3)

On Sim2Real Transfer in Robotics （Part 2/3)

Physics Sim2Real Gap (continued) System identification errors Controller in the loop of policy control A robot typically consists of multiple actuators such as gear motors, each tasked with moving one or more joints.

Apr 12, 2024 11 min read

On Sim2Real Transfer in Robotics （Part 2/3)

On Sim2Real Transfer in Robotics （Part 1/3)

Why Scaling Hasn’t Happened in the World of Robotics Navigating the landscape of general-purpose robots today, one of the most pressing challenges we face revolves around the lack of data. This scarcity of data, particularly in terms of robot action, stands as a major roadblock hindering the widespread adoption of deep learning models within real-world robotic applications.

Mar 22, 2024 11 min read

On Sim2Real Transfer in Robotics （Part 1/3)

The key to solving LLM hallucinations (解决大语言模型幻觉的关键)

The output of large language models can be divided into two categories: the first is philosophical thoughts that are metaphysical and cannot be confirmed or falsified by current scientific methods,

Jan 15, 2024 5 min read

The key to solving LLM hallucinations (解决大语言模型幻觉的关键)

Do LLMs learn world models? (大语言模型学到了世界模型吗？)

The short answer is, they do learn, but they also fail to learn world models. The title of this article encompasses two concepts: “large language models” and “world models.” In

Oct 5, 2023 8 min read

Do LLMs learn world models? (大语言模型学到了世界模型吗？)

What is language grounding and why is it needed for an AI agent?

I’ve been asked by quite a few people about what “language grounding” means. So I think I’ll write a short post explaining it, and specifically arguing why it is important to a truly intelligent agent.

Dec 19, 2022 6 min read

What is language grounding and why is it needed for an AI agent?

A two-way switch example to better understand Total Correlation

Recently, I was working on a project that requires learning a latent representation with disentangled factors for high-dimensional inputs. For a brief introduction to disentanglement, while we could use an autoencoder (AE) to compress a high-dimensional input into a compact embedding, there is always dependence among the embedding dimensions, meaning that multiple dimensions always change together in a dependent way.

Sep 3, 2022 4 min read

A two-way switch example to better understand Total Correlation

A scoring model for the importance of research problems

The other day I was reading the article “You and your research”, transcribed from a seminar by Richard Hamming. There is one paragraph about “choosing important problems” which I think is inspirational:

Jul 1, 2022 4 min read

A scoring model for the importance of research problems