Robot Perception and Control

LLM for Robotics

Last updated: Jul 25, 2024
Kashu Yamazaki
kyamazak@andrew.cmu.edu

From Transformers to Foundation Models


SayCan (1/3)

With prompt engineering and scoring, we can use an LLM to break down an instruction into small, actionable steps. However, the LLM doesn't know about the scene, the embodiment, or the situation it is in. It needs what is called an affordance function!

  • A robotic value function serves as a way to provide what's feasible in the world given the current scene and embodiment.
  • The LLM checks what makes sense to do next given the grand plan, and the value function checks what is currently feasible.


SayCan (2/3)

SayCan [1] obtains a skill that is both possible and useful with LLMs by:

  • asking the LLM to interpret an instruction and score the likelihood that an individual skill makes progress towards completing the high-level instruction, and
  • using a value function that represents the probability of successfully executing that skill to select the skill to perform (see the sketch below).
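
A minimal sketch of this selection rule, assuming a hypothetical `llm_score` that returns the log-likelihood the LLM assigns to a skill given the instruction and history, and a hypothetical `value_fn` that estimates execution success from the current observation (these names are placeholders, not the SayCan API):

```python
import numpy as np

def select_skill(instruction, history, observation, skills, llm_score, value_fn):
    """Pick the skill that is both useful (LLM) and feasible (value function).

    llm_score(instruction, history, skill) -> log p(skill | instruction, history)
    value_fn(observation, skill)           -> p(success | observation, skill)
    Both callables are stand-ins for the corresponding SayCan components.
    """
    combined = []
    for skill in skills:
        usefulness = np.exp(llm_score(instruction, history, skill))  # task-grounding term
        feasibility = value_fn(observation, skill)                   # world-grounding (affordance) term
        combined.append(usefulness * feasibility)                    # combine multiplicatively
    return skills[int(np.argmax(combined))]
```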

SayCan (3/3)


PaLM-SayCan

Just by changing the LLM to the more performant PaLM, we get:

  • better performance
  • chain-of-thought prompting (illustrated below)
  • handling of queries in other languages
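
As a rough illustration of chain-of-thought prompting in this setting, the planner can be asked to write an explanation before committing to a plan; the wording below is invented for illustration and is not the actual PaLM-SayCan prompt:

```python
# Invented chain-of-thought planning prompt (illustrative only, not the PaLM-SayCan prompt).
cot_prompt = """Human: I spilled my drink, can you help?
Explanation: The user needs something to clean up the spill, so I should
find a sponge, bring it to the user, and then throw away the empty cup.
Plan: 1. find a sponge, 2. pick up the sponge, 3. bring it to the user, ...
"""
```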

Inner Monologue (1/3)

Inner Monologue [1] brings in VLMs to provide feedback about the scene, task success, etc.
All of these models talk to each other in natural language so that the LLM can understand them.

VLMs bring a lot of non-robotic data into our system, allowing us to build better feedback mechanisms for planning.


Inner Monologue (2/3)

  • Success Detection gives task-specific task completion information.
  • Passive Scene Description gives structured semantic scene information at every planning step.
  • Active Scene Description gives unstructured semantic information only when queried by the LLM planner (the feedback loop is sketched below).
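
A minimal sketch of how these textual feedback sources can be folded back into the planning prompt; `llm_plan_next_step`, `success_detector`, and `scene_describer` are hypothetical placeholders rather than the Inner Monologue API:

```python
def closed_loop_plan(instruction, env, llm_plan_next_step, success_detector,
                     scene_describer, max_steps=20):
    """Closed-loop planning where all feedback is expressed in natural language."""
    dialog = [f"Human: {instruction}"]
    for _ in range(max_steps):
        dialog.append(f"Scene: {scene_describer(env.observation())}")  # passive scene description
        action = llm_plan_next_step("\n".join(dialog))                 # LLM proposes the next skill
        if action == "done":
            break
        dialog.append(f"Robot: {action}")
        env.execute(action)
        ok = success_detector(env.observation(), action)               # success detection feedback
        dialog.append(f"Success: {'yes' if ok else 'no'}")
    return dialog
```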


Inner Monologue (3/3)


Code as Policies

Code as Policies [1] uses LLMs to generate code that directly controls the robot.
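
A rough sketch of the idea: prompt the LLM with a small perception/control API plus the instruction, then execute the code it writes. The names `detect_objects`, `pick_and_place`, and `generate_code` are illustrative placeholders, not the Code as Policies API:

```python
API_DOC = """
# Assumed robot API (illustration only):
# detect_objects(name: str) -> list      # poses of objects matching the name
# pick_and_place(src, dst) -> None       # pick the object at src, place it at dst
"""

def run_instruction(instruction, generate_code, robot_api):
    """Ask the LLM for Python that uses the given API, then execute it."""
    prompt = API_DOC + f"\n# Instruction: {instruction}\n# Write Python code using the API above.\n"
    code = generate_code(prompt)   # LLM returns a code string (a "policy" written as code)
    exec(code, dict(robot_api))    # run with only the whitelisted API in scope; sandbox in practice
```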


DIAL

DIAL [1] shows that VLMs can significantly expand language labels without collecting any additional robot data.
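
A simplified sketch of the relabeling idea, using off-the-shelf CLIP to score candidate instructions against a trajectory's frames and keeping the best matches; the threshold and candidate set here are assumptions, not the exact DIAL recipe:

```python
import torch
import clip
from PIL import Image

model, preprocess = clip.load("ViT-B/32")

def relabel(frame_paths, candidate_instructions, threshold=0.25):
    """Attach new language labels to an existing robot trajectory via CLIP scores."""
    images = torch.stack([preprocess(Image.open(p)) for p in frame_paths])
    text = clip.tokenize(candidate_instructions)
    with torch.no_grad():
        img = model.encode_image(images).float()
        txt = model.encode_text(text).float()
    img /= img.norm(dim=-1, keepdim=True)
    txt /= txt.norm(dim=-1, keepdim=True)
    scores = (img @ txt.T).mean(dim=0)  # average image-text similarity over the trajectory
    return [c for c, s in zip(candidate_instructions, scores) if s > threshold]
```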


NLMap (1/4)

NLMap [1] showed that VLMs can be used to query objects in the scene and enable open-vocabulary queries in SayCan. NLMap addresses two core problems:

  1. How to maintain open-vocabulary scene representations that are capable of locating arbitrary objects?
  2. How to merge such representations within long-horizon LLM planners to imbue them with scene understanding?


NLMap (2/4)

NLMap builds a natural-language-queryable scene representation with VLMs. An LLM-based object proposal module infers the involved objects and queries the representation for their availability and location. The LLM planner (SayCan) then plans conditioned on this information.
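
A minimal sketch of that flow, with `propose_objects`, `scene_map.query`, and `saycan_plan` as hypothetical stand-ins for the NLMap components:

```python
def nlmap_plan(instruction, scene_map, propose_objects, saycan_plan):
    """LLM proposes relevant objects, the scene map is queried, then SayCan plans.

    propose_objects(instruction) -> list[str]          # LLM-based object proposal
    scene_map.query(name) -> (found: bool, location)   # open-vocabulary scene query
    """
    context = {}
    for name in propose_objects(instruction):          # e.g. "warm up my lunch" -> ["lunch box", "microwave"]
        found, location = scene_map.query(name)
        if found:
            context[name] = location
    return saycan_plan(instruction, context)           # plan conditioned on what is actually in the scene
```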


NLMap (3/4)

Natural Language Queryable Scene Representation:

  1. The agent explores the scene and produces class-agnostic bounding box proposals based on objectness.
  2. Extract 512-d CLIP features and 512-d ViLD features for each bounding box and represent them as a feature point cloud.
  3. When queried with a piece of text, visualize a heatmap of matches based on the alignment of the text and visual features (see the sketch below).
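
A small sketch of the query step, assuming the per-region CLIP/ViLD features have already been back-projected into a feature point cloud (the array shapes are assumptions for illustration):

```python
import numpy as np

def query_scene(text_feature, point_features):
    """Score each point's visual feature against a text embedding.

    text_feature:   (512,) embedding of the query text
    point_features: (N, 512) per-point region features (CLIP/ViLD)
    Returns an (N,) array of cosine similarities, usable as a match heatmap.
    """
    pf = point_features / np.linalg.norm(point_features, axis=1, keepdims=True)
    tf = text_feature / np.linalg.norm(text_feature)
    return pf @ tf   # high-scoring points indicate likely locations of the queried object
```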


NLMap (4/4)

To complete a task specified by a human instruction, the robot queries the scene representation for relevant information by:

  1. parsing the natural language instruction into a list of relevant object names,
  2. using the names as keys to query object locations and availability, and
  3. generating executable options based on what’s found in the scene, then planning and executing as instructed (see the sketch below).
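
A small sketch of the last step, instantiating executable options only for objects actually found in the scene; the skill templates below are illustrative, not NLMap's actual skill set:

```python
def generate_options(found_objects):
    """found_objects maps object name -> location returned by the scene query."""
    templates = ["pick up the {}", "go to the {}", "place the {} on the counter"]  # illustrative skills
    options = []
    for name, location in found_objects.items():
        for template in templates:
            options.append({"skill": template.format(name), "target": location})
    return options  # these options form the skill set the LLM planner scores over
```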

CLIP-Nav (1/3)

CLIP-Nav [1] examines CLIP’s capability in making sequential navigational decisions and studies how it influences the path that an agent takes.

  1. Instruction Breakdown: Decompose coarse-grained instructions into keyphrases using LLMs (sketched below).
  2. Vision-Language Grounding: Ground keyphrases in the environment using CLIP.
  3. Zero-Shot Navigation: Utilize the CLIP scores to make navigational decisions.
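
A rough sketch of the first step, asking an LLM to split a coarse instruction into keyphrases; the prompt wording and the `complete` callable are placeholders, not the paper's prompt:

```python
def breakdown_instruction(instruction, complete):
    """Decompose a coarse navigation instruction into an ordered list of keyphrases.

    complete(prompt) -> str is a placeholder for any text-completion API.
    """
    prompt = (
        "Split the navigation instruction into an ordered list of short keyphrases, "
        "one per line.\n"
        f"Instruction: {instruction}\nKeyphrases:\n"
    )
    lines = complete(prompt).splitlines()
    return [line.strip("-• ").strip() for line in lines if line.strip()]
```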

CLIP-Nav (2/3)


  • Ground the NC on all the split images to obtain Keyphrase Grounding Scores (KGS). The CLIP-chosen image is the one with the highest KGS, which drives the navigation algorithm (see the grounding sketch below).
  • Ground the AC and use its grounding score to determine whether the agent has reached the target location (the stop condition).
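
A minimal sketch of keyphrase grounding, assuming the panorama has already been split into images; the KGS here is plain CLIP image-text similarity, a simplification of the paper's scoring:

```python
import torch
import clip
from PIL import Image

model, preprocess = clip.load("ViT-B/32")

def keyphrase_grounding_scores(split_image_paths, keyphrase):
    """Return one CLIP similarity score (KGS) per split view for the keyphrase."""
    images = torch.stack([preprocess(Image.open(p)) for p in split_image_paths])
    text = clip.tokenize([keyphrase])
    with torch.no_grad():
        img = model.encode_image(images).float()
        txt = model.encode_text(text).float()
    img /= img.norm(dim=-1, keepdim=True)
    txt /= txt.norm(dim=-1, keepdim=True)
    return (img @ txt.T).squeeze(1)                    # shape: (num_views,)

def clip_chosen_image(split_image_paths, keyphrase):
    scores = keyphrase_grounding_scores(split_image_paths, keyphrase)
    return split_image_paths[int(scores.argmax())]     # view with the highest KGS
```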

CLIP-Nav (3/3)

At each time step:

  1. split the panorama into 4 images, and obtain the CLIP-chosen image
  2. obtain adjacent navigable nodes visible from this image using the Matterport Simulator, and choose the closest node.

This is done iteratively until the Stop Condition is reached (sketched below).
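
A high-level sketch of that loop; `get_panorama`, `split_panorama`, `navigable_nodes`, `move_to`, and `current_view` are hypothetical environment/simulator calls, and the stop threshold is an assumption:

```python
def clip_nav_episode(keyphrase, activity_phrase, env, clip_chosen_image,
                     grounding_score, stop_threshold=0.3, max_steps=30):
    """Repeatedly move toward the CLIP-chosen view until the activity phrase
    grounds strongly enough in the current view (the stop condition)."""
    for _ in range(max_steps):
        views = env.split_panorama(env.get_panorama())     # split the panorama into 4 images
        best_view = clip_chosen_image(views, keyphrase)    # view with the highest KGS
        nodes = env.navigable_nodes(best_view)             # adjacent navigable nodes in that view
        env.move_to(min(nodes, key=lambda n: n.distance))  # step to the closest node
        if grounding_score(env.current_view(), activity_phrase) > stop_threshold:
            break                                          # stop condition reached
```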


SayCan

With better VLMs, our system can continue to get better without any new robotic data.

> Note that we can query with a single object name, or object families, such as “snack” or “fruit”.