Research
My research interests include bridging the gap between robotic policies and the rich space of language agents.
|
|
End-to-End Navigation with VLMs: Transforming Spatial Reasoning into Question-Answering
Dylan Goetting, Himanshu Gaurav Singh, Antonio Loquercio
arXiv, 2024
project page
/
arXiv
/
video
/
code
VLMnav: an embodied framework to transform a Vision and Language Model (VLM) into an end-to-end navigation policy.
|
|
SSEAL: Self-Supervised Explorative Agent Learning
Evan Frick, Dylan Goetting, Dhruv Gautam
Under review at ICML, 2025
1st place winner at Berkeley RDI's LLM Agents MOOC Hackathon (3000+ participants)
paper
/
code
/
demo
A novel framework for agents to autonomously explore their environment and synthesize learnings for downstream performance.
|
|
VLA as Tools: Exploring the Space of Agentic Robot Generalists
Dylan Goetting
2024
paper
A proposed system in which central planning agent, powered by a Large Language Model (LLM), interacts with the environment by iteratively prompting a VLA model. Through a carefully designed feedback module, the LLM can observe the outcome of such robotic trajectory, and use this information for downstream tasks.
|
Adventures
I like to explore the world and document my adventures. Some videos are highlights from my YouTube channel.
|
|