There are still lots of open positions. Let's find the one that's right for you.
The VLM-based Scene Understanding Research Intern will contribute to the development of advanced cloud-based systems for sparse semantic scene understanding using Vision Language Models (VLMs). This role involves architecting a cloud-retrieval pipeline, implementing systems for localization and mapping, and summarizing research findings for publication.