Bio

Xin (Eric) Wang is an Assistant Professor of Computer Science and Engineering at UC Santa Cruz. His research interests include Natural Language Processing, Computer Vision, and Machine Learning, with an emphasis on Multimodal, Generative, and Embodied AI. He has worked at Google Research, Facebook AI Research (FAIR), Microsoft Research, and Adobe Research.
Xin has served as an Area Chair for conferences such as ACL, NAACL, EMNLP, ICLR, and NeurIPS, as well as a Senior Program Committee member for AAAI and IJCAI. He has organized workshops and tutorials at conferences such as ACL, NAACL, CVPR, and ICCV. He has received several awards and recognitions for his work, including the CVPR Best Student Paper Award, a Google Faculty Research Award, Amazon Alexa Prize Awards, and various gift awards from Adobe, Snap, eBay, and others.

Hiring

If you are interested in joining my lab, please read the information for prospective students and visitors and check out the most beautiful and unique campus of UCSC [YouTube video | bilibili video]. Due to the large volume of emails, I may not be able to respond to each one individually.

Teaching

Selected Awards / Honors

  • Amazon Alexa Prize Award (SocialBot Second Place), 2023.
  • Amazon Alexa Prize Award (SimBot Third Place), 2023.
  • Amazon Alexa Prize Award (TaskBot Finalist, Top 5), 2023.
  • Google Faculty Research Award, 2022.
  • AAII Interdisciplinary Research Award, 2022.
  • UCSB Outstanding Publication Award, 2020.
  • CVPR Best Student Paper Award, 2019.
  • Top 100 Excellent Undergraduate Students of the Year, China Computer Federation, 2014.

Publications

  • Selected
    • Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding
      Yue Fan☆, Lei Ding, Ching-Chen Kuo, Shan Jiang, Yang Zhao, Xinze Guan, Jie Yang, Yi Zhang, Xin Eric Wang
      Technical report
      [Paper] [Website] [Code] [Dataset]
    • VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing
      Jing Gu☆, Yuwei Fang, Ivan Skorokhodov, Peter Wonka, Xinya Du, Sergey Tulyakov, Xin Eric Wang
      Technical report
      [Paper] [Website] [Code]
    • MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
      Xuehai He☆, Weixi Feng☆, Kaizhi Zheng☆, Yujie Lu☆, Wanrong Zhu☆, Jiachen Li☆, Yue Fan☆, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Kevin Lin, William Yang Wang, Lijuan Wang, Xin Eric Wang
      Technical report
      [Paper] [Website] [Code] [Dataset]
    • Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation
      Yufan Zhou, Ruiyi Zhang, Kaizhi Zheng☆, Nanxuan Zhao, Jiuxiang Gu, Zichao Wang, Xin Eric Wang, Tong Sun
      Technical report
      [Paper]
    • Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA
      Qianqi Yan☆, Xuehai He☆, Xiang Yue, Xin Eric Wang
      Technical report
      [Paper] [Website] [Code] [Dataset]
    • FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation
      Xuehai He☆, Jian Zheng, Jacob Zhiyuan Fang, Robinson Piramuthu, Mohit Bansal, Vicente Ordonez, Gunnar A Sigurdsson, Nanyun Peng, Xin Eric Wang
      Technical report
      [Paper] [Website]
    • LLM-Coordination: Evaluating and Analyzing Multi-Agent Coordination Abilities in Large Language Models
      Saaket Agashe☆, Yue Fan☆, Anthony Reyna☆, Xin Eric Wang
      Technical report
      [Paper] [Website] [Code]
    • MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens
      Kaizhi Zheng*☆, Xuehai He*☆, Xin Eric Wang
      Technical report
      [Paper] [Website] [Code]
    • Discriminative Diffusion Models as Few-shot Vision and Language Learners
      Xuehai He☆, Weixi Feng☆, Tsu-Jui Fu☆, Varun Jampani, Arjun Akula, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang
      Technical report
      [Paper] [Website] [Code]
    • Multimodal Procedural Planning via Dual Text-Image Prompting
      Yujie Lu☆, Pan Lu, Zhiyu Chen, Wanrong Zhu☆, Xin Eric Wang, William Yang Wang
      Technical report
      [Paper] [Code]
  • 2024
    • SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
      Jing Gu☆, Yilin Wang, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Xin Eric Wang
      ECCV 2024
      [Paper] [Website] [Code]
    • NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
      Gengze Zhou, Yicong Hong, Zun Wang, Xin Eric Wang, Qi Wu
      ECCV 2024
    • Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA
      Yue Fan☆, Jing Gu☆, Kaiwen Zhou☆, Qianqi Yan☆, Shan Jiang, Ching-Chen Kuo, Xinze Guan, Xin Eric Wang
      ACL 2024
      [Paper] [Website] [Code] [Data]
    • ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models
      Kaiwen Zhou☆, Kwonjoon Lee, Teruhisa Misu, Xin Eric Wang
      ACL 2024
      [Paper]
    • Navigation as Attackers Wish? Towards Building Byzantine-Robust Embodied Agents under Federated Learning
      Yunchao Zhang☆, Zonglin Di☆, Kaiwen Zhou☆, Cihang Xie, Xin Eric Wang
      NAACL 2024
      [Paper] [Website] [Code]
    • ComCLIP: Training-Free Compositional Image and Text Matching
      Kenan Jiang*☆, Xuehai He*☆, Ruize Xu☆, Xin Eric Wang
      NAACL 2024
      [Paper] [Website] [Code]
  • 2023
    • Photoswap: Personalized Subject Swapping in Images
      Jing Gu☆, Yilin Wang, Nanxuan Zhao, Tsu-Jui Fu☆, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Xin Eric Wang
      NeurIPS 2023
      [Paper] [Website] [Code]
    • LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
      Weixi Feng☆, Wanrong Zhu☆, Tsu-Jui Fu☆, Varun Jampani, Arjun Akula, Xuehai He☆, Sugato Basu, Xin Eric Wang, William Yang Wang
      NeurIPS 2023
      [Paper] [Website] [Code]
    • LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
      Yujie Lu☆, Xianjun Yang, Xiujun Li, Xin Eric Wang, William Yang Wang
      NeurIPS 2023
      [Paper] [Code]
    • R2H: Building Multimodal Navigation Helpers that Respond to Help Requests
      Yue Fan☆, Jing Gu☆, Kaizhi Zheng☆, Xin Eric Wang
      EMNLP 2023
      [Paper] [Website] [Code]
    • Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation
      Wanrong Zhu☆, Xinyi Wang, Yujie Lu☆, Tsu-Jui Fu☆, Xin Eric Wang, Miguel Eckstein, William Yang Wang
      EMNLP 2023
      [Paper]
    • Parameter-Efficient Cross-lingual Transfer of Vision and Language Models via Translation-based Alignment
      Zhen Zhang☆, Jialu Wang☆, Xin Eric Wang
      Findings of EMNLP 2023
      [Paper] [Code]
    • ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
      Kaiwen Zhou☆, Kaizhi Zheng☆, Connor Pryor, Yilin Shen, Hongxia Jin, Lise Getoor, Xin Eric Wang
      ICML 2023
      [Paper] [Website]
    • Aerial Vision-and-Dialog Navigation
      Yue Fan☆, Winson Chen☆, Tongzhou Jiang☆, Chun Zhou☆, Yi Zhang, Xin Eric Wang
      Findings of ACL 2023
      [Paper] [Website] [Code]
    • T2IAT: Measuring Valence and Stereotypical Biases in Text-to-Image Generation
      Jialu Wang☆, Xinyue Gabby Liu☆, Zonglin Di☆, Yang Liu, Xin Eric Wang
      Findings of ACL 2023
      [Paper] [Code] [Demo]
    • Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
      Weixi Feng☆, Xuehai He☆, Tsu-Jui Fu☆, Varun Jampani, Arjun Akula, Pradyumna Narayana, Sugato Basu, Xin Eric Wang, William Yang Wang
      ICLR 2023
      [Paper] [Website] [Code]
    • Neuro-Symbolic Procedural Planning with Commonsense Prompting
      Yujie Lu☆, Weixi Feng☆, Wanrong Zhu☆, Wenda Xu, Xin Eric Wang, Miguel Eckstein, William Yang Wang
      ICLR 2023
      Spotlight Presentation
      [Paper] [Code]
    • Multimodal Graph Transformer for Multimodal Question Answering
      Xuehai He☆, Xin Eric Wang
      EACL 2023
      [Paper]
    • Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
      Wanrong Zhu☆, An Yan☆, Yujie Lu☆, Wenda Xu, Xin Eric Wang, Miguel Eckstein, William Yang Wang
      EACL 2023
      [Paper] [Code]
    • ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation
      Wanrong Zhu☆, Xin Eric Wang, An Yan, Miguel Eckstein, William Yang Wang
      EACL 2023
      [Paper]
    • Parameter-efficient Model Adaptation for Vision Transformers
      Xuehai He☆, Chunyuan Li, Pengchuan Zhang, Jianwei Yang, Xin Eric Wang
      AAAI 2023
      [Paper] [Website] [Code]
    • Athena 3.0: Personalized Multimodal Chatbot with Neuro-symbolic Dialogue Generators
      Yue Fan, Kevin K. Bowden, Wen Cui, Winson Chen, Vrindavan Harrison, Angela Ramirez, Saaket Agashe, Xinyue Gabby Liu, Neha Pullabhotla, Nan Qiang, Jeshwanth Bheemanpally, Sugam Garg, Marilyn Walker, Xin Eric Wang
      Alexa Prize SocialBot Grand Challenge 5 Proceedings 2023
      [Paper]
    • Sage: A Multimodal Knowledge Graph-based Conversational Agent for Complex Task Guidance
      Kaizhi Zheng, Jeshwanth Bheemanpally, Bhrigu Garg, Seongsil Heo, Dhananjay Sonawane, Winson Chen, Shree Vignesh S, Xin Eric Wang
      Alexa Prize TaskBot Challenge 2 Proceedings 2023
      [Paper]
    • SlugJARVIS: Multimodal Commonsense Knowledge-based Embodied AI for SimBot Challenge
      Jing Gu, Kaizhi Zheng, Kaiwen Zhou, Yue Fan, Xuehai He, Jialu Wang, Zonglin Di, Xin Eric Wang
      Alexa Prize SimBot Challenge Proceedings 2023
      [Paper]
  • 2022
    • CPL: Counterfactual Prompt Learning for Vision and Language Models
      Xuehai He☆, Diji Yang☆, Weixi Feng☆, Tsu-Jui Fu☆, Arjun Akula, Varun Jampani, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang
      EMNLP 2022
      [Paper] [Website] [Code]
    • VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation
      Kaizhi Zheng☆, Xiaotong Chen, Odest Chad Jenkins, Xin Eric Wang
      NeurIPS 2022
      [Paper] [Website] [Code]
    • FedVLN: Privacy-preserving Federated Vision-and-Language Navigation
      Kaiwen Zhou☆, Xin Eric Wang
      ECCV 2022
      [Paper] [Code]
    • Language-Driven Artistic Style Transfer
      Tsu-Jui Fu☆, Xin Eric Wang, William Yang Wang
      ECCV 2022
      [Paper] [Code]
    • Understanding Instance-Level Impact of Fairness Constraints
      Jialu Wang☆, Xin Eric Wang, Yang Liu
      ICML 2022
      [Paper] [Code]
    • Imagination-Augmented Natural Language Understanding
      Yujie Lu☆, Wanrong Zhu☆, Xin Eric Wang, Miguel Eckstein, William Yang Wang
      NAACL 2022
      Oral presentation
      [Paper] [Code]
    • Diagnosing Vision-and-Language Navigation: What Really Matters
      Wanrong Zhu☆, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Eric Wang, Qi Wu, Miguel Eckstein, William Yang Wang
      NAACL 2022
      Oral presentation
      [Paper] [Code]
    • JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents
      Kaizhi Zheng☆, Kaiwen Zhou☆, Jing Gu☆, Yue Fan☆, Jialu Wang☆, Zonglin Di☆, Xuehai He☆, Xin Eric Wang
      SoCal NLP 2022
      Winner Model of the Alexa Prize SimBot Public Benchmark Challenge [Link]
      [Paper]
    • Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning
      Juncheng Li☆, Junlin Xie, Long Qian, Linchao Zhu, Siliang Tang, Fei Wu, Yi Yang, Yueting Zhuang, Xin Eric Wang
      CVPR 2022
      [Paper] [Code]
    • M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformer
      Tsu-Jui Fu☆, Xin Eric Wang, Scott Grafton, Miguel Eckstein, William Yang Wang
      CVPR 2022
      [Paper] [Dataset] [Video]
    • Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
      Jing Gu☆, Eliana Stefani☆, Qi Wu, Jesse Thomason, Xin Eric Wang
      ACL 2022
      [Paper] [Code]
    • Assessing Multilingual Fairness in Pretrained Multimodal Representations
      Jialu Wang☆, Yang Liu, Xin Eric Wang
      Findings of ACL 2022
      [Paper]
    • Interpretable Research Replication Prediction via Variational Contextual Consistency Sentence Masking
      Tianyi Luo☆, Rui Meng, Xin Eric Wang, Yang Liu
      Findings of ACL 2022
      [Paper]
  • 2021
    • Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search
      Jialu Wang☆, Yang Liu, Xin Eric Wang
      EMNLP 2021
      Oral presentation
      [Paper] [Code]
    • VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation
      Linjie Li, Jie Lei, Zhe Gan, Licheng Yu, Yen-Chun Chen, Rohit Pillai, Yu Cheng, Luowei Zhou, Xin Eric Wang, William Yang Wang, Tamara Lee Berg, Mohit Bansal, Jingjing Liu, Lijuan Wang, Zicheng Liu
      NeurIPS 2021
      [Paper] [Website] [Code] [Data]
    • Visual Question Rewriting for Increasing Response Rate
      Jiayi Wei☆, Xilian Li☆, Yi Zhang, Xin Eric Wang
      SIGIR 2021
      [Paper] [Code]
    • Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
      Wanrong Zhu☆, Xin Eric Wang, Tsu-Jui Fu, An Yan, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang
      EACL 2021
      [Paper] [Code]
    • L2C: Describing Visual Differences Needs Semantic Understanding of Individuals
      An Yan☆, Xin Eric Wang, Tsu-Jui Fu, William Yang Wang
      EACL 2021
      [Paper]
  • 2020
    • Closing the Loop Between Language and Vision for Embodied Agents
      Xin Wang
      UC Santa Barbara
      [PhD Dissertation]
    • SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning
      Tsu-Jui Fu☆, Xin Eric Wang, Scott Grafton, Miguel Eckstein, William Yang Wang
      EMNLP 2020
      Oral presentation
      [Paper] [Code] [Slides]
    • Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations
      Wanrong Zhu☆, Xin Eric Wang, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang
      EMNLP 2020
      [Paper]
    • Learning to Stop: A Simple yet Effective Approach to Urban Vision-Language Navigation
      Jiannan Xiang☆, Xin Eric Wang, William Yang Wang
      Findings of EMNLP 2020
      [Paper]
    • Environment-agnostic Multitask Learning for Natural Language Grounded Navigation
      Xin Eric Wang*, Vihan Jain*, Eugene Ie, William Yang Wang, Zornitsa Kozareva, Sujith Ravi
      ECCV 2020
      Ranking 1st on the CVDN leaderboard
      [Paper] [Code] [Video] [Slides]
    • Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling
      Tsu-Jui Fu☆, Xin Eric Wang, Matthew Peterson, Scott Grafton, Miguel Eckstein, William Yang Wang
      ECCV 2020
      Spotlight presentation
      [Paper] [Video] [Slides]
    • Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
      Juncheng Li☆, Xin Wang, Siliang Tang, Haizhou Shi, Fei Wu, Yueting Zhuang, William Yang Wang
      CVPR 2020
      [Paper] [Video]
    • REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
      Yuankai Qi, Qi Wu, Peter Anderson, Xin Wang, William Yang Wang, Chunhua Shen, Anton van den Hengel
      CVPR 2020
      Oral presentation
      [Paper] [Code] [Video]
    • Vision-Language Navigation Policy Learning and Adaptation
      Xin Wang, Qiuyuan Huang, Asli Celikyilmaz, Jianfeng Gao, Dinghan Shen, Yuan-Fang Wang, William Yang Wang, Lei Zhang
      IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
      Journal version of the CVPR 2019 Best Student Paper
    • Generative Adversarial Zero-Shot Relational Learning for Knowledge Graphs
      Pengda Qin, Xin Wang, Wenhu Chen, Chunyun Zhang, Weiran Xu, William Yang Wang
      AAAI 2020
      Oral presentation
      [Paper] [Code]
  • 2019
    • TIGEr: Text-to-Image Grounding for Image Caption Evaluation
      Ming Jiang, Qiuyuan Huang, Lei Zhang, Xin Wang, Pengchuan Zhang, Zhe Gan, Jana Diesner, Jianfeng Gao
      EMNLP-IJCNLP 2019
      [Paper] [Code] [bibtex]
    • VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
      Xin Wang*, Jiawei Wu*, Junkun Chen, Lei Li, Yuan-Fang Wang, William Yang Wang
      ICCV 2019
      Oral presentation
      [Paper] [Website] [Video] [bibtex]
    • Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
      Xin Wang, Qiuyuan Huang, Asli Celikyilmaz, Jianfeng Gao, Dinghan Shen, Yuan-Fang Wang, William Yang Wang, Lei Zhang
      CVPR 2019
      Best Student Paper (1/5160=0.02%)
      [Paper] [Video] [bibtex]
    • MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment
      Da Zhang, Xiyang Dai, Xin Wang, Yuan-Fang Wang, Larry S. Davis
      CVPR 2019
      [Paper] [bibtex]
    • Self-Supervised Dialogue Learning
      Jiawei Wu, Xin Wang, William Yang Wang
      ACL 2019
      [Paper] [bibtex]
    • Self-Supervised Learning for Contextualized Extractive Summarization
      Hong Wang, Xin Wang, Wenhan Xiong, Mo Yu, Xiaoxiao Guo, Shiyu Chang, William Yang Wang
      ACL 2019
      [Paper] [Code] [bibtex]
    • Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models
      Dinghan Shen, Asli Celikyilmaz, Yizhe Zhang, Liqun Chen, Xin Wang, Jianfeng Gao, Lawrence Carin
      ACL 2019
      [Paper] [bibtex]
    • Extract and Edit: An Alternative to Back-Translation for Unsupervised Neural Machine Translation
      Jiawei Wu, Xin Wang, William Yang Wang
      NAACL 2019
      Oral presentation
      [Paper] [bibtex]
    • Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning
      Xin Wang, Jiawei Wu, Da Zhang, Yu Su, William Yang Wang
      AAAI 2019
      Oral presentation
      [Paper] [Code] [bibtex]
  • 2018 and before
    • Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation
      Xin Wang*, Wenhan Xiong*, Hongmin Wang, William Yang Wang
      ECCV 2018
      [Paper] [bibtex]
    • XL-NBT: A Cross-lingual Neural Belief Tracking Framework
      Wenhu Chen, Jianshu Chen, Yu Su, Xin Wang, Dong Yu, Xifeng Yan, William Yang Wang
      EMNLP 2018
      [Paper] [Code] [bibtex]
    • No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
      Xin Wang*, Wenhu Chen*, Yuan-Fang Wang, William Yang Wang
      ACL 2018
      Oral presentation
      [Paper] [Code] [Video] [Slides (pptx)] [Slides (pdf)] [bibtex]
    • S3D: Single Shot multi-Span Detector via Fully 3D Convolutional Network
      Da Zhang, Xiyang Dai, Xin Wang, Yuan-Fang Wang
      BMVC 2018
      Oral presentation
      [Paper] [Code] [Video] [Slides] [bibtex]
    • Video Captioning via Hierarchical Reinforcement Learning
      Xin Wang, Wenhu Chen, Jiawei Wu, Yuan-Fang Wang, William Yang Wang
      CVPR 2018
      [Paper] [Supp] [Dataset] [bibtex]
    • Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning
      Xin Wang, Yuan-Fang Wang, William Yang Wang
      NAACL-HLT 2018
      [Paper] [bibtex]
    • Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer
      Xin Wang, Geoffrey Oxholm, Da Zhang, Yuan-Fang Wang
      CVPR 2017
      [Paper] [Supp] [Images] [Code (Third-Party)] [bibtex]
    • Deep Reinforcement Learning for Visual Object Tracking in Videos
      Da Zhang, Hamid Maei, Xin Wang, Yuan-Fang Wang
      Tech report 2017
      [Paper] [bibtex]

Service

  • Organizer:
  • Area Chair (or Senior Program Committee):
  • Session Chair:
  • Program Committee: ACL, NAACL, EMNLP, CVPR, ICCV, ECCV, NeurIPS, ICLR, AAAI, IJCAI, CoRL
  • Journal Reviewer: TPAMI, IJCV