资讯

[2025/05/13]: Fix bugs on environment installation and inference. Object-goal will be supported in two weeks. [2025/04/06]: Release code. Now instance-image-goal and text-goal are supported.
Given a reference image and the corresponding prompt, the keyboard or mouse signal, we transform these options to the continuous camera space. Then we design a light-weight action encoder to encode ...