🎯
Focusing
Studying at Beijing Jiaotong University
Research intern at Intelligent Computing of Alibaba Group
-
-
-
MobileAgent Public
Forked from X-PLUG/MobileAgentMobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
-
Attention-LLaVA Public
A hot-pluggable tool for visualizing LLaVA's attention.
-
AMBER Public
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
-
-