-
Tianjin University
- Tianjin
- https://Zongbo-Han.github.io/
Lists (3)
Sort Name ascending (A-Z)
Starred repositories
[TMLR 2022] High-Modality Multimodal Transformer
[ICLR'24] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
天大博士/硕士学位论文Latex模板,根据2021年版要求修改,可直接在Overleaf上运行。:star:所写的论文成功提交天津大学图书馆存档!(2021.12.24)
Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
A Python library for performing calculations in the Dempster-Shafer theory of evidence.
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
【MICCAI 2023 Early Accept & MedIA submission】EyeMost "Reliable Multimodality Eye Disease Screening via Mixture of Student's t Distributions"
The project page of paper: Trusted Multi-View Classification [ICLR'2021 paper]
What do we learn from inverting CLIP models?
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Environments, tools, and benchmarks for general computer agents
[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
MedRG: Medical Report Grounding with Multi-modal Large Language Model
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
ID-like Prompt Learning for Few-Shot Out-of-Distribution Detection
Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".
This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our emails: [email protected] [email protected] qinyang.gm…