Stars
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Fusing Semantic Segmentation and Monocular Depth Estimation for Enabling Autonomous Driving in Roads without Lane Lines
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
Synthetic Data Generation Examples
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
GaussianSpeech: Audio-Driven Gaussian Avatars
Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer https://arxiv.org/abs/2404.05695
A framework built on top of NVIDIA Isaac Sim for simulating drones with PX4 support and much more
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
Official code for "Large Language Models as Optimizers"
PepperPose: Full-Body Pose Estimation with a Companion Robot
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
[ICML 2024] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
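All shared datasets are open-source PHM (Prognostics and Health Management) data, covering fault diagnosis, health assessment, and lifetime prediction.
📈 The largest collection to date of industrial defect detection datasets and papers. Continuously summarizes important open-source datasets and key papers in the field of surface defect research.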
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
The open source implementation of "AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model"
PaPaGei: Open Foundation Models for Optical Physiological Signals
ChatEMG: Synthetic Data Generation to Control a Robotic Hand Orthosis for Stroke
A collection of resources that investigate social agents.
A simple screen-parsing tool toward a pure-vision-based GUI agent
Code release for H-GAP Humanoid Control with a Generalist Planner
ChatArena (or Chat Arena) provides multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.