SeeFun (seefun)

Thinking, Walking and Coding

92 followers · 57 following

Shanghai, China

Stars

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 17,276 1,423 Updated Feb 7, 2025

opentensor / bittensor

Internet-scale Neural Networks

Python 1,002 349 Updated Feb 7, 2025

omegalabsinc / omegalabs-bittensor-subnet

The World's Largest Decentralized AGI Multimodal Dataset

Python 43 26 Updated Feb 4, 2025

thomas0809 / MolScribe

Robust Molecular Structure Recognition with Image-to-Graph Generation

Python 169 34 Updated Jan 9, 2025

stanford-oval / storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 21,652 1,894 Updated Jan 23, 2025

lambdal / deeplearning-benchmark

Benchmark Suite for Deep Learning

Shell 257 53 Updated Jan 8, 2025

Delgan / loguru

Python logging made (stupidly) simple

Python 20,723 715 Updated Feb 1, 2025

bojone / softtopk

differentiable top-k operator

Python 21 Updated Dec 30, 2024

ppaanngggg / layoutreader

A faster LayoutReader model based on LayoutLMv3 that sorts OCR bboxes into reading order.

Python 164 12 Updated May 23, 2024

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,794 591 Updated Feb 1, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 3,024 1,331 Updated Feb 5, 2025

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python 36,310 1,626 Updated Feb 1, 2025

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 460 38 Updated Jan 3, 2025

yformer / EfficientTAM

Efficient Track Anything

Python 459 12 Updated Jan 6, 2025

huggingface / smol-course

A course on aligning smol models.

Jupyter Notebook 5,267 1,804 Updated Jan 24, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 8,143 654 Updated Jan 24, 2025

NVIDIA / free-threaded-python

No-GIL Python environment featuring NVIDIA Deep Learning libraries.

Dockerfile 41 3 Updated Nov 19, 2024

video-db / StreamRAG

Video Search and Streaming Agent 🕵️‍♂️

Python 458 29 Updated Jan 31, 2024

infiniflow / infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

C++ 3,011 292 Updated Feb 7, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,644 446 Updated Feb 6, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 15,109 1,955 Updated Feb 1, 2025

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,423 238 Updated Jan 27, 2025

uw-ipd / RoseTTAFold2NA

RoseTTAFold2 protein/nucleic acid complex prediction

Python 343 77 Updated Jun 3, 2024

evolutionaryscale / esm

Python 1,687 188 Updated Jan 16, 2025

HJYao00 / DenseConnector

【NeurIPS 2024】Dense Connector for MLLMs

Python 155 7 Updated Oct 14, 2024

westlake-baichuan-mllm / bc-omni

Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊

260 7 Updated Jan 27, 2025

RQLuo / MixTeX-Latex-OCR

MixTeX: multimodal LaTeX, ZhEn, and table OCR. It performs efficient CPU-based inference locally and offline on Windows.

Python 1,095 65 Updated Dec 15, 2024

apple / ml-depth-pro

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 4,079 295 Updated Oct 5, 2024

unclecode / crawl4ai

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 29,144 2,331 Updated Feb 7, 2025
