seefun
Shanghai, China


Fully open reproduction of DeepSeek-R1

Python 17,276 1,423 Updated Feb 7, 2025

Internet-scale Neural Networks

Python 1,002 349 Updated Feb 7, 2025

The World's Largest Decentralized AGI Multimodal Dataset

Python 43 26 Updated Feb 4, 2025

Robust Molecular Structure Recognition with Image-to-Graph Generation

Python 169 34 Updated Jan 9, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 21,652 1,894 Updated Jan 23, 2025

Benchmark Suite for Deep Learning

Shell 257 53 Updated Jan 8, 2025

Python logging made (stupidly) simple

Python 20,723 715 Updated Feb 1, 2025

differentiable top-k operator

Python 21 Updated Dec 30, 2024

A faster LayoutReader model based on LayoutLMv3 that sorts OCR bboxes into reading order.

Python 164 12 Updated May 23, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,794 591 Updated Feb 1, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 3,024 1,331 Updated Feb 5, 2025

Python tool for converting files and office documents to Markdown.

Python 36,310 1,626 Updated Feb 1, 2025

OpenOCR: A general OCR system with accuracy and efficiency. It supports 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and the latest methods will continue to be added.

Python 460 38 Updated Jan 3, 2025

Efficient Track Anything

Python 459 12 Updated Jan 6, 2025

A course on aligning smol models.

Jupyter Notebook 5,267 1,804 Updated Jan 24, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 8,143 654 Updated Jan 24, 2025

No-GIL Python environment featuring NVIDIA Deep Learning libraries.

Dockerfile 41 3 Updated Nov 19, 2024

Video Search and Streaming Agent 🕵️‍♂️

Python 458 29 Updated Jan 31, 2024

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

C++ 3,011 292 Updated Feb 7, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,644 446 Updated Feb 6, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 15,109 1,955 Updated Feb 1, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,423 238 Updated Jan 27, 2025

RoseTTAFold2 protein/nucleic acid complex prediction

Python 343 77 Updated Jun 3, 2024
Python 1,687 188 Updated Jan 16, 2025

【NeurIPS 2024】Dense Connector for MLLMs

Python 155 7 Updated Oct 14, 2024

Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊

260 7 Updated Jan 27, 2025

MixTeX: multimodal LaTeX, ZhEn, and table OCR. It performs efficient CPU-based inference locally and offline on Windows.

Python 1,095 65 Updated Dec 15, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 4,079 295 Updated Oct 5, 2024

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 29,144 2,331 Updated Feb 7, 2025