Open Source Computer Vision Libraries - Page 4

1

CAM

Class Activation Mapping

This repository implements Class Activation Mapping (CAM), a technique that exposes the implicit attention of convolutional neural networks by generating heatmaps of the most discriminative image regions behind a network's class prediction. The method modifies a CNN slightly (e.g., using global average pooling before the final layer) so that a weighted combination of feature maps yields the class activation map. The repo provides example code and instructions for applying CAM to existing CNN architectures with light modifications, sample scripts using standard architectures, and per-class visualization of discriminative regions.
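As a rough illustration of the "weighted combination of feature maps" described above, here is a minimal pure-Python sketch. The function names, the toy feature maps, and the weights are all made up for this example; they are not taken from the CAM repository, which works on real CNN activations.

```python
def class_activation_map(feature_maps, class_weights):
    """Sum K feature maps (each an H x W nested list), scaled by one class's K weights."""
    height = len(feature_maps[0])
    width = len(feature_maps[0][0])
    cam = [[0.0] * width for _ in range(height)]
    for fmap, weight in zip(feature_maps, class_weights):
        for y in range(height):
            for x in range(width):
                cam[y][x] += weight * fmap[y][x]
    return cam


def normalize(cam):
    """Rescale a map to [0, 1] so it can be rendered as a heatmap."""
    flat = [value for row in cam for value in row]
    lowest, highest = min(flat), max(flat)
    span = highest - lowest
    if span == 0:
        return [[0.0 for _ in row] for row in cam]
    return [[(value - lowest) / span for value in row] for row in cam]


# Toy input (hypothetical): two 2x2 feature maps from the last conv layer,
# and the final-layer weights connecting each map to one class.
feature_maps = [
    [[1.0, 0.0], [0.0, 0.0]],
    [[0.0, 0.0], [0.0, 2.0]],
]
class_weights = [0.5, 1.5]

cam = class_activation_map(feature_maps, class_weights)
heatmap = normalize(cam)
```

In practice the resulting map would be upsampled to the input image size before being overlaid on the image.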

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
2

CMUcam2 computer vision

Building a CMUcam2 Learning Module to Support Course Practicum

CMUcam computer vision is an open-source project by a researcher in robotics and image processing. In this first attempt, the researcher explores how to build a CMUcam2 teaching aid integrated with two servo motors, with the basic capability of automatically searching for objects (automatic object tracking).

Downloads: 0 This Week

Last Update: 2012-07-16
See Project
3

CV2012MatchMove

MatchMoving oriented C++ project

This is a student project repository for the Computer Vision class at the University of Szeged, Hungary. Our goal is to implement an application that can integrate a synthetic 3D object into real-world video footage, a.k.a. Match Move.

Downloads: 0 This Week

Last Update: 2012-07-09
See Project
4

CVLab

Popular machine learning algorithms and popular computer vision algorithms, based on IPP.

Downloads: 0 This Week

Last Update: 2015-03-13
See Project
5

CamScanner

Scanner

OCR Scanner

1 Review

Downloads: 0 This Week

Last Update: 2024-09-19
See Project
6

Cambio - 3D computer vision simulator

Cambio is a computer simulator of a robot with stereo 3D vision. It is intended mainly as a tool for studying computer vision algorithms, but I might expand it to cover other robotics topics of interest (sensorimotor cognition, reliability, etc.).

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
7

Camera Kombat

Camera Kombat is an open-source fighting game based on computer vision that enables free, unencumbered interaction. To make this possible, a webcam captures images of the users and their gestures are recognized in real time.

Downloads: 0 This Week

Last Update: 2013-04-12
See Project
8

ChainerCV

ChainerCV: a Library for Deep Learning in Computer Vision

ChainerCV is a collection of tools to train and run neural networks for computer vision tasks using Chainer. ChainerCV defines object detection as the task of localizing objects in an image with bounding boxes and categorizing them. Bounding boxes in an image are represented as a two-dimensional array of shape (R, 4), where R is the number of bounding boxes and the second axis holds the coordinates of each box. ChainerCV supports dataset loaders, which can be used to index examples through list-like interfaces. Dataset classes whose names end with BboxDataset contain annotations of where objects are located in an image and which categories they are assigned to. These datasets can be indexed to return a tuple of an image, bounding boxes, and labels. ChainerCV provides several network implementations that carry out object detection.
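To make the (R, 4) bounding-box layout above concrete, here is a small sketch in plain Python that computes the overlap (intersection over union) between two boxes. It does not call ChainerCV; the helper names are hypothetical, and the coordinate order (y_min, x_min, y_max, x_max) is an assumption for this example, since the description above does not state it.

```python
def box_area(box):
    """Area of a box given as (y_min, x_min, y_max, x_max); empty boxes have area 0."""
    y_min, x_min, y_max, x_max = box
    return max(0.0, y_max - y_min) * max(0.0, x_max - x_min)


def iou(box_a, box_b):
    """Intersection over union of two boxes in the assumed coordinate order."""
    overlap = (
        max(box_a[0], box_b[0]),  # top edge of the intersection
        max(box_a[1], box_b[1]),  # left edge
        min(box_a[2], box_b[2]),  # bottom edge
        min(box_a[3], box_b[3]),  # right edge
    )
    intersection = box_area(overlap)
    union = box_area(box_a) + box_area(box_b) - intersection
    return intersection / union if union > 0 else 0.0


# R = 2 boxes, each with 4 coordinates, plus one category label per box.
bboxes = [
    [0.0, 0.0, 10.0, 10.0],
    [5.0, 5.0, 15.0, 15.0],
]
labels = [0, 1]

overlap_score = iou(bboxes[0], bboxes[1])
```

Overlap scores like this are commonly used when evaluating detections against ground-truth boxes.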

Downloads: 0 This Week

Last Update: 2022-08-15
See Project
9

CoTracker

CoTracker is a model for tracking any point (pixel) on a video

CoTracker is a learning-based point tracking system that jointly follows many user-specified points across a video, rather than tracking each point independently. By reasoning about all tracks together, it can maintain temporal consistency, handle mutual occlusions, and reduce identity swaps when trajectories cross. The model takes sparse point queries on one frame and predicts their sub-pixel locations and a visibility score for every subsequent frame, producing long, coherent trajectories. Its transformer-style architecture aggregates information both along time and across points, allowing it to recover tracks even after brief disappearances. The repository ships with inference scripts, pretrained weights, and simple interfaces to seed points, run tracking, and export trajectories for downstream tasks. Typical uses include correspondence building, motion analysis, dynamic SLAM priors, video editing masks, and evaluation of geometric consistency in real scenes.

Downloads: 0 This Week

Last Update: 2025-10-12
See Project
10

Computer Vision

Best Practices, code samples, and documentation for Computer Vision

In recent years, we've seen extraordinary growth in Computer Vision, with applications in face recognition, image understanding, search, drones, mapping, and semi-autonomous and autonomous vehicles. A key part of many of these applications is visual recognition tasks such as image classification, object detection, and image similarity. This repository provides examples and best practice guidelines for building computer vision systems. The goal of this repository is to build a comprehensive set of tools and examples that leverage recent advances in Computer Vision algorithms, neural architectures, and operationalizing such systems. Rather than creating implementations from scratch, we draw from existing state-of-the-art libraries and build additional utility around loading image data, optimizing and evaluating models, and scaling up to the cloud.

Downloads: 0 This Week

Last Update: 2021-07-20
See Project
11

Computer Vision

Downloads: 0 This Week

Last Update: 2013-01-14
See Project
12

Computer Vision

Assignments for the Computer Vision Course.

Downloads: 0 This Week

Last Update: 2013-07-30
See Project
13

Computer Vision Chess

A system for playing chess with a computer player using a real chess board. An experiment in learning the techniques of Computer Vision and having fun in the process.

Downloads: 0 This Week

Last Update: 2016-07-24
See Project
14

Computer Vision Pretrained Models

A collection of computer vision pre-trained models

A pre-trained model is a model created by someone else to solve a similar problem. Instead of building a model from scratch, we can use a model trained on another problem as a starting point. A pre-trained model may not be 100% accurate in your application. For example, if you want to build a self-learning car, you can spend years building a decent image recognition algorithm from scratch, or you can take the Inception model (a pre-trained model) from Google, which was built on ImageNet data, to identify the contents of images. Entries in the collection include a model that generates bounding boxes and segmentation masks for each instance of an object in an image, based on a Feature Pyramid Network (FPN) and a ResNet101 backbone; a TensorFlow implementation of 'YOLO: Real-Time Object Detection', with training and support for real-time running on mobile devices; and MobileNets, which trade off latency, size, and accuracy while comparing favorably with popular models from the literature.

Downloads: 0 This Week

Last Update: 2022-08-18
See Project
15

Computer Vision in Traffic Surveillance

Addresses the problems of counting the vehicles passing on a road during a time interval, classifying vehicles, and estimating the speed of the observed traffic flow from traffic scenes captured by a camera in real time.

Downloads: 0 This Week

Last Update: 2014-04-21
See Project
16

ConvNeXt

Code release for ConvNeXt model

ConvNeXt is a modernized convolutional neural network (CNN) architecture designed to rival Vision Transformers (ViTs) in accuracy and scalability while retaining the simplicity and efficiency of CNNs. It revisits classic ResNet-style backbones through the lens of transformer design trends—large kernel sizes, inverted bottlenecks, layer normalization, and GELU activations—to bridge the performance gap between convolutions and attention-based models. ConvNeXt’s clean, hierarchical structure makes it efficient for both pretraining and fine-tuning across a wide range of visual recognition tasks. It achieves competitive or superior results on ImageNet and downstream datasets while being easier to deploy and train than transformers. The repository provides pretrained models, training recipes, and ablation studies demonstrating how incremental design choices collectively yield state-of-the-art performance.

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
17

ConvNet Burden

Memory consumption and FLOP count estimates for convnets

convnet-burden is a MATLAB toolbox / script collection that estimates the computational cost (FLOPs) and memory consumption of various convolutional neural network architectures. It lets users compute approximate burdens (in FLOPs and memory) for standard image classification CNN models (e.g. ResNet, VGG) based on their network definitions. The tool helps researchers compare the computational efficiency of architectures or quantify resource needs. Features include estimation of FLOPs (floating point operations), estimation of memory consumption (e.g. feature map sizes, parameter storage), and support for multiple network definitions/architectures.
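The kind of estimate described above can be sketched for a single convolutional layer. The toolbox itself is written in MATLAB; the Python below is only an illustration with hypothetical names. It assumes a square kernel, float32 storage (4 bytes per value), and counts multiply-accumulate operations. Counting conventions vary between tools, and the convention used by convnet-burden is not stated here.

```python
def conv_layer_burden(in_h, in_w, in_ch, out_ch, kernel, stride=1, padding=0):
    """Approximate cost of one conv layer (hypothetical helper, not the toolbox's API)."""
    out_h = (in_h + 2 * padding - kernel) // stride + 1
    out_w = (in_w + 2 * padding - kernel) // stride + 1
    # Each output value needs kernel * kernel * in_ch multiply-accumulates.
    macs = out_h * out_w * out_ch * kernel * kernel * in_ch
    # Weights plus one bias per output channel.
    params = kernel * kernel * in_ch * out_ch + out_ch
    # Output feature map stored as float32 (assumed 4 bytes per value).
    feature_bytes = out_h * out_w * out_ch * 4
    return {
        "output_shape": (out_h, out_w, out_ch),
        "macs": macs,
        "params": params,
        "feature_bytes": feature_bytes,
    }


# Example: a 3x3 convolution with 64 filters on a 224x224 RGB image.
burden = conv_layer_burden(224, 224, 3, 64, kernel=3, stride=1, padding=1)
```

Summing such per-layer figures over a network definition gives a whole-model estimate.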

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
18

DETR

End-to-end object detection with transformers

PyTorch training code and pretrained models for DETR (DEtection TRansformer). We replace the full complex hand-crafted object detection pipeline with a Transformer, and match Faster R-CNN with a ResNet-50, obtaining 42 AP on COCO using half the computation power (FLOPs) and the same number of parameters. Inference takes 50 lines of PyTorch. Unlike traditional computer vision techniques, DETR approaches object detection as a direct set prediction problem. It consists of a set-based global loss, which forces unique predictions via bipartite matching, and a Transformer encoder-decoder architecture. Given a fixed small set of learned object queries, DETR reasons about the relations of the objects and the global image context to directly output the final set of predictions in parallel. Due to this parallel nature, DETR is very fast and efficient.
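The bipartite matching mentioned above pairs each ground-truth object with exactly one prediction so that the total matching cost is minimal. The sketch below illustrates the idea by brute force over a tiny, made-up cost matrix. It is not DETR's code; practical implementations typically use the Hungarian algorithm instead of enumerating permutations, and the function name and cost values here are hypothetical.

```python
from itertools import permutations


def match_predictions(cost):
    """Find the cheapest one-to-one assignment of targets to predictions.

    cost[p][t] is the cost of matching prediction p to target t.
    Returns (assignment, total) where assignment[t] is the matched prediction.
    Brute force: only suitable for very small inputs.
    """
    num_predictions = len(cost)
    num_targets = len(cost[0])
    best_assignment, best_total = None, float("inf")
    for candidate in permutations(range(num_predictions), num_targets):
        total = sum(cost[p][t] for t, p in enumerate(candidate))
        if total < best_total:
            best_assignment, best_total = candidate, total
    return best_assignment, best_total


# Toy costs (hypothetical): 3 predictions (rows) versus 2 ground-truth objects (columns).
cost = [
    [0.9, 0.2],
    [0.1, 0.8],
    [0.5, 0.6],
]

assignment, total_cost = match_predictions(cost)
# Predictions left unmatched are the ones trained to output "no object".
unmatched = [p for p in range(len(cost)) if p not in assignment]
```

Because every target is tied to a different prediction, duplicate detections of the same object are penalized during training.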

Downloads: 0 This Week

Last Update: 2021-08-04
See Project
19

Data Fusion Peer

The Data Fusion Peer is a multitier computer vision internet application. The system provides image processing, motion tracking, and visualization information. The application converts data into 3-dimensional and other digital environments.

Downloads: 0 This Week

Last Update: 2013-03-21
See Project
20

Datasets

Hub of ready-to-use datasets for ML models

Datasets is a library for easily accessing and sharing datasets and evaluation metrics for Natural Language Processing (NLP), computer vision, and audio tasks. Load a dataset in a single line of code, and use our powerful data processing methods to quickly get your dataset ready for training in a deep learning model. Backed by the Apache Arrow format, it processes large datasets with zero-copy reads and without memory constraints, for optimal speed and efficiency. We also feature a deep integration with the Hugging Face Hub, allowing you to easily load and share a dataset with the wider NLP community. There are currently over 2658 datasets and more than 34 metrics available. Datasets frees the user from RAM limitations: all datasets are memory-mapped using an efficient zero-serialization cost backend (Apache Arrow). Smart caching: never wait for your data to process several times.

Downloads: 0 This Week

Last Update: 1 day ago
See Project
21

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures! Optimization courses which form the foundation for ML, DL, RL. Computer Vision courses which are DL & ML heavy. Speech recognition courses which are DL heavy. Structured Courses on Geometric, Graph Neural Networks. Section on Autonomous Vehicles. Section on Computer Graphics with ML/DL focus.

Downloads: 0 This Week

Last Update: 2022-07-29
See Project
22

Deep Learning with PyTorch

Latest techniques in deep learning and representation learning

This course concerns the latest techniques in deep learning and representation learning, focusing on supervised and unsupervised deep learning, embedding methods, metric learning, convolutional and recurrent nets, with applications to computer vision, natural language understanding, and speech recognition. The prerequisites include DS-GA 1001 Intro to Data Science or a graduate-level machine learning course. To follow the exercises, you will need a laptop with Miniconda (a minimal version of Anaconda) and several Python packages installed. The instructions work as is for Mac or Ubuntu Linux users; Windows users need to install and work in the Git BASH terminal. JupyterLab has a built-in selectable dark theme, so you only need to install something if you want to use the classic notebook interface.

Downloads: 0 This Week

Last Update: 2021-10-12
See Project
23

Detectron

FAIR's research platform for object detection research

Detectron is an object detection and instance segmentation research framework that popularized many modern detection models in a single, reproducible codebase. Built on Caffe2 with custom CUDA/C++ operators, it provided reference implementations for models like Faster R-CNN, Mask R-CNN, RetinaNet, and Feature Pyramid Networks. The framework emphasized a clean configuration system, strong baselines, and a “model zoo” so researchers could compare results under consistent settings. It includes training and evaluation pipelines that handle multi-GPU setups, standard datasets, and common augmentations, which helped standardize experimental practice in detection research. Visualization utilities and diagnostic scripts make it straightforward to inspect predictions, proposals, and losses while training. Although the project has since been superseded by Detectron2, the original Detectron remains a historically important, reproducible reference that still informs many production systems.

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
24

Diffgram

Training data (data labeling, annotation, workflow) for all data types

Diffgram is a single application that covers everything from ingesting data to exploring it, annotating it, and managing workflows, improving your data labeling and bringing all aspects of training data under a single roof. Diffgram is the world’s first truly open source training data platform that focuses on giving its users an unlimited experience. It aims to reduce your data labeling bills and increase your training data quality. Training data is the art of supervising machines through data. This includes annotation, which produces structured data ready to be consumed by a machine learning model. Annotation is required because raw media is considered unstructured and not usable without it. That’s why training data is required for many modern machine learning use cases, including computer vision, natural language processing, and speech recognition.

Downloads: 0 This Week

Last Update: 2024-10-14
See Project
25

Diglo

Diglo is a Music Information Retrieval System based on Computer Vision and Audio Spectrum Analysis, using algorithmic operations to find emergent patterns in musical performance. It also functions as a low-cost Motion Capture Analysis system.

Downloads: 0 This Week

Last Update: 2015-11-09
See Project
