Suggested Categories:

Computer Vision Software
Computer vision software allows machines to interpret and analyze visual data from images or videos, enabling applications like object detection, image recognition, and video analysis. It utilizes advanced algorithms and deep learning techniques to understand and classify visual information, often mimicking human vision processes. These tools are essential in fields like autonomous vehicles, facial recognition, medical imaging, and augmented reality, where accurate interpretation of visual input is crucial. Computer vision software often includes features for image preprocessing, feature extraction, and model training to improve the accuracy of visual analysis. Overall, it enables machines to "see" and make informed decisions based on visual data, revolutionizing industries with automation and intelligence.
AI Vision Models
AI vision models, also known as computer vision models, are designed to enable machines to interpret and understand visual information from the world, such as images or video. These models use deep learning techniques, often employing convolutional neural networks (CNNs), to analyze patterns and features in visual data. They can perform tasks like object detection, image classification, facial recognition, and scene segmentation. By training on large datasets, AI vision models improve their accuracy and ability to make predictions based on visual input. These models are widely used in fields such as healthcare, autonomous driving, security, and augmented reality.
Optometry Software
Optometry software helps optometrists and eye care clinics manage their practice more efficiently by automating tasks such as patient scheduling, electronic health records (EHR), billing, and inventory management. These platforms often include features like patient history tracking, eye exam results management, prescription generation, and vision correction analysis. Optometry software can also integrate with diagnostic equipment and offer tools for creating reports, managing insurance claims, and handling appointments. By using this software, eye care professionals can improve patient care, streamline administrative processes, and ensure better organization within their practices.
Data Labeling Software
Data labeling software is a tool that assists in the organization and categorization of large datasets. Data labeling tools enable data to be labeled with relevant tags depending on the purpose such as for machine learning, image annotation, or text classification. Data labeling software can also assist in categorizing input from customers so businesses can better understand their needs and preferences. The software typically comes with different features such as automated labeling, collaboration tools, and scaleable solutions to handle larger datasets.
Eye Tracking Software
Eye tracking software monitors and analyzes eye movements and gaze patterns to understand user attention, focus, and behavior. It uses specialized cameras and sensors to capture where and how long a person looks at specific areas on screens, physical environments, or products. This software is widely used in usability testing, market research, psychology, gaming, and assistive technologies to improve user experience, design, and accessibility. Features often include heatmaps, gaze plots, fixation analysis, and real-time tracking data visualization. Eye tracking software provides valuable insights into visual engagement and cognitive processes.
Artificial Intelligence Software
Artificial Intelligence (AI) software is computer technology designed to simulate human intelligence. It can be used to perform tasks that require cognitive abilities, such as problem-solving, data analysis, visual perception and language translation. AI applications range from voice recognition and virtual assistants to autonomous vehicles and medical diagnostics.
  • 1
    PXL Vision

    PXL Vision

    PXL Vision

    PXL Vision revolutionizes digital identity verification, automating customer onboarding and KYC processes to increase conversion rates. As the Swiss market leader for digital identity verification, our flexible solutions utilize efficient, AI-based ID checks as a SaaS or on-premise solutions. With our patented technologies, we ensure fast, reliable, and user-friendly identification processes that seamlessly integrate into existing workflows.
  • 2
    UI.Vision RPA

    UI.Vision RPA

    UI.Vision

    Easy automation for busy people. The UI Vision free RPA software (formerly Kantu) automates web and desktop apps on Windows, Mac, and Linux. UI.Vision RPA is a free open-source browser extension that can be extended with local apps for desktop UI automation. The UI Vision core is open-source and guarantees enterprise-grade security. Your data never leaves your machine. Join 100,000+ users and automate workflows on your desktop and in the browser.
    Starting Price: Free
  • 3
    SimpleCV

    SimpleCV

    SimpleCV

    SimpleCV is an open-source framework for building computer vision applications. With it, you get access to several high-powered computer vision libraries such as OpenCV, without having to first learn about bit depths, file formats, color spaces, buffer management, eigenvalues, or matrix versus bitmap storage. This is computer vision made easy. These are just a small number of things you can do with SimpleCV.
  • 4
    Hunyuan-Vision-1.5
    HunyuanVision is a cutting-edge vision-language model developed by Tencent’s Hunyuan team. It uses a mamba-transformer hybrid architecture to deliver strong performance and efficient inference in multimodal reasoning tasks. The version Hunyuan-Vision-1.5 is designed for “thinking on images,” meaning it not only understands vision+language content, but can perform deeper reasoning that involves manipulating or reflecting on image inputs, such as cropping, zooming, pointing, box drawing, or drawing on the image to acquire additional knowledge. ...
    Starting Price: Free
  • 5
    FABIMAGE

    FABIMAGE

    Opto Engineering

    ...Data-flow-based software. Fast and optimized algorithms. 1000+ high-performance functions. Custom machine vision filters. There are over 1000 ready-for-use machine filters tested and optimized on hundreds of applications. They have many advanced capabilities such as outlier suppression, subpixel precision or any-shape region-of-interest. FabImage® Studio is a GigE Vision compliant product, supporting the GenTL interface, as well as a number of vendor-specific APIs.
  • 6
    intuVision VA

    intuVision VA

    intuVision

    intuVision VA offers an all-in-one, server side video analytics solution to meet a wide range of requirements, with application modules in security, retail, parking, traffic, manufacturing, and face & text detection. intuVision VA is fully integrated with popular video management systems (VMS) to add intelligence to your VMS, to analyze video and generate alerts or collect object and event data.
  • 7
    OpenCV

    OpenCV

    OpenCV

    OpenCV (Open Source Computer Vision Library) is an open-source computer vision and machine learning software library. OpenCV was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in commercial products. Being a BSD-licensed product, OpenCV makes it easy for businesses to utilize and modify the code.
    Starting Price: Free
  • 8
    IMPACT Software Suite
    ...All this can be done without the loss of flexibility, like traditional configurable systems, or the need for vast amounts of development time. IMPACT Software Suite also provides a Software Development Kit (SDK) that guarantees full integration of machine vision monitoring capabilities into HMI software applications. Vision Program Manager (VPM) provides hundreds of image processing and analysis functions. Use VPM to enhance images, locate features, measure objects, check for presence or absence, and read text and bar codes. Control Panel Manager (CPM) simplifies development of operator interfaces with the ability to make on-the-fly adjustments to critical machine controls. ...
  • 9
    EVLib

    EVLib

    Irida Labs

    EV Lib is a complete embedded vision software library based on deep learning and AI with functionalities for people, vehicle and object detection, identification tracking and 3D pose estimation.
  • 10
    Mobius Labs

    Mobius Labs

    Mobius Labs

    We make it easy to add superhuman computer vision to your applications, devices and processes to give you unassailable competitive advantage. No code, customizable & on-premise AI solutions.
  • 11
    All in One Accessibility

    All in One Accessibility

    Skynet Technologies USA LLC

    It is an AI based accessibility tool to enable websites to be accessible among people with hearing or vision impairments, motor impaired, color blind, dyslexia, cognitive & learning impairments, seizure & epileptic, ADHD, & elderly. It installs in just 2 minutes. It reduces the risk of time-consuming accessibility lawsuits by improving accessibility compliance for the standards WCAG 2.1, 2.2, ADA, Section 508, European EAA EN 301 549, ACA, California Unruh, Israeli Standard 5568, Australian DDA, UK Equality Act, AODA, Indian RPD Act, GIGW 3.0, France RGAA, German BITV, Brazilian Inclusion law LBI 13.146/2015, Spain UNE 139803:2012, JIS X 8341, Italian Stanca Act, & more. ...
    Starting Price: $25/month
    Partner badge
  • 12
    Hostinger Horizons
    ...Designed for creators, entrepreneurs, and developers who want results without complexity, our prompt based editor makes customization simple. As a Hostinger product, your project comes with built in hosting and easy one click deployment, giving you everything you need to bring your vision to life.
    Leader badge
    Starting Price: $9.99/month
  • 13
    CVEDIA

    CVEDIA

    CVEDIA

    CVEDIA-RT is our AI software stack that comes pre-installed with dozens of video analytics and computer vision solutions. It's easy to configure and customize to your use case, even if you're not a data scientist or developer. For a single low price, you have access to all of our AI solutions now and in the future. This means you can discover new use cases and expand your AI capabilities risk-free! If you couldn't find what you are looking for, or you want to run on another device, no problem. ...
    Starting Price: Free
  • 14
    Voxel51

    Voxel51

    Voxel51

    FiftyOne by Voxel51 - the most powerful visual AI and computer vision data platform. Without the right data, even the smartest AI models fail. FiftyOne gives machine learning engineers the power to deeply understand and evaluate their visual datasets—across images, videos, 3D point clouds, geospatial, and medical data. With over 2.8 million open source installs and customers like Walmart, GM, Bosch, Medtronic, and the University of Michigan Health, FiftyOne is an indispensable tool for building computer vision systems that work in the real world, not just in the lab. ...
    Starting Price: $0
  • 15
    MatConvNet
    ...It supports Windows, Mac OS X, and Linux. MatConvNet is a MATLAB toolbox implementing Convolutional Neural Networks (CNNs) for computer vision applications. It is simple, efficient, and can run and learn state-of-the-art CNNs. Many pre-trained CNNs for image classification, segmentation, face recognition, and text detection are available.
  • 16
    SmolVLM

    SmolVLM

    Hugging Face

    SmolVLM-Instruct is a compact, AI-powered multimodal model that combines the capabilities of vision and language processing, designed to handle tasks like image captioning, visual question answering, and multimodal storytelling. It works with both text and image inputs, providing highly efficient results while being optimized for smaller, resource-constrained environments. Built with SmolLM2 as its text decoder and SigLIP as its image encoder, the model offers improved performance for tasks that require integration of both textual and visual information. ...
    Starting Price: Free
  • 17
    Hugging Face Transformers
    ...A comprehensive trainer that supports features such as mixed precision, torch.compile, and FlashAttention for training and distributed training for PyTorch models.​ Fast text generation with large language models and vision language models. Every model is implemented from only three main classes (configuration, model, and preprocessor) and can be quickly used for inference or training.
    Starting Price: $9 per month
  • 18
    Qwen2.5-VL

    Qwen2.5-VL

    Alibaba

    Qwen2.5-VL is the latest vision-language model from the Qwen series, representing a significant advancement over its predecessor, Qwen2-VL. This model excels in visual understanding, capable of recognizing a wide array of objects, including text, charts, icons, graphics, and layouts within images. It functions as a visual agent, capable of reasoning and dynamically directing tools, enabling applications such as computer and phone usage.
    Starting Price: Free
  • 19
    Gravio

    Gravio

    Gravio

    Gravio enables new ways to connect and interact with your environment through the power of IoT, sensors, edge computing, computer vision, and AI without programming knowledge. Gravio is an easy-to-use software platform that runs on Windows, macOS, or Linux. You can connect to various inputs and outputs, including some bundled IoT sensors, computer vision/AI cameras, and MQTT or HTTP APIs. Gravio is very easy to use without software programming knowledge. Gravio unlocks the power of connected technologies by connecting sensors, input devices, cameras, and APIs within a space, then continuously gathering and sharing their information, enabling new ways to interact with, learn from and enhance a physical space. ...
    Starting Price: $4.99 per month
  • 20
    GLM-4.5V-Flash
    GLM-4.5V-Flash is an open source vision-language model, designed to bring strong multimodal capabilities into a lightweight, deployable package. It supports image, video, document, and GUI inputs, enabling tasks such as scene understanding, chart and document parsing, screen reading, and multi-image analysis. Compared to larger models in the series, GLM-4.5V-Flash offers a compact footprint while retaining core VLM capabilities like visual reasoning, video understanding, GUI task handling, and complex document parsing. ...
    Starting Price: Free
  • 21
    Rosepetal AI

    Rosepetal AI

    Rosepetal AI

    Rosepetal AI is an innovative technology company specializing in advanced artificial vision and deep-learning solutions designed specifically for industrial quality control. Our platform integrates dataset handling, automated labelling and training of adaptive neural networks, enabling real-time defect detection without requiring advanced technical expertise. This intuitive, no-code SaaS solution democratizes access to sophisticated AI, significantly enhancing efficiency, reducing waste, and driving operational excellence across multiple industries such as automotive, food processing, pharmaceuticals, plastics, and electronics. ...
    Starting Price: €250
  • 22
    GLM-4.1V

    GLM-4.1V

    Zhipu AI

    GLM-4.1V is a vision-language model, providing a powerful, compact multimodal model designed for reasoning and perception across images, text, and documents. The 9-billion-parameter variant (GLM-4.1V-9B-Thinking) is built on the GLM-4-9B foundation and enhanced through a specialized training paradigm using Reinforcement Learning with Curriculum Sampling (RLCS).
    Starting Price: Free
  • 23
    Falcon 2

    Falcon 2

    Technology Innovation Institute (TII)

    Falcon 2 11B is an open-source, multilingual, and multimodal AI model, uniquely equipped with vision-to-language capabilities. It surpasses Meta’s Llama 3 8B and delivers performance on par with Google’s Gemma 7B, as independently confirmed by the Hugging Face Leaderboard. Looking ahead, the next phase of development will integrate a 'Mixture of Experts' approach to further enhance Falcon 2’s capabilities, pushing the boundaries of AI innovation.
    Starting Price: Free
  • 24
    OpenVINO
    The Intel® Distribution of OpenVINO™ toolkit is an open-source AI development toolkit that accelerates inference across Intel hardware platforms. Designed to streamline AI workflows, it allows developers to deploy optimized deep learning models for computer vision, generative AI, and large language models (LLMs). With built-in tools for model optimization, the platform ensures high throughput and lower latency, reducing model footprint without compromising accuracy. OpenVINO™ is perfect for developers looking to deploy AI across a range of environments, from edge devices to cloud servers, ensuring scalability and performance across Intel architectures.
    Starting Price: Free
  • 25
    SAFR

    SAFR

    SAFR from RealNetworks

    Unlock a new level of situational awareness with exceptionally accurate face recognition and additional face- and person-based computer vision features. SAFR delivers actionable insights that protect the health and safety of people everywhere. Designed as a standalone networked solution, SAFR SCAN provides SMB and enterprise-level users with uncompromised biometrics features and performance at an affordable price point. Its fast, frictionless throughput can authenticate up to 30 individuals per minute, making it ideal for high-volume applications in office building lobbies, professional offices, secured employee entrances and more. ...
  • 26
    Kognition

    Kognition

    Kognition AI

    Kognition AI security stops threats in real-time. Transform legacy security into intelligent protection that pays for itself. Kognition AI integrates seamlessly with existing cameras and access control - no costly rip-and-replace required. Why Security Leaders Choose Us: ✓ 24/7 AI Guardian that never misses threats or calls in sick ✓ Works with Axis, Hanwha, Avigilon, Genetec, Milestone, and other popular platforms and devices. ✓ Real-time alerts deliver actionable intelligence in...
    Starting Price: $10,000
    Partner badge
  • 27
    SuperAnnotate

    SuperAnnotate

    SuperAnnotate

    SuperAnnotate is the world's leading platform for building the highest quality training datasets for computer vision and NLP. With advanced tooling and QA, ML and automation features, data curation, robust SDK, offline access, and integrated annotation services, we enable machine learning teams to build incredibly accurate datasets and successful ML pipelines 3-5x faster. By bringing our annotation tool and professional annotators together we've built a unified annotation environment, optimized to provide integrated software and services experience that leads to higher quality data and more efficient data pipelines.
  • 28
    Pixtral Large

    Pixtral Large

    Mistral AI

    Pixtral Large is a 124-billion-parameter open-weight multimodal model developed by Mistral AI, building upon their Mistral Large 2 architecture. It integrates a 123-billion-parameter multimodal decoder with a 1-billion-parameter vision encoder, enabling advanced understanding of documents, charts, and natural images while maintaining leading text comprehension capabilities. With a context window of 128,000 tokens, Pixtral Large can process at least 30 high-resolution images simultaneously. The model has demonstrated state-of-the-art performance on benchmarks such as MathVista, DocVQA, and VQAv2, surpassing models like GPT-4o and Gemini-1.5 Pro. ...
    Starting Price: Free
  • 29
    AskUI

    AskUI

    AskUI

    AskUI is an innovative platform that enables AI agents to visually perceive and interact with any computer interface, facilitating seamless automation across various operating systems and applications. Leveraging advanced vision models, AskUI's PTA-1 prompt-to-action model allows users to execute AI-driven actions on Windows, macOS, Linux, and mobile devices without the need for jailbreaking. This technology is particularly beneficial for tasks such as desktop and mobile automation, visual testing, and document or data processing. By integrating with tools like Jira, Jenkins, GitLab, and Docker, AskUI enhances workflow efficiency and reduces the burden on developers. ...
  • 30
    Mistral Small

    Mistral Small

    Mistral AI

    ...The company also unveiled Mistral Small v24.09, a 22-billion-parameter model offering a balance between performance and efficiency, suitable for tasks like translation, summarization, and sentiment analysis. Furthermore, they made Pixtral 12B, a vision-capable model with image understanding capabilities, freely available on "Le Chat," allowing users to analyze and caption images without compromising text-based performance.
    Starting Price: Free
  • Previous
  • You're on page 1
  • 2
  • Next