Browse free open source OCR software and projects for Linux below. Use the toggles on the left to filter open source OCR software by OS, license, language, programming language, and project status.

  • 1
    NAPS2 - Not Another PDF Scanner

    Scan documents to PDF and other file types, as simply as possible.

    Visit NAPS2's home page at www.naps2.com. NAPS2 is a document scanning application with a focus on simplicity and ease of use. Scan your documents from WIA- and TWAIN-compatible scanners, organize the pages as you like, and save them as PDF, TIFF, JPEG, PNG, and other file formats. Available on Windows, Mac, and Linux. NAPS2 is currently available in over 40 different languages. Want to see NAPS2 in your preferred language? Help translate! See the wiki for more details.
    Downloads: 563 This Week
  • 2
    VideoSubFinder
    The main purpose of this program is to extract hardcoded subtitles (hardsub) from video. It provides two main features: 1) Autodetection of frames with hardcoded text (hardsub) in the video, saving their timing positions. 2) Generation of text images cleared of background, which can then be passed to OCR programs (such as FineReader, Subtitle Edit, or Google Drive) to generate complete subtitles with the original text and timing. Running this program on Windows requires the "Microsoft Visual C++ Redistributable runtime libraries 2022": https://support.microsoft.com/en-us/help/2977003/the-latest-supported-visual-c-downloads The latest versions were built and tested on: Windows 10 x64, Ubuntu 20.04.5 LTS, openSUSE Leap 15.4, Arch Linux (EndeavourOS Cassini Nova 03-2023). For faster support with bug fixes, contact the author at: https://vk.com/skosnits To donate: https://sourceforge.net/projects/videosubfinder/donate
    Downloads: 534 This Week
  • 3
    tesseract-ocr alternative download

    Alternative download for tesseract-ocr project
    Downloads: 1,279 This Week
  • 4
    gImageReader

    A graphical frontend to tesseract-ocr

    gImageReader is a simple Gtk/Qt front-end to tesseract. Features include:
    - Import PDF documents and images from disk, scanning devices, clipboard and screenshots
    - Process multiple images and documents in one go
    - Manual or automatic recognition area definition
    - Recognize to plain text or to hOCR documents
    - Recognized text displayed directly next to the image
    - Post-process the recognized text, including spellchecking
    - Generate PDF documents from hOCR documents
    **Note**: This page is only a mirror for the downloads. Development happens on GitHub at https://github.com/manisandro/gImageReader; release binaries are also posted there.
    Downloads: 193 This Week
  • 5
    Provides optical character recognition (OCR) solutions for Vietnamese language.
    Downloads: 205 This Week
  • 6
    A GUI to ease the process of producing a multipage PDF from a scan. gscan2pdf should work on almost any Linux/BSD machine.
    Downloads: 189 This Week
  • 7
    A Java JNA wrapper for Tesseract OCR API
    Downloads: 138 This Week
  • 8
    CD+Graphics Magic
    Timeline based editor for creating Compact Disc Subcode Graphics (also known as CD+G or CDG). Both karaoke and multimedia styles of content are supported. Please visit cdgmagic.sf.net for examples playable directly in the HTML5 CD+G player. CD+Graphics Scribe utility (separate download -- click "Browse All Files" above) can now convert existing CDG karaoke content to CMP (CD+Graphics Magic Project), LRC (Enhanced Lyrics), and ASS (Advanced SubStation Alpha) format.
    Downloads: 30 This Week
  • 9

    Image To Text tools

    ITTT is a free tool designed to scan and extract text from images.

    Image To Text Tools is a 100% free, user-friendly tool designed to scan images and extract the text they contain into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Downloads: 38 This Week
  • 10
    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    Linux-Intelligent-OCR-Solution (Lios) is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources, such as PDFs, images, folders containing images, or screenshots. The program is fully accessible to visually impaired users. A Tesseract Trainer GUI is also shipped with this package. Forum: https://groups.google.com/forum/#!forum/lios Video tutorial: https://www.youtube.com/playlist?list=PLn29o8rxtRe1zS1r2-yGm1DNMOZCgdU0i Tesseract training tutorial (beta): https://www.youtube.com/watch?v=qLpCld4cdtk Source code: GitHub: https://github.com/Nalin-x-Linux/lios-3 GitLab: https://gitlab.com/Nalin-x-Linux/lios-3 A user guide is available on the download page.
    Downloads: 6 This Week
  • 11
    Java OCR
    Java OCR is a suite of pure Java libraries for image processing and character recognition. Its small memory footprint and lack of external dependencies make it suitable for Android development. It provides a modular structure for easier deployment.
    Downloads: 4 This Week
  • 12

    phpSANE

    Web-Based Frontend for SANE

    phpSANE is a web-based frontend for SANE written in HTML/PHP so you can scan with your web-browser. It also supports OCR.
    Downloads: 1 This Week
  • 13
    An image postprocessor for the DIY Book Scanner described on instructables.com and diybookscanner.org. Gets images ready for OCR or for PDF. Written in Java based on a partial port of the Leptonica image processing library.
    Downloads: 3 This Week
  • 14
    FormRead

    Free OMR - OCR web software based on JavaScript and PHP

    https://formread.org FormRead is a completely free OMR (optical mark recognition) web application for scanning and grading user-filled, multiple-choice forms. Create your forms with any office or drawing tool, scan them, and parameterize their coordinates. Once a form is parameterized, you can print as many copies as you need, hand them to your students or respondents, scan and recognize the completed forms with FormRead, and export the data in your preferred format (Excel, PDF, CSV).
    Downloads: 5 This Week
  • 15
    eSearch

    screen recognition and search

    Downloads: 5 This Week
  • 16

    cbrTekStraktor

    an application to automatically extract text from comic books.

    cbrTekStraktor is an application that automatically extracts text from the text bubbles or speech balloons in comic book reader (CBR) files. Its prime goal is to support analysis of comic book texts, but it can also be used for scanlation or similar purposes. Users can also manually define text areas in CBR files, and a simple graphical editor is included for further processing of the extracted text. Text extraction combines statistical and graphical processing operations based on three major algorithms: - Binarization of color images (Niblack and other methods) - Connected components - K-Means clustering. Tesseract is used to perform Optical Character Recognition on the extracted text. A subsequent version of the application will integrate with translation software to provide automated translation of comic book texts and re-insertion of the translated texts.
    Downloads: 4 This Week
  • 17
    OcrGui
    A GUI for OCR programs.
    Downloads: 2 This Week
  • 18
    SubHub

    Post-OCR correction tool for SRT subtitles

    A post-OCR correction tool for SRT subtitle files.
    Downloads: 2 This Week
  • 19

    cuneiformplus

    Fork of OCR software cuneiform

    Fork of the OCR software cuneiform. For the original software, by Cognitive Technologies and Jussi Pakkanen, see: https://launchpad.net/cuneiform-linux Other open source OCR software: * Tesseract by Ray Smith (using the Leptonica image library) * GOCR * OCRAD
    Downloads: 2 This Week
  • 20
    Tifftool is a high-performance tool to clean scanned documents in preparation for onscreen display or for OCR. Features include skew correction, orientation correction, despeckle, page alignment, split pages and batch processing.
    Downloads: 2 This Week
  • 21
    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contained packages need neither a Java environment nor installation. Jar packages need Java 16 or higher.
    Downloads: 1 This Week
  • 22
    Tested on Ubuntu Maverick. Features: - Create audiobooks from eBooks, text, or pictures. - Read eBooks or text aloud while scrolling through pages.
    Downloads: 1 This Week
  • 23
    opendias
    NB: openDIAS is moving away from SF.net. Please visit the homepage link for the most up-to-date information, support, and files. Document Imaging Archive System: home document imaging, with OCR. Scan documents (with SANE) or import ODF documents, then assign tags. Use openDIAS to store all your letters, bills, statements, etc. in a convenient, safe, and easily retrievable way.
    Downloads: 1 This Week
  • 24
    Akshara Malayalam OCR is a project to develop an OCR system for printed and handwritten documents in the Malayalam language. It is inspired by similar OCR software for other languages.
    Downloads: 0 This Week
  • 25
    The Common OCR Service Interface. COSI is an API that allows developers to easily bring OCR (Optical Character Recognition) capabilities to image processing applications. COSI supports existing OCR tools such as Tesseract, GOCR, and GNU Ocrad.
    Downloads: 0 This Week