Browse free open source Java OCR Software and projects below. Use the toggles on the left to filter open source Java OCR Software by OS, license, language, programming language, and project status.

  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • $300 in Free Credit for Your Google Cloud Projects Icon
    $300 in Free Credit for Your Google Cloud Projects

    Build, test, and explore on Google Cloud with $300 in free credit. No hidden charges. No surprise bills.

    Launch your next project with $300 in free Google Cloud credit—no hidden charges. Test, build, and deploy without risk. Use your credit across the Google Cloud platform to find what works best for your needs. After your credits are used, continue building with free monthly usage products. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    OpenKM Document Management - DMS

    OpenKM Document Management - DMS

    Document Management System and Content Management System

    OpenKM Community Edition is a free Document Management System (DMS) that helps businesses control the production, storage, management and distribution of electronic documents, boosting effectiveness and productivity. It integrates document management, collaboration and advanced search into one easy-to-use solution, including administration tools for user roles, access control, security levels, activity logs and automation setup. With OpenKM Community Edition you can: Collect information from any digital source. Collaborate with colleagues on documents and projects. Capitalize on accumulated knowledge by locating documents and information sources. Control business processes with an embedded workflow engine. Automate tasks. For a complete feature list visit: http://goo.gl/au8cQy
    Leader badge
    Downloads: 414 This Week
    Last Update:
    See Project
  • 2
    Provides optical character recognition (OCR) solutions for Vietnamese language.
    Leader badge
    Downloads: 133 This Week
    Last Update:
    See Project
  • 3
    A Java JNA wrapper for Tesseract OCR API
    Leader badge
    Downloads: 72 This Week
    Last Update:
    See Project
  • 4
    chessPDFBrowser

    chessPDFBrowser

    Chess application whichs allows working with chess PDF books and PGNs.

    Chess application which allows working with PDFs and PGNs. You can work with the chess games of the PDF and edit their tree of variants. Graphical environment. Standard PGN TAGs. PGN comments. Ocr like (Fen string detection from chess board position images). Connection to Uci chess engines (like stockfish). Position analysis, full game analysis. You can now play games against uci engines. pdf2pgn command line command included. Detailed documentation. Multilanguage currently support for English, Spanish and Catalan. Dark mode option. JDK-17 compatibility
    Downloads: 49 This Week
    Last Update:
    See Project
  • Easily Host LLMs and Web Apps on Cloud Run Icon
    Easily Host LLMs and Web Apps on Cloud Run

    Run everything from popular models with on-demand NVIDIA L4 GPUs to web apps without infrastructure management.

    Run frontend and backend services, batch jobs, host LLMs, and queue processing workloads without the need to manage infrastructure. Cloud Run gives you on-demand GPU access for hosting LLMs and running real-time AI—with 5-second cold starts and automatic scale-to-zero so you only pay for actual usage. New customers get $300 in free credit to start.
    Try Cloud Run Free
  • 5
    An image postprocessor for the DIY Book Scanner described on instructables.com and diybookscanner.org. Gets images ready for OCR or for PDF. Written in Java based on a partial port of the Leptonica image processing library.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    Java OCR
    Java OCR is a suite of pure java libraries for image processing and character recognition. Small memory footprint and lack of external dependencies makes it suitable for android development. Provides modular structure for easier deployment
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    MyBox

    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contain packages need not java env nor installation. Jar packages need Java 16 or higher.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8

    File-em

    File-'em is an automatic receipts organizer implemented in Java & SWT.

    File-'em (pronounced like phylum) is an open source alternative to the software behind NeatReceipts?®. It allows you to load in scanned receipts and automatically pulls the information out of the receipt using OCR and stores it in a SQLite database for easy reference, reports, and retrieval.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9

    Immutable Sparse Wave Trees (WaveTree)

    Realtime bigdata tool for bit strings up to 2^63 based on AVL forest

    Realtime bigdata tool at the bit level based on immutable AVL forest which can be run in memory or, in future versions, as a merkle forest like a blockchain. Main object is a sparse bit string (Bits) that efficiently scales up to 2^63 bits normally compressed as forest has duplicated substrings. Bits objects support reading bit, byte, short, int, or long (Java primitives) at any bit index in 64 bit range. Example: instead of building a class to hold a header and then data, represent all of that as Bits, subranges of them, and ints for sizes of its parts. Expansion ability for other kinds of compression, since Bits is a Java interface. Main functions on bits are substring, concat, number of 0 or 1 bits, and number of bits (size). All those operations can be done millions of times per second regardless of size because the AVL forest reuses existing branches recursively. Theres a scalar (originally for copy/pasting subranges of sounds) and a bit Java package. Sparse n dimensional matrix.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Run Any Workload on Compute Engine VMs Icon
    Run Any Workload on Compute Engine VMs

    From dev environments to AI training, choose preset or custom VMs with 1–96 vCPUs and industry-leading 99.95% uptime SLA.

    Compute Engine delivers high-performance virtual machines for web apps, databases, containers, and AI workloads. Choose from general-purpose, compute-optimized, or GPU/TPU-accelerated machine types—or build custom VMs to match your exact specs. With live migration and automatic failover, your workloads stay online. New customers get $300 in free credits.
    Try Compute Engine
  • 10
    OCR Software developed by acem students as their minor project
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Optical Character Recognition (OCR) software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Eye is an experimental OCR (image-to-text) application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    JOcrad is a graphical frontend for GNU/Ocrad written in Java. GNU Ocrad is an OCR (Optical Character Recognition) program based on a feature extraction method.JOcrad supports italian and english languages, JPG,PNG and GIF images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Joshi is a program that tries to recognize shapes on an image with a focus on OCR. It converts the image to vector graphics (polylines) and then tries to project these on stored vector graphics, calculating the best match.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Neuroph OCR - Handwriting Recognition
    Neuroph OCR - Handwriting Recognition is developed to recognize hand written letter and characters. It's engine derived's from the Java Neural Network Framework - Neuroph and as such it can be used as a standalone project or a Neuroph plug in.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Java Swing application to modify PNG graphics ahead of OCR.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Note as of 2013-09-13: I'm moving this project over to github due to this: http://www.gluster.org/2013/08/how-far-the-once-mighty-sourceforge-has-fallen/ Feel free to rejoin the more updated versions on https://github.com/mnott/PDFOCRWrapper Thanks. Matthias -- This is a wrapper written in Java that allows to recursively iterate a directory structure and call an OCR engine on each found PDF on the condition that it hat not yet been called for that PDF. It works well with the ABBYY OCR Engine for Linux.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    SecureJDMS is the attempt to develop a secure DMS, based on an RCP and (even untrusted) DB for data storage. For now, its all about managing scanned documents and searching them by content (using OCR). All data sent and stored will be strongly encrypted.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Socr3 is a plugin-oriented, open source platform upon which I'm building an OCR suite. The name Socr3 stands for "Open Source Optical Character Recognition, Reading, Rendering, and Exporting", and is subject to change in the future.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    TCR Neuroph -Text Character Recognition
    TCR Neuroph - Text Character Recognition is java tool developed to recognize scanned text , using Java Neural Network Framework - Neuroph
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    cbrTekStraktor

    an application to automatically extract text from comic books.

    cbrTekStraktor is an application to automatically extract text from the text bubbles or speech balloons present in comic book reader files (CBR). Its prime goal is to perform analysis on the texts of comic books. cbrTekStraktor can however also be used for scanlation or similar purposes. The application also enables to manually define text areas in CBR files. The application comprises a simple graphical editor for further processing the extracted text. The text extraction is achieved by a combination of statistical and graphical processing operations. It is based on the following 3 major algorithms - Binarization of color images (Niblak and other methods) - Connected components - K-Means clustering Apache Tesseract is used to perform Optical Character Recognition on the extracted text. A subsequent version of the application will integrate with translation software in order to provide automated translation of comic book texts and re-inserion of translated texts
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB