Open Source Java Text Processing Software

Java Text Processing Software

Browse free open source Java Text Processing Software and projects below.

  • 1
    Command-line, Ant-task, and embeddable text file preprocessor. Supports macros, flow control, and expressions, with recursive directory processing. Extensible in Java to display data from any data source (such as a database). Can generate complete homepages (trees of HTML files, images, etc.).
    Downloads: 83 This Week
  • 2
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions of iText build on the success of previous versions and feature an improved document engine, high and low-level programming capabilities, and a more efficient modular structure. iText represents the next level for developers looking to leverage PDF in document workflows. The main project page for iText is now on GitHub, and all the latest releases, code samples, open source add-ons and tools, etc. can be found at https://github.com/itext/.
    Downloads: 182 This Week
  • 3

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is a tool for concatenating PDF files. It can concatenate, extract, encrypt, decrypt, and configure PDF files, and convert image files to PDF. Both GUI and CUI versions are available. iText.NET is a port of iText to the .NET Framework using J#. This library allows you to generate PDF, (X)HTML, XML, and RTF files on the Microsoft .NET Framework, including ASP.NET.
    Downloads: 42 This Week
  • 4
    Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 24 This Week
  • 5
    PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
    Downloads: 13 This Week
  • 6
    PDF Clown

    General-Purpose PDF Library for Java and .NET

    PDF Clown is a general-purpose Java and .NET library for manipulating PDF files through multiple abstraction layers, rigorously adhering to the PDF 1.7 specification (ISO 32000-1). This project aims to provide universal access to PDF files (creation, reading, editing, rendering...) through an accurate and elegant object-oriented API.

    * Features: http://pdfclown.org/overview/features/
    * Overview: http://pdfclown.org/overview/architecture/
    * Website: http://pdfclown.org/
    * Blog: http://www.pdfclown.org/blog/
    * Twitter: https://twitter.com/PDFClown
    Downloads: 13 This Week
  • 7
    FCKeditor

    FCKeditor (retired)

    FCKeditor is the previous version of CKEditor and was discontinued after version 2. The new CKEditor is redesigned from the ground up, offering more WYSIWYG text editing features, enhanced security and better integration. Don't hold yourself back with the retro FCKeditor. Switch to the new, cool CKEditor at ckeditor.com.
    Downloads: 7 This Week
  • 8
    JODReports is a solution for generating dynamic documents and reports in Java based on the OpenDocument format (ODF). Templates can be easily composed with a word processor such as OpenOffice.org Writer. Data sources include POJOs and XML.
    Downloads: 6 This Week
  • 9
    Early Access iText, a PDF generation library in Java
    Downloads: 8 This Week
  • 10
    Web Book Downloader

    Download websites as e-books: PDF, TXT, EPUB.

    This application lets the user download chapters from a website in three ways: from a table of contents; from a range, given the addresses of the first and last chapters; or by crawling from the first chapter to chapter n. In the settings you can customize the language and the input (website) encoding; for simplicity, the output uses the same encoding. To add your own language, add a new class to the strings package, and add new fields to the Settings class and the GUI menu (initialize method).
    Downloads: 7 This Week
  • 11
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techniques. A command-line executable is shipped that allows documents to be sorted by codepage.
    Downloads: 5 This Week
  • 12
    Babeldoc is an integration tool that can plumb together data flows. It is completely configurable and scriptable. It is heavily XML-biased, but not exclusively so.
    Downloads: 2 This Week
  • 13
    Helium (He) is a lightweight and extremely useful templating engine based entirely on XML. He is 100% Java and 100% TDD (Test Driven Development).
    Downloads: 1 This Week
  • 14
    CPLed is an OpenSIPS tool for editing CPL scripts in a friendly and easy graphical way. It can be used as a standalone application or embedded in a web page as an applet. It also provides CPL script transport functionality via the SIP and HTTP protocols.
    Downloads: 0 This Week
  • 15
    A software tool to discover the names of people in electronic documents and HTML markup. Note the use of the word 'discover' rather than 'search'. Using this tool, the association between names in documents can be inferred.
    Downloads: 0 This Week
  • 16
    Chaperon is an LALR(1) parser, which parses structured text documents and generates XML documents as output. It includes a parser generator like yacc and a regex scanner like lex. As input, Chaperon uses a grammar written in XML.
    Downloads: 0 This Week
  • 17
    I have stopped developing this project, as I found DokuWiki to be exactly what I was looking for when starting to develop Codeslang. Sorry.
    Downloads: 0 This Week
  • 18
    (The Bliki engine Wikipedia API has now moved to http://code.google.com/p/gwtwiki ). Eclipse plugin: converts Wikipedia syntax to HTML. Features: syntax highlighting, content outline and assist, templates, HTML preview and insert, Java/PHP link2wiki, PDF creation.
    Downloads: 0 This Week
  • 19
    Prototype for a framework and user interface for combining various structured search and document clustering techniques.
    Downloads: 0 This Week
  • 20
    A simple converter from Fiction Book files (www.fictionbook.org) to the Palm ZText pdb format.
    Downloads: 0 This Week
  • 21
    Fast local file search using Lucene, HTMLParser and Highlighter. Now supports Chinese.
    Downloads: 0 This Week
  • 22
    The most powerful non-commercial translation memory software (TM tool) with enhanced capabilities such as networking/collaboration (HTTP, RPC), encoding conversion, project management, email with attachments, file tree diff, etc.
    Downloads: 0 This Week
  • 23
    A basic HTML editor that I have been developing, currently in the first stages of development. I hope to make this a very easy-to-use text/HTML editor, and I have begun adding insert functions for common HTML tags.
    Downloads: 0 This Week
  • 24
    Integradata is a pluggable, rules-based, declarative data validation system written in Java.
    Downloads: 0 This Week
  • 25
    JCurricula is a web application designed to build a professional and good-looking curriculum vitae. JCurricula creates a CV in PDF format, ready to be printed or viewed. JCurricula is a Java servlet.
    Downloads: 0 This Week