Skip to content
View mattbernst's full-sized avatar

Block or report mattbernst

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
11 stars written in Python
Clear filter

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 70,234 9,785 Updated Feb 4, 2026

Painless relocation of Linux binaries–and all of their dependencies–without containers.

Python 3,006 73 Updated Nov 5, 2023

SNIPER / AutoFocus is an efficient multi-scale object detection training / inference algorithm

Python 2,692 441 Updated Aug 22, 2021

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Python 2,253 371 Updated Jun 24, 2022

Python module for quantum chemistry

Python 1,518 664 Updated Feb 5, 2026

Tinfoil Chat - Onion-routed, endpoint secure messaging system

Python 1,305 88 Updated Jun 16, 2025

Tag manager and captioner for image datasets

Python 1,259 65 Updated Oct 11, 2025

LLM-based text extraction from unstructured data like PDFs, Words and HTMLs. Transform and cluster the text into your desired format. Less information loss, more interpretation, and faster R&D!

Python 233 61 Updated Sep 24, 2025

Automated, smooth, N'th order derivatives of non-uniformly sampled time series data

Python 230 8 Updated Oct 20, 2024

Your files ready for Gen AI ✨🚀 AlcheMark is a lightweight PDF to Markdown, alchemical-inspired toolkit that transmutes PDF documents into structured Markdown pages—complete with rich metadata and n…

Python 77 7 Updated Apr 28, 2025

Generates Fortran, C, and Python header files containing CODATA 2014 physical constants

Python 1 Updated Jan 21, 2018