Showing 265 open source projects for "audio convert"

View related business solutions
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • Cut Data Warehouse Costs up to 54% with BigQuery Icon
    Cut Data Warehouse Costs up to 54% with BigQuery

    Migrate from Snowflake, Databricks, or Redshift with free migration tools. Exabyte scale without the Exabyte price.

    BigQuery delivers up to 54% lower TCO than cloud alternatives. Migrate from legacy or competing warehouses using free BigQuery Migration Service with automated SQL translation. Get serverless scale with no infrastructure to manage, compressed storage, and flexible pricing—pay per query or commit for deeper discounts. New customers get $300 in free credit.
    Try BigQuery Free
  • 1
    HandBrake

    HandBrake

    A open source video to convert video from any format to modern codecs

    HandBrake is an open-source, GPL-licensed, multiplatform, multithreaded video transcoder, available for MacOS X, Linux and Windows.
    Downloads: 233 This Week
    Last Update:
    See Project
  • 2
    OpenAI.fm

    OpenAI.fm

    Code for openai.fm, a demo for the OpenAI Speech API

    OpenAI.fm is an official interactive demo application built to showcase the OpenAI Speech API and its advanced text-to-speech capabilities, providing developers and creators with a hands-on web interface to convert text into high-quality, customizable audio using state-of-the-art TTS models. Developed using Next.js and the OpenAI Speech API, this demo illustrates how the latest neural voice models can produce natural, expressive speech with adjustable styles and voices, highlighting features like emotional range, tone, and real-time playback. ...
    Downloads: 246 This Week
    Last Update:
    See Project
  • 3
    File Converter

    File Converter

    Simple tool which allows you to convert and compress files

    File Converter is a minimalist open‑source tool (GPL‑3.0) that lets users convert and compress one or multiple files directly via the Windows Explorer context menu. It integrates with powerful back-end utilities—FFmpeg, ImageMagick, Ghostscript—to handle a broad range of media and document transformations. File Converter is a personal open source project started in 2014. I have put hundreds of hours adding, refining and tuning File Converter with the goal of making the conversion and...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 4
    idonthavespotify

    idonthavespotify

    Effortlessly convert Spotify links to your preferred streaming service

    Copy a link from your favorite streaming service, paste it into the search bar, and voilà! Links to the track on all other supported platforms are displayed. If the original source is Spotify you'll even get a quick audio preview to ensure it's the right track.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Build on Google Cloud with $300 in Free Credit Icon
    Build on Google Cloud with $300 in Free Credit

    New to Google Cloud? Get $300 in free credit to explore Compute Engine, BigQuery, Cloud Run, Vertex AI, and 150+ other products.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query exabytes in BigQuery, or build AI apps with Vertex AI and Gemini. Once your credits are used, keep building with 20+ products with free monthly usage, including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. Sign up to start building right away.
    Start Free Trial
  • 5
    abogen

    abogen

    Generate audiobooks from EPUBs, PDFs and text with captions

    abogen is a tool designed to generate audiobooks (or speech narrations) from textual sources such as EPUBs, PDFs, or plain text, with synchronized captions. In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio. This can be very useful for accessibility, content consumption on...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 6
    ebook2audiobook

    ebook2audiobook

    Generate audiobooks from e-books, voice cloning & 1107+ languages

    ebook2audiobook is a tool to convert legally obtained eBooks (non-DRM) into fully narrated audiobooks, complete with chapters and metadata. It automates the pipeline: it reads the eBook file, splits it into appropriate segments (chapters, paragraphs), uses text-to-speech (TTS) models to synthesize audio, optionally applies voice cloning, and outputs a final audiobook — ideal for people who prefer listening over reading, or for accessibility purposes.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 7
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    A2M — Audio to MIDI

    A2M — Audio to MIDI

    A2M is a desktop app that converts AUDIO TO MIDI in one click.

    A2M (Audio To MIDI) is a simple desktop tool for transcribing local audio files into MIDI files with one click. It is designed primarily for piano recording transcription, and works best on solo piano recordings. Using A2M is straightforward: Select an audio file, click Convert, and the application generates a MIDI file automatically in your Downloads/A2M folder.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 9
    OpenAudible

    OpenAudible

    Audiobook Manager for Audible Users

    OpenAudible is a cross-platform audiobook manager designed for Audible users. Manage/Download all your audiobooks with this easy-to-use desktop application. Say goodbye to the hassle of managing your audiobooks across multiple devices. With OpenAudible, you can easily download, view, and manage all your Audible books in one place. Our lightning-fast conversion to MP3 and M4B audio formats makes it easy to enjoy your favorite books on any device. Plus, our automation features make it a breeze...
    Downloads: 10 This Week
    Last Update:
    See Project
  • Deploy Apps in Seconds with Cloud Run Icon
    Deploy Apps in Seconds with Cloud Run

    Host and run your applications without the need to manage infrastructure. Scales up from and down to zero automatically.

    Cloud Run is the fastest way to deploy containerized apps. Push your code in Go, Python, Node.js, Java, or any language and Cloud Run builds and deploys it automatically. Get fast autoscaling, pay only when your code runs, and skip the infrastructure headaches. Two million requests free per month. And new customers get $300 in free credit.
    Try Cloud Run Free
  • 10
    Qwen3-ASR

    Qwen3-ASR

    Qwen3-ASR is an open-source series of ASR models

    Qwen3-ASR is an automatic speech recognition system in the QwenLM family, developed to convert spoken language into text with strong accuracy and real-time performance. As a specialized ASR variant of the broader Qwen language model ecosystem, it focuses on capturing reliable transcriptions from audio sources such as recordings, live streams, or conversational inputs while supporting low latency use cases. The architecture combines advanced neural acoustic modeling with context-aware language prediction so that outputs maintain both fidelity to the original speech and grammatical coherence. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 11
    EPUB to Audiobook Converter

    EPUB to Audiobook Converter

    EPUB to audiobook converter, optimized for Audiobookshelf

    EPUB to Audiobook Converter is a tool designed to convert EPUB ebooks into chaptered audiobooks, optimized specifically for Audiobookshelf servers. It reads each chapter from an EPUB file, generates audio using a chosen text-to-speech backend, and outputs separate MP3 files with chapter titles preserved as metadata to make navigation easier. The project supports multiple TTS providers, including Microsoft Azure TTS, EdgeTTS, OpenAI TTS, local Piper, and Kokoro via an OpenAI-compatible endpoint, allowing users to choose between cloud and self-hosted voices. ...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 12
    ffmpeg.wasm

    ffmpeg.wasm

    FFmpeg for browser, powered by WebAssembly

    ffmpeg.wasm is a pure WebAssembly (and JavaScript/TypeScript) port of FFmpeg that enables in-browser media recording, conversion, and streaming—letting developers perform video/audio processing entirely client-side without server uploads. Transpiled via Emscripten from FFmpeg and its codecs into WebAssembly. Supports both single-threaded and multi-threaded cores using web workers. Written in TypeScript for improved developer experience.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 13
    SoX is the Swiss Army Knife of sound processing utilities. It can convert audio files to other popular audio file types and also apply sound effects and filters during the conversion.
    Leader badge
    Downloads: 19,115 This Week
    Last Update:
    See Project
  • 14
    wa-automate-nodejs

    wa-automate-nodejs

    WhatsApp tool for chatbots with advanced features

    wa-automate-nodejs is the most advanced NodeJS library which provides a high-level API to control WA. Want to convert your WA account to an API instantly? You can now with the CLI. For more details see Easy API. After executing create() function, @open-wa/wa-automate will create an instance of WA web. If you are not logged in, it will print a QR code in the terminal. Scan it with your phone and you are ready to go! @open-wa/wa-automate will remember the session so there is no need to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Audiblez

    Audiblez

    Generate audiobooks from e-books

    Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 16
    clone-voice

    clone-voice

    A sound cloning tool with a web interface, using your voice

    ...The tool supports around sixteen languages, including Chinese, English, Japanese, Korean, French, German, Italian, and others, and can capture reference voices directly from a microphone or from uploaded audio.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 17
    XLD

    XLD

    A tool for transcoding lossless audio files

    X Lossless Decoder(XLD) is a tool for Mac OS X that is able to decode/convert/play various 'lossless' audio files. The supported audio files can be split into some tracks with cue sheet when decoding. It works on Mac OS X 10.4 and later.
    Leader badge
    Downloads: 4,428 This Week
    Last Update:
    See Project
  • 18
    yt-dlp

    yt-dlp

    A youtube-dl fork with additional features and fixes

    yt-dlp is a youtube-dl fork based on the now inactive youtube-dlc. The main focus of this project is adding new features and patches while also keeping up to date with the original project
    Downloads: 378 This Week
    Last Update:
    See Project
  • 19
    Kid3 Tag Editor

    Kid3 Tag Editor

    Audio Tag Editor

    Kid3 audio tag editor can edit the tags of MP3, Ogg, FLAC, MPC & WMA files in an efficient way, convert between ID3v1 and ID3v2, set the tags of multiple files, generate tags from file names or vice versa and import from freedb, MusicBrainz and Disco
    Leader badge
    Downloads: 2,075 This Week
    Last Update:
    See Project
  • 20
    VidCoder

    VidCoder

    A Blu-ray, DVD and video file transcoder for Windows

    VidCoder is a Windows-based open-source video transcoding and ripping tool that provides a graphical interface built around standard command-line multimedia tools. It lets users convert video files (or rip DVDs/Blu-rays, when supported) into modern formats and codecs, making it useful for people who want to compress, re-encode, or transcode video content without dealing directly with low-level encoder settings. Because VidCoder integrates and automates the invocation of complex backend...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Krajee

    Krajee

    An enhanced HTML 5 file input for Bootstrap 5.x/4.x./3.x

    An enhanced HTML 5 file input for Bootstrap 5.x or Bootstrap 4.x or Bootstrap 3.x with file preview for various files, offers multiple selection, and more. The plugin allows you a simple way to setup an advanced file picker/upload control built to work specially with Bootstrap CSS3 styles. It enhances the file input functionality further, by offering support to preview a wide variety of files i.e. images, text, html, video, audio, flash, and objects. In addition, it includes AJAX based...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Audacity

    Audacity

    A free multi-track audio editor and recorder

    Audacity is a free, easy-to-use, multi-track audio editor and recorder for Windows, Mac OS X, GNU/Linux and other operating systems. Audacity is free software, developed by a group of volunteers and distributed under the GNU General Public License (GPL). Programs like Audacity are also called open source software, because their source code is available for anyone to study or use. There are thousands of other free and open source programs, including the Firefox web browser, the...
    Leader badge
    Downloads: 3,638 This Week
    Last Update:
    See Project
  • 23
    Bulk Image and Audio Merger

    Bulk Image and Audio Merger

    Easily pair images with audio file counterparts in bulk

    Automatically convert your audio files into YouTube-ready videos by pairing them with static images! Perfect for turning podcasts, audio episodes, or music tracks into video format in bulk. Simply match your audio files with corresponding images, and let the program handle the rest. Ideal for content creators who need to process multiple audio files without the hassle of manual video editing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    VCT - Video Converter & Transcoder

    VCT - Video Converter & Transcoder

    Transcode or convert files to H.264 or H.265 HEVC MKV/MP4 - FFmpeg GUI

    Open source video/audio converter GUI based on FFmpeg (included) Works natively for Windows and Linux (with mono-complete installed) Batch convert audio/video collection to H.264 or H.265 HEVC AAC/MP3, etc. Transcode MKV to MP4 or MP4/M4V to MKV in 1 minute. Rotate video Resize video Extract audio from movie clip Extract specific part of video/audio file Convert various audio files to MP3 or AAC Select audio stream to encode with video Add subtitle stream Easily create conversion tasks Pause/resume encoding queue Play files Record AV streams from Internet (Record tab) FFmpeg command is exposed so FFmpeg command is fully editable 64-bit version is now default. 32 bit and XP version as separate downloads For more details on usage, please check README Short video demonstration: https://user-images.githubusercontent.com/10812743/103661971-a568b900-4f6f-11eb-8c11-ac92ea41944b.mp4 NOTE: Windows will warn you about security. ...
    Leader badge
    Downloads: 69 This Week
    Last Update:
    See Project
  • 25
    MP3 Rocket

    MP3 Rocket

    YouTube to mp3 converter and downloader

    ...It's fast, free, and no registration is required. MP3 Rocket is a simple and to use YouTube to MP3 converter and video downloader. MP3 Rocket converts video to multiple audio or video formats and provides easy play back on computers or mobile devices. MP3 Rocket also allows users to turn any video into a MP3 ringtone, without sound quality loss. MP3 Rocket is more than a video downloader, it allows users to download HD videos and convert them to any common video format. The program is easy to use. ...
    Downloads: 129 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB
Gen AI apps are built with MongoDB Atlas
Atlas offers built-in vector search and global availability across 125+ regions. Start building AI apps faster, all in one place.