Showing 131 open source projects for "audio convert"

View related business solutions
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • $300 in Free Credit for Your Google Cloud Projects Icon
    $300 in Free Credit for Your Google Cloud Projects

    Build, test, and explore on Google Cloud with $300 in free credit. No hidden charges. No surprise bills.

    Launch your next project with $300 in free Google Cloud credit—no hidden charges. Test, build, and deploy without risk. Use your credit across the Google Cloud platform to find what works best for your needs. After your credits are used, continue building with free monthly usage products. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    HandBrake

    HandBrake

    A open source video to convert video from any format to modern codecs

    HandBrake is an open-source, GPL-licensed, multiplatform, multithreaded video transcoder, available for MacOS X, Linux and Windows.
    Downloads: 233 This Week
    Last Update:
    See Project
  • 2
    OpenAI.fm

    OpenAI.fm

    Code for openai.fm, a demo for the OpenAI Speech API

    OpenAI.fm is an official interactive demo application built to showcase the OpenAI Speech API and its advanced text-to-speech capabilities, providing developers and creators with a hands-on web interface to convert text into high-quality, customizable audio using state-of-the-art TTS models. Developed using Next.js and the OpenAI Speech API, this demo illustrates how the latest neural voice models can produce natural, expressive speech with adjustable styles and voices, highlighting features like emotional range, tone, and real-time playback. ...
    Downloads: 246 This Week
    Last Update:
    See Project
  • 3
    idonthavespotify

    idonthavespotify

    Effortlessly convert Spotify links to your preferred streaming service

    Copy a link from your favorite streaming service, paste it into the search bar, and voilà! Links to the track on all other supported platforms are displayed. If the original source is Spotify you'll even get a quick audio preview to ensure it's the right track.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    abogen

    abogen

    Generate audiobooks from EPUBs, PDFs and text with captions

    abogen is a tool designed to generate audiobooks (or speech narrations) from textual sources such as EPUBs, PDFs, or plain text, with synchronized captions. In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio. This can be very useful for accessibility, content consumption on...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 99.99% Uptime for MySQL and PostgreSQL on Google Cloud Icon
    99.99% Uptime for MySQL and PostgreSQL on Google Cloud

    Enterprise Plus edition delivers sub-second maintenance downtime and 2x read/write performance. Built for critical apps.

    Cloud SQL Enterprise Plus gives you a 99.99% availability SLA with near-zero downtime maintenance—typically under 10 seconds. Get 2x better read/write performance, intelligent data caching, and 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server with built-in vector search for gen AI apps. New customers get $300 in free credit.
    Try Cloud SQL Free
  • 5
    ebook2audiobook

    ebook2audiobook

    Generate audiobooks from e-books, voice cloning & 1107+ languages

    ebook2audiobook is a tool to convert legally obtained eBooks (non-DRM) into fully narrated audiobooks, complete with chapters and metadata. It automates the pipeline: it reads the eBook file, splits it into appropriate segments (chapters, paragraphs), uses text-to-speech (TTS) models to synthesize audio, optionally applies voice cloning, and outputs a final audiobook — ideal for people who prefer listening over reading, or for accessibility purposes.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 6
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    OpenAudible

    OpenAudible

    Audiobook Manager for Audible Users

    OpenAudible is a cross-platform audiobook manager designed for Audible users. Manage/Download all your audiobooks with this easy-to-use desktop application. Say goodbye to the hassle of managing your audiobooks across multiple devices. With OpenAudible, you can easily download, view, and manage all your Audible books in one place. Our lightning-fast conversion to MP3 and M4B audio formats makes it easy to enjoy your favorite books on any device. Plus, our automation features make it a breeze...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    Qwen3-ASR

    Qwen3-ASR

    Qwen3-ASR is an open-source series of ASR models

    Qwen3-ASR is an automatic speech recognition system in the QwenLM family, developed to convert spoken language into text with strong accuracy and real-time performance. As a specialized ASR variant of the broader Qwen language model ecosystem, it focuses on capturing reliable transcriptions from audio sources such as recordings, live streams, or conversational inputs while supporting low latency use cases. The architecture combines advanced neural acoustic modeling with context-aware language prediction so that outputs maintain both fidelity to the original speech and grammatical coherence. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 9
    ffmpeg.wasm

    ffmpeg.wasm

    FFmpeg for browser, powered by WebAssembly

    ffmpeg.wasm is a pure WebAssembly (and JavaScript/TypeScript) port of FFmpeg that enables in-browser media recording, conversion, and streaming—letting developers perform video/audio processing entirely client-side without server uploads. Transpiled via Emscripten from FFmpeg and its codecs into WebAssembly. Supports both single-threaded and multi-threaded cores using web workers. Written in TypeScript for improved developer experience.
    Downloads: 12 This Week
    Last Update:
    See Project
  • Cut Data Warehouse Costs up to 54% with BigQuery Icon
    Cut Data Warehouse Costs up to 54% with BigQuery

    Migrate from Snowflake, Databricks, or Redshift with free migration tools. Exabyte scale without the Exabyte price.

    BigQuery delivers up to 54% lower TCO than cloud alternatives. Migrate from legacy or competing warehouses using free BigQuery Migration Service with automated SQL translation. Get serverless scale with no infrastructure to manage, compressed storage, and flexible pricing—pay per query or commit for deeper discounts. New customers get $300 in free credit.
    Try BigQuery Free
  • 10
    SoX is the Swiss Army Knife of sound processing utilities. It can convert audio files to other popular audio file types and also apply sound effects and filters during the conversion.
    Leader badge
    Downloads: 19,115 This Week
    Last Update:
    See Project
  • 11
    wa-automate-nodejs

    wa-automate-nodejs

    WhatsApp tool for chatbots with advanced features

    wa-automate-nodejs is the most advanced NodeJS library which provides a high-level API to control WA. Want to convert your WA account to an API instantly? You can now with the CLI. For more details see Easy API. After executing create() function, @open-wa/wa-automate will create an instance of WA web. If you are not logged in, it will print a QR code in the terminal. Scan it with your phone and you are ready to go! @open-wa/wa-automate will remember the session so there is no need to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Audiblez

    Audiblez

    Generate audiobooks from e-books

    Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13
    clone-voice

    clone-voice

    A sound cloning tool with a web interface, using your voice

    ...The tool supports around sixteen languages, including Chinese, English, Japanese, Korean, French, German, Italian, and others, and can capture reference voices directly from a microphone or from uploaded audio.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 14
    XLD

    XLD

    A tool for transcoding lossless audio files

    X Lossless Decoder(XLD) is a tool for Mac OS X that is able to decode/convert/play various 'lossless' audio files. The supported audio files can be split into some tracks with cue sheet when decoding. It works on Mac OS X 10.4 and later.
    Leader badge
    Downloads: 4,428 This Week
    Last Update:
    See Project
  • 15
    yt-dlp

    yt-dlp

    A youtube-dl fork with additional features and fixes

    yt-dlp is a youtube-dl fork based on the now inactive youtube-dlc. The main focus of this project is adding new features and patches while also keeping up to date with the original project
    Downloads: 378 This Week
    Last Update:
    See Project
  • 16
    Kid3 Tag Editor

    Kid3 Tag Editor

    Audio Tag Editor

    Kid3 audio tag editor can edit the tags of MP3, Ogg, FLAC, MPC & WMA files in an efficient way, convert between ID3v1 and ID3v2, set the tags of multiple files, generate tags from file names or vice versa and import from freedb, MusicBrainz and Disco
    Leader badge
    Downloads: 2,075 This Week
    Last Update:
    See Project
  • 17
    Burn

    Burn

    Simple but advanced disc burning for macOS

    Create data discs with advanced data settings. Create video and audio discs, Burn will convert if needed. Create DVD-Video discs. Recreate discs. Burn doesn't reinvent the wheel, it uses many powerful open source Unix utilities and is also open source.
    Leader badge
    Downloads: 2,376 This Week
    Last Update:
    See Project
  • 18
    Krajee

    Krajee

    An enhanced HTML 5 file input for Bootstrap 5.x/4.x./3.x

    An enhanced HTML 5 file input for Bootstrap 5.x or Bootstrap 4.x or Bootstrap 3.x with file preview for various files, offers multiple selection, and more. The plugin allows you a simple way to setup an advanced file picker/upload control built to work specially with Bootstrap CSS3 styles. It enhances the file input functionality further, by offering support to preview a wide variety of files i.e. images, text, html, video, audio, flash, and objects. In addition, it includes AJAX based...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Audacity

    Audacity

    A free multi-track audio editor and recorder

    Audacity is a free, easy-to-use, multi-track audio editor and recorder for Windows, Mac OS X, GNU/Linux and other operating systems. Audacity is free software, developed by a group of volunteers and distributed under the GNU General Public License (GPL). Programs like Audacity are also called open source software, because their source code is available for anyone to study or use. There are thousands of other free and open source programs, including the Firefox web browser, the...
    Leader badge
    Downloads: 3,638 This Week
    Last Update:
    See Project
  • 20
    MP3 Rocket

    MP3 Rocket

    YouTube to mp3 converter and downloader

    ...It's fast, free, and no registration is required. MP3 Rocket is a simple and to use YouTube to MP3 converter and video downloader. MP3 Rocket converts video to multiple audio or video formats and provides easy play back on computers or mobile devices. MP3 Rocket also allows users to turn any video into a MP3 ringtone, without sound quality loss. MP3 Rocket is more than a video downloader, it allows users to download HD videos and convert them to any common video format. The program is easy to use. ...
    Downloads: 129 This Week
    Last Update:
    See Project
  • 21
    EasyABC

    EasyABC

    EasyABC is an open source ABC editor

    EasyABC allows the user to create, edit, view, play, convert music written in the ABC music notation language. The program was originally written in Python 2.7 and WxPython by Nils Liberg and runs on Windows, OSX, and Linux. Jan Wybren de Jong has converted to run on Python 3.8 or higher. Frédéric Aupépin has been supporting EasyABC on OSX. EasyABC depends upon other external programs like abc2midi, abcm2ps, fluidsynth. If you install the Windows or Mac executables most of these programs...
    Leader badge
    Downloads: 276 This Week
    Last Update:
    See Project
  • 22
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3,...
    Leader badge
    Downloads: 105 This Week
    Last Update:
    See Project
  • 23
    4allDownloader Converter

    4allDownloader Converter

    Video,audio&Files Downloader&Convert with built-in browser with AI.

    The most advanced downloader & converter tool that combines the power of leading open-source technologies under an intuitive GUI. Download videos, Audio, from 10000+ of sites, and also files from any platform with advanced format conversion capabilities. Features a built-in browser with JavaScript injection support that remembers logins while maintaining complete privacy. Five powerful tabs streamline your workflow: Home tab for pasting URLs, channels, playlists, and direct search; Browser...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 24
    IMYplay

    IMYplay

    Plays iMelody (IMY) files using many sound systems

    IMYplay is a player for iMelody (IMY) ringtones / files (https://en.wikipedia.org/wiki/IMelody). IMYplay can use the following outputs to produce sound: 1. The Allegro game library (http://alleg.sf.net), 2. SDL (Simple DirectMedia Layer; http://www.libsdl.org), 3. ALSA (Advanced Linux Sound Architecture; http://alsa-project.org), 4. OSS (Open Sound System), 5. Libao (http://xiph.org/ao), 6. PortAudiov19 (http://www.portaudio.com), 7. PulseAudio (http://www.pulseaudio.org), 8....
    Downloads: 10 This Week
    Last Update:
    See Project
  • 25
    AudiooPy

    AudiooPy

    Audio manager in Python Object-Oriented Programming

    AudiooPy stands for "Audio Manager in Python Object-Oriented Programming." This library provides a range of useful operations for sound files and audio fragments. It processes audio at the frame level, working with signed integer samples of 8, 16, or 32 bits, stored in byte-like objects. Key features include: - Reading and writing WAV files using Python's standard library. - A scientifically validated method for automatically detecting sound segments in speech. -...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB
Gen AI apps are built with MongoDB Atlas
Atlas offers built-in vector search and global availability across 125+ regions. Start building AI apps faster, all in one place.