Browse free open source Python Sound/Audio Software and projects below. Use the toggles on the left to filter open source Python Sound/Audio Software by OS, license, language, programming language, and project status.

  • Auth0 for AI Agents now in GA Icon
    Auth0 for AI Agents now in GA

    Ready to implement AI with confidence (without sacrificing security)?

    Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
    Start building today
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • 1
    OpenDAFF

    OpenDAFF

    Directional Audio File Format

    OpenDAFF is a free, open-source software package for directional audio data - like the directivity of microphones, speakers, as well as head-related transfer functions (HRTFs)
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Voice keyboard/dictation. Aims to be a total substitute for a keyboard. Spell out words letter by letter (using code: alpha, bravo, ..). Arrow keys, modifiers work. Speak whole words (but whole word accuracy is not good). Attach commands to some word
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3

    pydatascope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    A python-based calculator that uses the Sabine, Norris-Eyring and Fitzroy formulae to calculate the RT60 time of a room with given specifications. Material list from SAE Institute reference.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Failed Payment Recovery for Subscription Businesses Icon
    Failed Payment Recovery for Subscription Businesses

    For subscription companies searching for a failed payment recovery solution to grow revenue, and retain customers.

    FlexPay’s innovative platform uses multiple technologies to achieve the highest number of retained customers, resulting in reduced involuntary churn, longer life span after recovery, and higher revenue. Leading brands like LegalZoom, Hooked on Phonics, and ClinicSense trust FlexPay to recover failed payments, reduce churn, and increase customer lifetime value.
    Learn More
  • 5
    The Carolina Computer Assistive Technolog group at UNC-Chapel Hill focuses on the development of enabling technologies. We are currently maintaining the pyTTS package here, but our older work is still available for (unsupported) use.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    A.L.V.I. e' nato per essere un semplice ma modulare Bot, in grado di interagire con l'essere umano attraverso il linguaggio naturale ed eseguire svariati compiti, come leggere ad alta voce Mail, notizie, Feeds. Tutto in Italiano!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ASR-Builder provides an easy-to-use interface to the HTK toolkit, that allows users to build ASR systems. ASR-Builder provides a platform that performs house-keeping tasks when using HTK and also provides default training/testing/recognition scripts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    AGTK is a suite of software components for building tools for annotating linguistic signals, time-series data which documents any kind of linguistic behavior (e.g. audio, video). The internal data structures are based on annotation graphs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    AudioSavant is a music visualization software that is aimed to allow students to get a better understanding of music. It is based in the OLPC's TamTam application, with a set of modifications from the information visualization world.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Dun and Bradstreet Risk Analytics - Supplier Intelligence Icon
    Dun and Bradstreet Risk Analytics - Supplier Intelligence

    Use an AI-powered solution for supply and compliance teams who want to mitigate costly supplier risks intelligently.

    Risk, procurement, and compliance teams across the globe are under pressure to deal with geopolitical and business risks. Third-party risk exposure is impacted by rapidly scaling complexity in domestic and cross-border businesses, along with complicated and diverse regulations. It is extremely important for companies to proactively manage their third-party relationships. An AI-powered solution to mitigate and monitor counterparty risks on a continuous basis, this cutting-edge platform is powered by D&B’s Data Cloud with 520M+ Global Business Records and 2B+ yearly updates for third-party risk insights. With high-risk procurement alerts and multibillion match points, D&B Risk Analytics leverages best-in-class risk data to help drive informed decisions. Perform quick and comprehensive screening, using intelligent workflows. Receive ongoing alerts of key business indicators and disruptions.
    Learn More
  • 10
    The Blind Audio Tactile Mapping System (BATS) attempts to address the lack of spatial information available for visually impaired students.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DTMF detector library and/or application that reads in the specified audio file and returns/outputs whether DTMF detected and/or list of detected digits. May have GUI and command line interfaces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Ouroborus is an artificial life framework for mobile agents on a background of cellular automata. It can be used to teach and research topics such as population genetics, ecology and evolution. The demo includes a curses view and live Csound audio.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    STASS stands for Sound Treatment, Analysis and Synthesis Software. It will provide ingeneers and searchers easy-to-use as well as performant tools for sound analysis (spectro, sona and so on), treatment (filters, noise-gate) and synthesis (scepstrum).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Signal Ranger DSP Linux Support Tools
    This project delivers Linux support (USB driver and Tools) for a Digital Signal Processor (DSP) cards of the Signal Ranger series, see http://www.softdb.com/product_dsp.shtml for details) connected to a Linux host via USB.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Transana is a cross-platform tool for the transcription and qualitative analysis of audio and video data. It also provides the ability to identify and organize analytically interesting portions of videos, as well as attach keywords to those video clips.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Function graph drawing/rendering application for use with audio applications. Exports .wav files. Dependencies: wxPython & Numeric(not in 1.10)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    allTunes is an audible interface for the iTunes for Windows media player. This software is targetted mainly for use of by blind or visually-impaired users.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Software to fit whole-sentence language models using the principle of maximum entropy. For developers of speech recognizers, text prediction interfaces, OCR, machine translation software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A collection of software made by Milos Rancic.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    pyespeak

    Python to eSpeak speech synthesis

    ctypes Python module for eSpeak http://espeak.sf.net speech synthesis
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    pyscope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually. It has been renamed "pdatascope" to avoid a name-clash with Pyscope, a scoping package on PyPi. See https://sourceforge.net/p/pydatascope/ for the latest code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    pysoundanalyser

    a python program to generate, visualize, and manipulate short sounds

    pysoundanalyser is a Python application that can be used to generate, visualize, and manipulate short sounds through a graphical user interface. Visualization functions include visualization of the power spectrum, the spectrogram, the autocorrelation, and the autocorrelogram of a sound. Manipulation functions include filtering, concatenating, cutting, and scaling the level of a sound. Several types of sounds can also be generated including, pure tones, harmonic complex tones, noise of different colours, frequency modulated and amplitude modulated tones.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next