Home

Welcome to the ai-coustics real-time speech enhancement SDK for Python.

This package provides Python bindings and binaries for high-quality, low-latency neural audio enhancement.

Highlights:

  • Real-time processing optimized for streaming
  • Multiple model sizes: QUAIL_L, QUAIL_S, QUAIL_XS, QUAIL_XXS
  • STT-optimized models: QUAIL_STT_L16, QUAIL_STT_L8, QUAIL_STT_S16, QUAIL_STT_S8, QUAIL_VF_STT_L16
  • Simple, Pythonic API with context-manager support
  • Built-in Voice Activity Detection (VAD) powered by the Quail model family
  • Multiple processing layouts: planar, interleaved, and sequential

Quick example:

import os
import numpy as np
from dotenv import load_dotenv
from aic import Model, AICModelType, AICParameter

load_dotenv()
license_key = os.getenv("AIC_SDK_LICENSE", "")

with Model(AICModelType.QUAIL_L, license_key=license_key, sample_rate=48000, channels=1, frames=480) as model:
    model.set_parameter(AICParameter.ENHANCEMENT_LEVEL, 0.8)

    audio = np.random.randn(1, 480).astype(np.float32)
    enhanced = model.process(audio)

Use the navigation to learn how to get started and explore the full API.