Early beta — not for distribution

Crystal-clear voice.
Anywhere your audio goes.

One API request removes background noise, hum, traffic, keyboards, fans and crowd chatter from any voice stream. Plug it into your contact center, recording pipeline, transcription engine or voice AI — without rebuilding a thing.

Try the quickstart → See what it's for

Before Noisy audio

After One API request

Designed for teams that ship voice products

Real‑time

Live streaming & batch

Studio quality

Speech preserved, noise gone

Drop‑in

REST & WebSocket, any stack

Private by default

Never used for training

The problem

Background noise
silently breaks your product.

Every barking dog, traffic hum, keyboard click and crowd chatter that bleeds into your audio costs your team something: a frustrated customer, a misheard transcript, a refunded order, a missed sale. Most voice apps just live with it because building a real fix means months of audio research, GPUs, and edge cases.

Customers speak from anywhere

Cars, cafes, open-plan offices, factory floors. Your agents strain to hear and your transcripts miss critical details — through no fault of either.

Speech-to-text suffers

Transcription accuracy collapses on noisy phone audio. Your model isn't broken — the input is. Clean the audio first and downstream quality jumps with no model change.

Building this in-house is huge

Hiring audio ML engineers, training a model, deploying it on real-time infrastructure, then maintaining it forever. Ship customer value this afternoon instead.

Use cases

Built for any product that touches a microphone.

Drop the API into an existing voice flow without touching the rest of your stack. Streaming for live audio, batch for recordings — same API, same results.

Customer support & sales

Cut handle time. Agents stop saying "can you repeat that?". Recorded conversations become legible for QA. Higher CSAT, lower escalation rate.

Telemedicine

Doctors hear patient details the first time. Waiting-room chatter, traffic, breathing artifacts — gone. Cleaner recordings simplify clinical review.

Podcasts & UGC

Creators record from anywhere — bedrooms, hotel rooms, cafes. Your platform delivers broadcast-quality audio without asking them to buy a studio.

Conferencing & collab

Meetings sound like everyone's in the same quiet room. Dogs, doorbells, construction outside — suppressed in real time, no client install.

Voice AI & transcription

Drop in front of any speech-to-text or voice-agent stack. Cleaner input means dramatically higher accuracy on noisy real-world recordings.

Online learning

Students join from anywhere. Background TV, siblings, traffic — cleaned before the lecturer or the captioner has to deal with it.

What you get

Less friction.
Better customers.
Faster shipping.

Drop-in integration

Standard REST and WebSocket. No custom protocols, no proprietary clients, no audio engineering required. If you can curl an API, you can ship this.

Real-time, end‑to‑end

Stream 10-millisecond audio frames over WebSocket and get clean audio back in the same connection. Suitable for live conversations, voice agents, and interactive recording UIs.

Private by default

Audio is never used to train anything. Batch files are encrypted at rest and auto-deleted on a schedule you control. SOC 2 alignment in progress.

Pay only for clean seconds

Usage-based pricing on actual audio seconds processed — not seats, not connections, not minimums. Generous free tier for development and small teams.

How it works

Three steps. Two flavors. One bill.

You send audio. We clean it. You get audio back. Pick the mode that matches your product — live streaming for interactive flows, batch for files at rest. Both share the same authentication, the same pricing, and the same quality.

Send your audio

Stream raw audio frames over a secure WebSocket, or POST a file (WAV, MP3, FLAC, OGG) to a one-shot endpoint. Authenticated with a Bearer key.

Our engine cleans it

A purpose-built noise-cancellation engine isolates the human voice and suppresses everything else: hum, hiss, traffic, keyboards, fans, background chatter. Speech-aware, not just a filter.

You get clean audio back

Streaming: a clean frame for every frame you sent, in order. Batch: a URL to download the processed file when it's done. Same sample rate in, same out.

Streaming

Live audio, in and out, on a single WebSocket. Use it when latency matters: live conversations, voice agents, live captions.

✓ 10 ms PCM-16 frames in & out
✓ Bidirectional, single connection
✓ Per-tier concurrency budget

Batch

Upload a file, get a clean file back. Use it for recordings, voicemails, podcasts, archive processing.

✓ WAV, MP3, FLAC, OGG in
✓ Pre-signed direct upload
✓ Async with status polling
✓ Auto-delete on a schedule

Quickstart

Clean your first file in three steps.

You will need an API key. Request one here — we onboard beta partners individually.

Configure your key

Export your API key. Keep it server-side — never ship it in browser JavaScript or a mobile binary.

shell

# export your key once per shell
export NC_API_KEY="sk_your_api_key_here"
export NC_BASE_URL="https://api-noise-cancellation.us.tech"

Submit a job & upload audio

Ask the API for an upload URL. The response contains a pre-signed URL valid for 15 minutes — you upload directly to storage from anywhere with internet access.

# 1. Ask for an upload URL
RESP=$(curl -sS -X POST "$NC_BASE_URL/v1/enhance" \
  -H "authorization: Bearer $NC_API_KEY" \
  -H "content-type: application/json" \
  --data '{"content_type": "audio/wav"}')

JOB_ID=$(echo "$RESP" | jq -r .job_id)
UPLOAD_URL=$(echo "$RESP" | jq -r .upload_url)

# 2. Upload your audio directly to the pre-signed URL
curl -sS -X PUT "$UPLOAD_URL" \
  -H "content-type: audio/wav" \
  --data-binary @my-recording.wav

import os, requests

BASE = os.environ["NC_BASE_URL"]
KEY  = os.environ["NC_API_KEY"]
HEAD = {"authorization": f"Bearer {KEY}"}

# 1. Reserve a job & get an upload URL
r = requests.post(
    f"{BASE}/v1/enhance",
    headers={**HEAD, "content-type": "application/json"},
    json={"content_type": "audio/wav"},
    timeout=10,
)
r.raise_for_status()
job        = r.json()
job_id     = job["job_id"]
upload_url = job["upload_url"]

# 2. Upload your audio directly to storage
with open("my-recording.wav", "rb") as f:
    requests.put(
        upload_url,
        data=f,
        headers={"content-type": "audio/wav"},
        timeout=60,
    ).raise_for_status()

Start, poll, download

Kick the job, poll for completion, and download.

# 3. Start the job (queues for processing)
curl -sS -X POST "$NC_BASE_URL/v1/enhance/$JOB_ID/start" \
  -H "authorization: Bearer $NC_API_KEY"

# 4. Poll until done
while true; do
  STATE=$(curl -sS "$NC_BASE_URL/v1/jobs/$JOB_ID" \
    -H "authorization: Bearer $NC_API_KEY")
  STATUS=$(echo "$STATE" | jq -r .status)
  [ "$STATUS" = "completed" ] && break
  [ "$STATUS" = "failed" ]    && exit 1
  sleep 2
done

# 5. Download the cleaned audio
DL=$(echo "$STATE" | jq -r .download_url)
curl -sS -o cleaned.wav "$DL"

import time

# 3. Start the job
requests.post(
    f"{BASE}/v1/enhance/{job_id}/start",
    headers=HEAD, timeout=10,
).raise_for_status()

# 4. Poll until done
while True:
    state = requests.get(
        f"{BASE}/v1/jobs/{job_id}",
        headers=HEAD, timeout=10,
    ).json()
    if state["status"] in ("completed", "failed"):
        break
    time.sleep(2)

if state["status"] != "completed":
    raise RuntimeError(state.get("error_message"))

# 5. Download the cleaned audio
audio = requests.get(state["download_url"], timeout=60).content
with open("cleaned.wav", "wb") as f:
    f.write(audio)

That's the whole batch flow.

No queues to manage, no servers to provision, no audio-engineering background needed. Same shape works for short clips and long recordings alike, up to the per-file size limit.

Streaming

Live audio in & out, one WebSocket.

For live conversations and voice agents, open a WebSocket, send a one-line JSON config, wait for the server's ready message, then stream raw 10‑millisecond PCM-16 audio frames. You get a cleaned frame back for every frame you send, in order, on the same connection.

Endpoint

wss://api-noise-cancellation.us.tech/v1/stream

Audio format

Sample rate: 48 kHz
Format: PCM-16 little-endian
Channels: 1 (mono)
Frame size: 480 samples (10 ms)

Auth

Send your key as a header on the upgrade request, or append ?api_key=<key> for browsers.

stream.py

import asyncio, json, os, websockets

async def clean_stream(mic_frames):
    """Yield cleaned 10ms PCM-16 frames for each frame from `mic_frames`."""
    url     = "wss://api-noise-cancellation.us.tech/v1/stream"
    headers = {"authorization": f"Bearer {os.environ['NC_API_KEY']}"}

    async with websockets.connect(url, extra_headers=headers) as ws:
        # 1. Handshake: declare audio format, wait for `ready`.
        await ws.send(json.dumps({
            "type": "config", "sample_rate": 48000,
            "frame_ms": 10,    "format": "pcm16",
        }))
        ready = json.loads(await ws.recv())
        assert ready["type"] == "ready"   # server replies frame_bytes=960

        # 2. Stream: 960-byte PCM-16 frames in & out, in order.
        async def send_loop():
            async for frame in mic_frames:        # 960 bytes/frame
                await ws.send(frame)

        send_task = asyncio.create_task(send_loop())
        try:
            async for clean_frame in ws:          # clean PCM-16
                yield clean_frame
        finally:
            send_task.cancel()

Close codes

1000Normal close

1001Server going away (deploy)

1003Bad config or unsupported frame

1008Concurrency cap / idle timeout

1011Internal error (incl. backpressure)

4401Invalid or missing API key

1000/1001/1003/1008/1011 are the only forward-compatible codes you'll see; anything else from the underlying transport is normalized to 1011 so client code can stay simple.

API reference

Five endpoints. That's all of it.

Base URL: https://api-noise-cancellation.us.tech. All endpoints require Authorization: Bearer <key> except /healthz and /readyz.

POST /v1/enhance Reserve a job & get an upload URL

Request body

{
  "content_type": "audio/wav"
}

Accepted today: audio/wav (PCM-16, 8–48 kHz, mono or stereo — resampled to 48 kHz mono server-side). Additional container formats coming in a follow-up release.

Response (202)

{
  "job_id": "5aef129a-...",
  "upload_url": "https://...",
  "upload_expires_in_sec": 900,
  "content_type": "audio/wav"
}

POST /v1/enhance/{job_id}/start Queue the uploaded audio for processing

Path parameters

job_id — the id returned by /v1/enhance.

Errors

400 nothing uploaded yet
404 wrong owner / not found
409 already started / completed

Response (202)

{
  "job_id": "5aef129a-...",
  "status": "queued"
}

GET /v1/jobs/{job_id} Poll status & collect the download URL

Status values

pending_upload
queued
processing
completed
failed

Response (200)

{
  "job_id": "5aef129a-...",
  "status": "completed",
  "created_at": "2026-05-11T18:45:26Z",
  "completed_at": "2026-05-11T18:45:30Z",
  "download_url": "https://...",
  "download_expires_in_sec": 3600
}

WS /v1/stream Bidirectional real-time streaming

Each WebSocket message is a single 480-sample PCM-16 frame (960 bytes). Send and receive frames in any order; cleaned frames return in send order. See the streaming guide for example code.

GET /healthz · /readyz Liveness & readiness

No auth required. /readyz also reports backing-store health.

$ curl https://api-noise-cancellation.us.tech/readyz
{"status":"ok","service":"api-gateway","db":"ok","cache":"ok"}
# Degraded probes return 503 with status=degraded and the offending
# component reporting a non-`ok` state (`unreachable`, `not_initialized`).

Errors & limits

Predictable failure modes.

All errors return a JSON body shaped {"detail": "..."}. Authentication failures use HTTP 401 for REST and close code 4401 for WebSocket. 429 responses also carry a retry_after_sec field and a Retry-After response header (seconds, rounded up).

HTTP error codes

400Malformed request body

401Invalid / missing / revoked API key

404Job not found / not yours

409Job state conflict (already started / no upload)

413File exceeds 100 MB cap

422Field validation failed

429Rate limit (Retry-After header)

503Service temporarily unavailable

5xxServer error — retry with backoff

Rate limits by tier

Limit	Default	Pro
REST req/sec (sustained)	10	100
REST burst capacity	20	200
Concurrent streams / key	5	50
Max batch file size	100 MB
Artifact retention	30 days

REST limits use a token bucket per key: the sustained rate refills the bucket, the burst capacity absorbs short spikes. 100 MB ≈ 30 minutes of 48 kHz mono WAV. Higher tiers and custom envelopes are available on request.

FAQ

Common questions.

Will it work with my existing transcription / voice-AI stack?

Yes. We output the same sample rate, format and channel count we received, so we slot in front of any downstream speech-to-text, voice-agent, or recording pipeline without changing the rest of your stack.

Do you store my audio?

Streaming audio is never persisted — frames pass through and are dropped. Batch files are encrypted at rest, auto-deleted on a schedule you control (default 30 days), and never used to train any model. Your customer data stays yours.

How is this different from a normal high-pass / noise gate filter?

Traditional filters cut frequency bands — they remove some noise but also remove a lot of speech, and they fail completely on noise that overlaps with the voice (crowd chatter, keyboards, traffic). Our engine learned what human speech sounds like and isolates it, leaving voice intact while suppressing everything else, including non-stationary, voice-overlapping interference.

What's the latency?

Streaming is real-time: cleaned frames return on the same WebSocket connection as you send raw frames, designed to stay well under the latency threshold humans perceive as "delayed" in conversation. Batch jobs run as quickly as the audio allows.

How is pricing structured?

Usage-based, per second of audio processed. No seat fees, no per-connection fees, no minimums in beta. Volume discounts and committed-use pricing for high-volume workloads — talk to us.

Can I self-host?

On-premise and VPC-private deployments are available for enterprise customers with data-residency or HIPAA requirements. Reach out to discuss.

Which languages does it support?

It's language-agnostic: it isolates the human voice based on acoustic properties shared across languages, so it works on any spoken language without configuration. We have validated quality on English phone-codec audio and on a range of European languages.

Ready to ship cleaner audio?

Beta access is free for development and small production traffic. Tell us a little about what you're building and we'll get you a key.

Request a beta key → Read the quickstart

Crystal-clear voice. Anywhere your audio goes.

Background noisesilently breaks your product.

Built for any product that touches a microphone.

Customer support & sales

Telemedicine

Podcasts & UGC

Conferencing & collab

Voice AI & transcription

Online learning

Less friction.Better customers.Faster shipping.

Drop-in integration

Real-time, end‑to‑end

Private by default

Pay only for clean seconds

Three steps. Two flavors. One bill.

Send your audio

Our engine cleans it

You get clean audio back

Streaming

Batch

Clean your first file in three steps.

Configure your key

Submit a job & upload audio

Start, poll, download

Live audio in & out, one WebSocket.

Close codes

Five endpoints. That's all of it.

Predictable failure modes.

HTTP error codes

Rate limits by tier

Common questions.

Ready to ship cleaner audio?

Crystal-clear voice.
Anywhere your audio goes.

Background noise
silently breaks your product.

Less friction.
Better customers.
Faster shipping.