Extract One Speaker from a Recording

Isolate the main speaker from everything else — other voices, music and noise — to get a focused single-voice track from a busy recording.

Pro: a premium AI tool, included with any paid plan.


How it works

The separation engine extracts the foreground voice and pushes background talkers, music and ambience down, leaving the target speaker as the clear centre of the recording.
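The Details section names Demucs as the engine. As a rough illustration only, the open-source Demucs command line has a two-stem mode that splits a file into a vocals track and everything else. The sketch below assumes Demucs is installed locally (`pip install demucs`); the function names and the `separated` output folder are hypothetical, and nothing here is confirmed to be how this tool runs the model. Two-stem mode separates voice from accompaniment, so suppressing competing talkers likely involves additional processing not shown here.

```python
import subprocess
from pathlib import Path


def build_demucs_command(input_path: str, out_dir: str = "separated") -> list[str]:
    """Build a Demucs CLI call that splits a file into vocals and the rest.

    Hypothetical helper: it only assembles the argument list.
    """
    return [
        "demucs",
        "--two-stems", "vocals",  # write a vocals stem and a no_vocals stem only
        "-o", out_dir,            # folder where Demucs writes its output
        str(Path(input_path)),
    ]


def extract_voice(input_path: str, out_dir: str = "separated") -> None:
    """Run Demucs on one file; assumes the `demucs` command is on PATH."""
    subprocess.run(build_demucs_command(input_path, out_dir), check=True)
```

Calling `extract_voice("interview.mp3")` would run the separation locally, with the vocals stem written under the chosen output folder.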

What it's good for

Pulling one host from a noisy room
Focusing a presenter over a crowd
Isolating dialogue from ambience
Prepping a clean voice for cloning

Details

Engine: Demucs
Formats: MP3, WAV, M4A, FLAC, OGG, AAC
Price: Paid plans
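For anyone scripting uploads, a quick pre-check against the audio formats listed above can catch unsupported files early. This is a minimal sketch: the names `SUPPORTED_AUDIO_EXTENSIONS` and `is_supported_audio` are hypothetical, the check looks only at the file extension, and it covers only the audio formats listed in this section.

```python
from pathlib import Path

# Extensions taken from the Formats line above; the constant name is illustrative.
SUPPORTED_AUDIO_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg", ".aac"}


def is_supported_audio(filename: str) -> bool:
    """Return True if the file extension matches a listed audio format."""
    return Path(filename).suffix.lower() in SUPPORTED_AUDIO_EXTENSIONS
```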

Frequently asked questions

How does this differ from background-noise removal?

Denoising removes non-speech noise but leaves other voices and music. Target-speaker extraction also removes competing voices and music, keeping only the main speaker.

Do I need to provide a reference of the speaker?

Not today. The tool extracts the dominant foreground voice automatically. Reference-guided extraction is planned for a future update.

Will quiet background voices fully disappear?

They're strongly reduced. A background voice as loud as the target is the hardest case and may leave faint traces.

How does this differ from speaker separation?

Speaker separation untangles two people talking over each other on an otherwise clean track, while this tool isolates one voice from a full mix of other talkers, music and ambient noise.

Is the extracted voice clean enough for voice cloning?

It produces a focused single-voice track that works well as cloning input, though a short, naturally clean recording will always beat a heavily processed extraction.

Does it remove background music as well as voices?

Yes, music and ambience are pushed down along with competing talkers, so the target speaker is left as the clear foreground of the recording.

How long does extraction take?

Most clips finish within a minute, with processing time tied to the length of the recording rather than how crowded the background is.

Related tools

Vocal Isolation (Extract Voice)

Pull a clean vocal out of a full mix, separating the singer or speaker …

Vocal Remover (Karaoke)

Strip the lead vocal out of a song to make an instrumental or karaoke …

Stem Separation (Drums/Bass/Vocals)

Break a finished song back into its parts — vocals, drums, bass and other …

Remove Background Music from Speech

When a voiceover, interview or clip has music underneath, this tool removes the music …

Speaker Separation

When two people talk over each other on one track, this tool pulls the …

Background Noise Removal

Strip steady and shifting background noise — air conditioning, fans, street hum, room tone …

