VocalsVocal Processing After Stem Extraction: Getting a Clean, Usable Sound
AI-extracted vocals need specific processing to sound their best. Here's a signal chain that works for most stem separation outputs.
An extracted vocal stem is not a finished vocal — it's a starting point. Even a high-quality extraction from Demucs will have characteristics that need addressing before the vocal sits comfortably in a new production. Here's a workflow that addresses the most common issues.
Understanding What You're Working With
The extracted vocal will have:
- Residual instrument bleed — ghost harmonics of the original backing track
- Potential phase artifacts from the separation process
- The exact room reverb and processing the original producer applied
- All the mic characteristics and recording quality of the original session
The last point is often a feature, not a bug — vintage vocal tones from classic recordings have a character that no modern plugin can convincingly replicate.
The Processing Chain
1. High-Pass Filter
First in the chain: a high-pass filter at 80–120Hz. Removing the low end eliminates bass bleed and clears space for your new bass elements without frequency masking.
2. Noise Gate
Set a noise gate with a medium attack and slow release to reduce the "ghost" of the backing track that appears between vocal phrases. Threshold around -40 to -50dB usually captures the gap between phrases without clipping the natural reverb tail of the voice.
3. De-Esser
Sibilance is often exaggerated during AI extraction because the frequency band (6–10kHz) where "s" and "sh" sounds live is where the AI sometimes introduces artifacts. A de-esser set at around 7kHz brings this under control.
4. EQ — Corrective Pass
Listen for any nasal buildup around 500–800Hz from bleed artifacts, and any harshness in the 2–4kHz range introduced by the extraction process. Use surgical narrow cuts (Q of 4–6) rather than broad strokes.
5. Compression
A gentle 4:1 compressor with a medium attack (20ms) keeps the dynamic range of the performance intact while controlling the loudest peaks. The goal is consistency, not pumping.
6. Reverb
Match your reverb to the new context. If the original had a lot of room reverb already embedded in the extraction, use a shorter room reverb and lower wet level than you normally would. If the extraction sounds surprisingly dry, a medium hall with a 1–1.5 second decay time works for most styles.
Try stem separation now
Upload any track and extract vocals, drums, bass and instruments in minutes.
Start splitting — free