Clean video audio (dialogue)

A dialogue-first workflow: reduce background noise without losing consonant clarity or creating pumping.

What you’re optimizing for

Goal What it means Common mistake
Intelligibility Consonants and word onsets remain crisp (speech band ~1–4 kHz stays clean) Over-denoise smears consonants
Stable noise floor Background reduces without “breathing” between phrases Hard gating / aggressive thresholds
Natural voice tone Speaker still sounds like themselves Over-processing creates metallic tone

Typical video noise sources

  • wind (low-frequency bursts, broad turbulence)
  • traffic and ambience (non-stationary)
  • HVAC and fans (steady broadband)
  • camera handling and clothing rustle (transients)
  • room echo (reverb tail)

80/20 workflow

  1. Pick your “dialogue anchor”: find a representative section with speech + noise.
  2. Fix obvious hum: tonal issues first reduce later artifacts.
  3. Denoise conservatively: target the noise floor, not the voice.
  4. Check pumping: listen between words and at sentence ends.
  5. Deliver a clean file: export a dialogue track ready for your editor.

Dialogue clarity: what to preserve

Avoid the big 3 video-audio cleanup failures

1) Pumping between phrases

2) “Metallic” dialogue

3) Echo becomes more obvious

Delivery checklist (for editors)

FAQ

Can I fully remove crowd noise from video?

Crowd noise is non-stationary and overlaps speech. Aim to reduce masking and make dialogue dominant, not to delete all background without artifacts.

Should I denoise before editing?

Often yes: cleaning early helps you make better editorial decisions. For complex timelines, denoise key dialogue tracks and leave music/effects untouched.

Clean video audio now.

← Back to guides