Multi-Track Audio Mixer With Auto Ducking
Balance dialog, music, and effects on a real mixer. Per-track levels, pan, solo, mute, and AI-powered ducking that drops music under voice automatically.
What it is and why it matters
Most video projects fail their audio mix before they fail their picture. Music drowns the voiceover, dialog peaks into distortion, sound effects sit at the wrong volume relative to ambience, and the final export has uneven loudness across the cut. Skrrol AI's audio mixer is built to fix that — with the same channel-strip layout audio engineers expect, plus modern features like AI-powered ducking that automatically drops the music bed every time someone speaks. Each track on the timeline gets its own channel strip with a fader, pan knob, solo, mute, and meter, and the master bus gives you a final loudness check before export.
The mixer is more than a volume utility. Smart ducking listens to the dialog track and pulls the music bed down by a configurable amount whenever speech is detected, then lifts it back up cleanly when speech stops, an effect that otherwise takes careful manual keyframing. Sends and routing let you bus reverb to a single processor instead of running one per track. The master meter shows true peak (in dBTP) and integrated loudness (in LUFS) so you can hit the target level for YouTube (-14 LUFS), EU broadcast (-23 LUFS), or wherever your final delivery lives. Whether your project is a single talking-head video or a music-driven short film with twenty audio tracks, the mixer scales to it.
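To make the ducking behavior concrete, here is a minimal sketch of a ducking envelope. It assumes a speech detector has already produced one flag per analysis frame; the function name `duckingGainDb` and the `attackFrames` and `releaseFrames` parameters are illustrative and are not taken from the Skrrol editor.

```typescript
// Hypothetical sketch of a ducking envelope, not Skrrol's actual implementation.
// Input: one boolean per analysis frame saying whether speech was detected.
// Output: the gain offset (in dB) to apply to the music bed for each frame.

interface DuckOptions {
  duckAmountDb: number;  // how far to pull the music down, e.g. 10
  attackFrames: number;  // frames taken to reach the full duck
  releaseFrames: number; // frames taken to recover to 0 dB
}

function duckingGainDb(speech: boolean[], opts: DuckOptions): number[] {
  const { duckAmountDb, attackFrames, releaseFrames } = opts;
  const downStep = duckAmountDb / Math.max(1, attackFrames);
  const upStep = duckAmountDb / Math.max(1, releaseFrames);
  const out: number[] = [];
  let gain = 0; // 0 dB means the music sits at its fader level
  for (const active of speech) {
    gain = active
      ? Math.max(-duckAmountDb, gain - downStep) // ramp down under speech
      : Math.min(0, gain + upStep);              // ramp back up in silence
    out.push(gain);
  }
  return out;
}
```

A short attack keeps the first word of a sentence clear of the music, while a longer release avoids audible pumping between phrases.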
How it works
1. Add audio tracks to the timeline
Each audio clip you drop creates a track. Add as many as your project needs: dialog, music, sound effects, room tone, voiceover.
2. Open the Mixer panel
Click the Mixer tab. Each timeline track shows up as a channel strip with fader, pan, meter, mute, and solo controls.
3. Set base levels
Set dialog to a comfortable speaking level (peaks between -12 and -6 dB), music between -24 and -18 dB, and sound effects relative to dialog.
4. Enable ducking on the music track
Right-click the music channel, choose Sidechain to Dialog, and set the duck amount (typically 9 to 12 dB). The music bed then dips automatically under voice.
5. Pan and balance
Use the pan knob to spread sound effects in the stereo field, keep dialog centered, and place ambience slightly left or right for width.
6. Watch master loudness and export
Check the master LUFS meter, target -14 LUFS for YouTube or your delivery spec, then export. The mix renders into the final video file.
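The levels and duck amount in the steps above are given in decibels, while a digital gain stage multiplies samples by a linear factor. The sketch below shows the standard conversion (gain = 10^(dB/20)) and how a duck amount stacks on a fader level. The helper names are illustrative assumptions, not part of the editor.

```typescript
// Convert a level in decibels to a linear amplitude multiplier.
function dbToGain(db: number): number {
  return Math.pow(10, db / 20);
}

// Convert a linear amplitude multiplier back to decibels.
function gainToDb(gain: number): number {
  return 20 * Math.log10(gain);
}

// Hypothetical helper: effective linear gain of the music bed while ducked.
// Decibel offsets add, so a -20 dB fader with a 10 dB duck sits at -30 dB.
function duckedMusicGain(faderDb: number, duckAmountDb: number): number {
  return dbToGain(faderDb - duckAmountDb);
}
```

For example, a 6 dB reduction roughly halves the amplitude, and a 20 dB reduction scales it to one tenth.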
Benefits
Per-track channel strips
Fader, pan, solo, mute, and meter for every audio track — the layout audio engineers already know.
AI-powered ducking
Music drops automatically under voice and lifts back when speech stops, no manual keyframing required.
LUFS-accurate metering
True peak and integrated loudness meters help you hit YouTube, podcast, or broadcast delivery targets.
Stereo placement
Pan controls and stereo width let you spread ambience and effects across the field without smearing dialog.
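As an illustration of what a pan knob does under the hood, here is a sketch of the widely used equal-power pan law. Whether the mixer uses exactly this law is an assumption; `equalPowerPan` is an illustrative name.

```typescript
// Equal-power pan: pan = -1 is hard left, 0 is center, +1 is hard right.
// Assumption: this is a common pan law, not necessarily the one the mixer uses.
function equalPowerPan(pan: number): { left: number; right: number } {
  const p = Math.min(1, Math.max(-1, pan)); // clamp to the valid range
  const angle = ((p + 1) * Math.PI) / 4;    // maps [-1, 1] to [0, PI/2]
  return { left: Math.cos(angle), right: Math.sin(angle) };
}
```

With this law the combined power of the two channels stays constant as a source moves across the field, so a centered source plays about 3 dB down in each speaker rather than at full level in both.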
Who uses it
Podcast video creators
Two or three voice tracks, intro music, and sting effects mixed cleanly with auto-ducking on the music bed.
YouTube creators
Voiceover, B-roll music, and on-camera audio balanced so the channel sounds consistent across every video.
Course and tutorial producers
Narration sits on top of background music with predictable, hands-off ducking for every lesson.
Documentary editors
Interview audio, archival sound, ambience, and score mixed on a real channel-strip layout with stem export.
Short film teams
Dialog, foley, ambience, and music balanced to LUFS targets for festival or platform delivery specs.
Frequently asked questions
How many audio tracks can I run?
There's no fixed cap. Track count is bounded by your device's memory and CPU; a modern laptop can typically handle 20 or more stereo tracks with effects.
Does ducking work on multiple voice tracks?
Yes. Sidechain the music channel to any single dialog channel, or route all dialog channels to a dialog bus and sidechain to that bus so the music ducks under every speaker.
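The dialog-bus idea can be sketched as combining the speech activity of several tracks into one sidechain key. This assumes each dialog track already provides a per-frame speech flag, which is a hypothetical input; the name `dialogBusKey` is also illustrative.

```typescript
// Hypothetical sketch: merge per-track speech flags into a single sidechain key.
// The key is true for a frame whenever at least one dialog track has speech.
function dialogBusKey(tracks: boolean[][]): boolean[] {
  const frames = Math.max(0, ...tracks.map((t) => t.length));
  const key: boolean[] = [];
  for (let i = 0; i < frames; i++) {
    // Tracks shorter than the longest one count as silent past their end.
    key.push(tracks.some((t) => t[i] === true));
  }
  return key;
}
```

Driving the duck from the merged key means the music stays down while speakers overlap or hand off, instead of bouncing between each voice.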
Can I export individual stems?
Yes. Export Stems writes each track or bus to its own audio file — useful for handing off to a dedicated audio engineer.
What loudness target should I aim for?
YouTube, Spotify, and many other streaming platforms normalize to around -14 LUFS integrated. Broadcast TV in the EU targets -23 LUFS (EBU R 128); in the US it targets -24 LKFS (ATSC A/85).
Is there a built-in limiter on the master?
Yes. The master bus has a true-peak limiter to prevent clipping, with a configurable ceiling (-1 dBTP is the safe default).
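The loudness target and the limiter ceiling interact: raising the master to reach a target also raises the true peak by the same number of decibels. The sketch below works out how much static gain fits under the ceiling. It treats integrated loudness as shifting one-for-one with gain, which is a close approximation, and `planMasterGain` is an illustrative name rather than an editor function.

```typescript
interface LoudnessPlan {
  gainDb: number;         // static gain to apply on the master
  predictedLufs: number;  // approximate integrated loudness after that gain
  limitedByPeak: boolean; // true if the ceiling stopped the gain short of target
}

// Hypothetical helper: choose a master gain that approaches the loudness target
// without pushing the measured true peak above the ceiling.
function planMasterGain(
  measuredLufs: number,
  truePeakDbtp: number,
  targetLufs: number,
  ceilingDbtp: number = -1
): LoudnessPlan {
  const wanted = targetLufs - measuredLufs;   // gain needed to hit the target
  const headroom = ceilingDbtp - truePeakDbtp; // gain available under the ceiling
  const gainDb = Math.min(wanted, headroom);
  return {
    gainDb,
    predictedLufs: measuredLufs + gainDb,
    limitedByPeak: wanted > headroom,
  };
}
```

When `limitedByPeak` is true, the remaining loudness has to come from the limiter or from compressing the loudest passages, not from more static gain.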
Does mixing happen in the cloud?
No. The mix is processed locally in your browser with the Web Audio API, so your audio stays private and mixing works offline.
Related editor features
Parametric EQ — Surgical Tone-Shaping In The Browser
Sculpt voice and music with a real parametric EQ. Multi-band cuts and boosts, visual frequency response, and presets for dialog clarity and rumble removal.
AI Noise Reduction — Clean Voice From Any Recording
Strip hum, hiss, fan noise, and room tone from any audio. AI-powered spectral denoise plus a manual noise gate, both running locally in your browser.
Audio Scrubbing — Hear Your Cut While You Scrub
Drag the playhead and hear the audio in real time. Frame-accurate audio scrub, J-K-L transport keys, and jog-wheel feel for finding sync points fast.
Multi-Track Timeline — The Way Pro Editors Cut
Unlimited layered video and audio tracks, ripple and roll edits, three-point editing, J/L cuts, and nested sequences. The pro NLE timeline, in your browser.
Try it in the Skrrol AI editor
Skrrol is a browser-native video studio. Open the editor in your browser, drop in your media, and use this feature alongside the rest of the timeline. Free, no install, your files stay on your device.