AUDIO

Multi-Track Audio Mixer With Auto Ducking

Balance dialog, music, and effects on a real mixer. Per-track levels, pan, solo, mute, and AI-powered ducking that drops music under voice automatically.

What it is and why it matters

Most video projects fail their audio mix before they fail their picture. Music drowns the voiceover, dialog peaks into distortion, sound effects sit at the wrong volume relative to ambience, and the final export has uneven loudness across the cut. Skrrol AI's audio mixer is built to fix that — with the same channel-strip layout audio engineers expect, plus modern features like AI-powered ducking that automatically drops the music bed every time someone speaks. Each track on the timeline gets its own channel strip with a fader, pan knob, solo, mute, and meter, and the master bus gives you a final loudness check before export.

The mixer is more than just a volume utility. Smart ducking listens to the dialog track and pulls the music bed down by a configurable amount whenever speech is detected, then lifts it back up cleanly when speech stops — the same effect podcast producers spend years dialing in by hand. Sends and routing let you bus reverb to a single processor instead of one per track. The master meter shows true peak and integrated loudness in LUFS so you can hit the target levels for YouTube (-14 LUFS), broadcast (-23 LUFS), or wherever your final delivery lives. Whether your project is a single talking-head video or a music-driven short film with twenty audio tracks, the mixer scales to it.
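As a rough illustration of the ducking behavior described above, the following is a minimal sketch and not Skrrol's actual implementation. It assumes speech detection has already produced one true/false flag per analysis frame. The names `duck_gains`, `attack`, and `release` are hypothetical.

```python
def db_to_gain(db: float) -> float:
    """Convert a level change in decibels to a linear amplitude multiplier."""
    return 10.0 ** (db / 20.0)


def duck_gains(speech_active, duck_db=-10.0, attack=0.5, release=0.1):
    """Return one linear gain per frame for the music track.

    speech_active: iterable of booleans, one per analysis frame (assumed input).
    duck_db: how far the music drops while speech is present.
    attack / release: fraction of the remaining distance to the target level
    covered each frame, so the music dips quickly and recovers more slowly.
    """
    gains = []
    current_db = 0.0  # start at unity gain (no ducking)
    for active in speech_active:
        target_db = duck_db if active else 0.0
        coeff = attack if active else release
        # One-pole smoothing toward the target avoids abrupt level jumps.
        current_db += (target_db - current_db) * coeff
        gains.append(db_to_gain(current_db))
    return gains
```

Multiplying each frame of the music by the matching gain produces the dip under speech and the gradual lift once it stops. Smoothing in decibels rather than in linear gain is a design choice here, made so the fade sounds even to the ear.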

How it works

  1. Add audio tracks to the timeline

    Each audio clip you drop creates a track. Add as many as your project needs — dialog, music, sound effects, room tone, voiceover.

  2. Open the Mixer panel

    Click the Mixer tab. Each timeline track shows up as a channel strip with fader, pan, meter, mute, and solo controls.

  3. Set base levels

    Set dialog to a comfortable speaking level (peaks between -12 and -6 dBFS), music to between -24 and -18 dBFS, and sound effects relative to dialog.

  4. Enable ducking on the music track

    Right-click the music channel, choose Sidechain to Dialog, set the duck amount (typically 9 to 12 dB), and the music bed will dip automatically under voice.

  5. Pan and balance

    Use the pan knob to spread sound effects in the stereo field, keep dialog centered, and place ambience slightly left or right for width.

  6. Watch master loudness and export

    Check the master LUFS meter, target -14 LUFS for YouTube or your delivery spec, then export — the mix renders into the final video file.
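The base levels in step 3 are decibel values, while audio samples are linear amplitudes. The sketch below shows the conversion and how much fader trim a clip would need to peak at a chosen level. It assumes samples are floats in the range -1.0 to 1.0, and the function names are hypothetical, not part of the editor.

```python
import math


def db_to_gain(db: float) -> float:
    """Decibels to linear amplitude multiplier (-6 dB is roughly 0.5)."""
    return 10.0 ** (db / 20.0)


def peak_dbfs(samples) -> float:
    """Sample peak of a clip in dBFS, where 0 dBFS is full scale (amplitude 1.0)."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        raise ValueError("clip is silent, so its peak level is undefined")
    return 20.0 * math.log10(peak)


def trim_db_for_peak(samples, target_dbfs: float) -> float:
    """Fader change in dB that would move the clip's peak to target_dbfs."""
    return target_dbfs - peak_dbfs(samples)
```

For example, a dialog clip peaking near -6 dBFS needs about 6 dB of attenuation to peak at -12 dBFS. This measures sample peak only. The true-peak reading on the master meter can come out slightly higher.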

Benefits

Per-track channel strips

Fader, pan, solo, mute, and meter for every audio track — the layout audio engineers already know.

AI-powered ducking

Music drops automatically under voice and lifts back when speech stops, no manual keyframing required.

LUFS-accurate metering

True peak and integrated loudness meters help you hit YouTube, podcast, or broadcast delivery targets.

Stereo placement

Pan controls and stereo width let you spread ambience and effects across the field without smearing dialog.
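A pan knob is commonly implemented with an equal-power pan law, which keeps perceived loudness roughly constant as a sound moves across the stereo field. The sketch below shows that law. Whether the Skrrol mixer uses exactly this curve is an assumption, and `equal_power_pan` is a hypothetical name.

```python
import math


def equal_power_pan(pan: float) -> tuple[float, float]:
    """Map a pan position to (left_gain, right_gain).

    pan: -1.0 is hard left, 0.0 is center, 1.0 is hard right.
    The two gains always satisfy left**2 + right**2 == 1, so the total
    power stays constant wherever the sound is placed.
    """
    if not -1.0 <= pan <= 1.0:
        raise ValueError("pan must be between -1.0 and 1.0")
    angle = (pan + 1.0) * math.pi / 4.0  # sweeps from 0 to pi/2
    return math.cos(angle), math.sin(angle)
```

At center, both channels get a gain of about 0.707 (roughly -3 dB each), which is why centered dialog does not sound louder than the same clip panned hard to one side.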

Who uses it

Podcast video creators

Two or three voice tracks, intro music, and sting effects mixed clean with auto-ducking on the music bed.

YouTube creators

Voiceover, B-roll music, and on-camera audio balanced so the channel sounds consistent across every video.

Course and tutorial producers

Narration sits on top of background music with predictable, hands-off ducking for every lesson.

Documentary editors

Interview audio, archival sound, ambience, and score mixed on a real channel-strip layout with stem export.

Short film teams

Dialog, foley, ambience, and music balanced to LUFS targets for festival or platform delivery specs.

Frequently asked questions

How many audio tracks can I run?

There's no fixed cap. Track count is bounded by your device's memory and CPU; modern laptops easily handle 20+ stereo tracks with effects.

Does ducking work on multiple voice tracks?

Yes. Sidechain the music channel to a single dialog channel, or route every voice track to a dialog bus and sidechain the music to that bus so it ducks under any speaker.
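The dialog-bus approach can be pictured as summing the voice tracks into one key signal and checking that signal for activity. The sketch below is illustrative only. It uses a simple RMS energy threshold as a stand-in for real speech detection, and `dialog_bus`, `speech_frames`, `frame_size`, and `threshold_db` are hypothetical names and parameters.

```python
import math


def dialog_bus(voice_tracks):
    """Sum voice tracks sample by sample into a single key signal.

    Tracks are assumed to be equal-length lists of floats; zip() stops at
    the shortest track if they are not.
    """
    return [sum(samples) for samples in zip(*voice_tracks)]


def speech_frames(bus, frame_size=4, threshold_db=-40.0):
    """Flag each frame whose RMS level exceeds the threshold.

    This is a crude energy gate, not a speech classifier.
    """
    threshold = 10.0 ** (threshold_db / 20.0)
    flags = []
    for start in range(0, len(bus), frame_size):
        frame = bus[start:start + frame_size]
        rms = math.sqrt(sum(s * s for s in frame) / len(frame))
        flags.append(rms > threshold)
    return flags
```

Because the flags come from the summed bus, the music ducks whenever any speaker is talking, without a separate sidechain per voice track.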

Can I export individual stems?

Yes. Export Stems writes each track or bus to its own audio file — useful for handing off to a dedicated audio engineer.

What loudness target should I aim for?

YouTube, Spotify, and most streaming platforms target -14 LUFS integrated. Broadcast TV targets -23 LUFS in the EU (EBU R 128) and -24 LUFS in the US (ATSC A/85).
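Once the master meter gives an integrated loudness reading, the gain change needed to reach a delivery target is a simple difference in decibels. The sketch below works through that arithmetic using the targets listed above. The dictionary keys and function names are hypothetical, and the check assumes that a static gain shifts loudness and true peak by the same number of decibels, which holds to a close approximation.

```python
# Delivery targets from the answer above, in LUFS integrated.
DELIVERY_TARGETS_LUFS = {
    "youtube": -14.0,
    "spotify": -14.0,
    "broadcast_eu": -23.0,
    "broadcast_us": -24.0,
}


def gain_db_to_target(measured_lufs: float, target_lufs: float) -> float:
    """Master gain change in dB that moves the mix from measured to target loudness."""
    return target_lufs - measured_lufs


def fits_under_ceiling(true_peak_dbtp: float, gain_db: float,
                       ceiling_dbtp: float = -1.0) -> bool:
    """True if the mix's true peak stays at or below the ceiling after the gain change."""
    return true_peak_dbtp + gain_db <= ceiling_dbtp
```

For example, a mix measuring -19.5 LUFS needs +5.5 dB to reach -14 LUFS. If its true peak is already -4 dBTP, that boost would land at +1.5 dBTP, so the difference has to come from limiting or rebalancing rather than from raw gain alone.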

Is there a built-in limiter on the master?

Yes. The master bus has a true-peak limiter to prevent clipping, with a configurable ceiling (-1 dBTP is the safe default).

Does mixing happen in the cloud?

No. The mix runs locally in your browser on the Web Audio API, so audio processing stays private and works offline.

Try it in the Skrrol AI editor

Skrrol is a browser-native video studio. Open the editor in your browser, drop in your media, and use this feature alongside the rest of the timeline. Free, no install, your files stay on your device.