Participants will develop a MATLAB audio plugin that helps podcasters and social-media creators achieve clear, professional-sounding speech recordings in noisy, budget environments. By combining Voice Activity Detection (VAD) with AI-based enhancement and classic DSP, the plugin will automatically suppress background noise and polish vocals with a simple, user-friendly interface.
These blocks, and there ordering, are totally optional - this is just a starting off point for a signal chain for this type of plugin effect.
Removes low-frequency rumble and proximity effects
Uses 'voiceActivityDetector' to compute speech-presence probability.
Applies gating to attenuate non-speech.
Calls 'enhanceSpeech' to denoise and reduce residual artifacts
Reduces harsh sibilants with a narrow-band compressor around 5–10 kHz
Adds presence with a gentle boost at 3–5 kHz for clarity
Smooths level variations (e.g. ratio 3:1, attack 10 ms, release 100 ms)
Prevents clipping and controls peaks
Final output gain controlled by the user (–12 dB to +12 dB)
Measure the STOI of the speech signal
Here are some tips for scoring highly with the judges!
This website created and maintained independently by Christopher L. Bennett and is neither owned nor controlled by Focal Press or the University of Miami. MATLAB is a registered trademark of Mathworks, Inc. Christopher L. Bennett, BennettAudio.com, and DigitalAudioTheory.com are in no way affiliated with Mathworks, Inc.
BennettAudio.com DigitalAudioTheory.com