jmvalin | RNNoise: Learning Noise Suppression

This demo presents the RNNoise project, showing how deep learning can be applied to noise suppression. The main idea is to combine classic signal processing with deep learning to create a real-time noise suppression algorithm that's small and fast. No expensive GPUs required — it runs easily on a Raspberry Pi. The result is much simpler (easier to tune) and sounds better than traditional noise suppression systems (been there!).


From: (Anonymous)

Dear Jean-Marc,

Many thanks for publishing your exciting work and sharing your code.
Two points are still not entirely clear to me after reading your documentation and code:

(1) The network's training input and output samples are finite sequences of 42- and 23-element vectors, respectively. In operation, however, is the trained network fed one input vector at a time, producing one output vector each time?

(2) Is the training data extracted from overlapping spectrogram segments?

Kind regards

From: jmvalin

1) For training, the RNN needs to be fed a full sequence so that it can backpropagate through time. When we actually use the RNN, we also feed it a sequence, but one frame at a time, because we want the algorithm to run in real time. (For batch denoising, we could feed the whole sequence at once.)
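To make the point above concrete, here is a minimal NumPy sketch showing that feeding a recurrent network one frame at a time, while carrying its state forward, gives the same outputs as feeding the whole sequence at once. This is not the RNNoise network: the plain tanh cell, the random weights, the hidden size of 16, and the names `step`, `run_batch`, and `StreamingDenoiser` are all assumptions made for illustration. Only the 42-element input and 23-element output sizes come from this thread.

```python
import numpy as np

# 42 inputs / 23 outputs are taken from the thread; the hidden size is arbitrary.
N_IN, N_HIDDEN, N_OUT = 42, 16, 23

rng = np.random.default_rng(0)
# Random weights stand in for a trained model (illustration only).
W_in = rng.standard_normal((N_IN, N_HIDDEN)) * 0.1
W_rec = rng.standard_normal((N_HIDDEN, N_HIDDEN)) * 0.1
W_out = rng.standard_normal((N_HIDDEN, N_OUT)) * 0.1


def step(frame, state):
    """Process one feature frame; return (output vector, new state)."""
    state = np.tanh(frame @ W_in + state @ W_rec)
    return state @ W_out, state


def run_batch(frames):
    """Offline use: process a whole sequence in a single call."""
    state = np.zeros(N_HIDDEN)
    outputs = []
    for frame in frames:
        out, state = step(frame, state)
        outputs.append(out)
    return np.stack(outputs)


class StreamingDenoiser:
    """Real-time use: the caller pushes one frame at a time; state persists."""

    def __init__(self):
        self.state = np.zeros(N_HIDDEN)

    def push(self, frame):
        out, self.state = step(frame, self.state)
        return out


frames = rng.standard_normal((100, N_IN))
batch_out = run_batch(frames)

stream = StreamingDenoiser()
stream_out = np.stack([stream.push(f) for f in frames])

# Both paths see the same sequence, so they produce the same outputs.
print(np.allclose(batch_out, stream_out))
```

The only difference between the two modes is who holds the loop: in the streaming case the recurrent state lives between calls, so each output is available as soon as its frame arrives.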

2) Yes, we use a frame size of 20 ms, with 10 ms overlap.
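As a rough illustration of that framing, the sketch below cuts a signal into 20 ms frames that advance by 10 ms, so each frame shares its second half with the first half of the next one. The 48 kHz sample rate and the helper name `split_into_frames` are assumptions for the example, and no analysis window is applied here.

```python
import numpy as np

SAMPLE_RATE = 48000                         # assumption for this example
FRAME_MS, HOP_MS = 20, 10                   # 20 ms frames, 10 ms overlap
FRAME_LEN = SAMPLE_RATE * FRAME_MS // 1000  # samples per frame
HOP_LEN = SAMPLE_RATE * HOP_MS // 1000      # samples between frame starts


def split_into_frames(signal):
    """Return an array of overlapping frames, shape (n_frames, FRAME_LEN).

    Assumes the signal holds at least one full frame; trailing samples
    that do not fill a frame are dropped.
    """
    n_frames = 1 + (len(signal) - FRAME_LEN) // HOP_LEN
    return np.stack([
        signal[i * HOP_LEN : i * HOP_LEN + FRAME_LEN]
        for i in range(n_frames)
    ])


# One second of dummy audio; a ramp makes the overlap easy to check.
signal = np.arange(SAMPLE_RATE, dtype=np.float64)
frames = split_into_frames(signal)

print(frames.shape)
# The second half of frame 0 is the first half of frame 1.
print(np.array_equal(frames[0, HOP_LEN:], frames[1, :HOP_LEN]))
```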

From: (Anonymous)

Thanks a lot for your quick answer!
Regarding 2), I think I need to make my question more specific:

Looking at your training code (rnnoise/training/rnn_train.py), you feed the network with sequences of 2000 42-element vectors/frames (= 1 training sample). Now I wonder if two distinct training samples might share a certain number of frames?
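To spell out what the question distinguishes, here is a small sketch of two ways a long run of feature frames could be cut into 2000-frame training samples. It is not taken from rnn_train.py and makes no claim about which approach that script uses; the function names, the stride of 1000, and the 5000-frame dummy input are made up for illustration.

```python
import numpy as np

WINDOW = 2000      # frames per training sample (from the comment above)
N_FEATURES = 42    # elements per frame (from the comment above)


def disjoint_sequences(features):
    """Cut into back-to-back windows: no frame appears in two samples."""
    n = len(features) // WINDOW
    return features[: n * WINDOW].reshape(n, WINDOW, features.shape[1])


def sliding_sequences(features, stride):
    """Cut with a sliding window: neighbours share WINDOW - stride frames
    whenever stride < WINDOW."""
    starts = range(0, len(features) - WINDOW + 1, stride)
    return np.stack([features[s : s + WINDOW] for s in starts])


# Dummy features with unique values so shared frames are easy to detect.
features = np.arange(5000 * N_FEATURES, dtype=np.float32).reshape(5000, N_FEATURES)

a = disjoint_sequences(features)
b = sliding_sequences(features, stride=1000)

print(a.shape, b.shape)
# With stride 1000, the last 1000 frames of one sample open the next one.
print(np.array_equal(b[0, 1000:], b[1, :1000]))
```

Note that this is separate from the 10 ms overlap mentioned earlier: that overlap is between audio frames, whereas the question here is whether whole feature frames are reused across training samples.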

Jean-Marc Valin
