jmvalin: (Default)
[personal profile] jmvalin

This is a follow-up on the first LPCNet demo. In this new demo, we turn LPCNet into a very low-bitrate neural speech codec (see submitted paper) that's actually usable on current hardware and even on phones. It's the first time a neural vocoder is able to run in real-time using just one CPU core on a phone (as opposed to a high-end GPU). The resulting bitrate — just 1.6 kb/s — is about 10 times less than what wideband codecs typically use. The quality is much better than existing very low bitrate vocoders and comparable to that of more traditional codecs using a higher bitrate.

Read More

Crash on raspi

Date: 2019-03-30 07:43 am (UTC)
From: (Anonymous)

Thank you for the new codec. I have tested it on my desktop without avx2 and like the quality/space tradeoff. It's painfully slow decoding without avx2, but that is expected. I tried to cross-compile for raspberry pi 3 and get a crash in sincos. Do you know what I'm doing wrong? I'm using gcc-8.3.0 and I had to do a bit of work to get it to compile:
CC=arm-unknown-linux-gnueabi-gcc-8.3.0 CFLAGS='-O3 -ggdb -march=armv8-a -mfpu=neon' ../configure --target=arm-unknown-linux-gnueabi --host=x86_64-pc-linux-gnu
I'm on commit 343e35.

Possible to parallelize?

Date: 2019-04-04 02:16 pm (UTC)
From: (Anonymous)
Hello, jmvalin!

Ist it possible to parallelize the neural-network (or even build it in a decent FPGA)?



Date: 2019-04-10 11:18 pm (UTC)
From: (Anonymous)
In your left to do paragraph, you mention maybe this could be used in Opus. Could this technology also be used to enhance music at lower bit rates, say 20kb/s and achieve similar quality to a higher bitrate of say 64kb/s? Also, is there any samples at to what music would sound like (I know this is for speech so far, but would be interesting to see what happens to music at 1.6kb/s. Cheers, Kirk

Re: Opus

Date: 2019-04-11 05:13 am (UTC)
From: (Anonymous)
I was thinking the same, but thought I would double check. It would be great if this ends up in Opus and can be used for say a podcast where speech / music detection takes place to mix music with current opus WB codec with higher bitrate in with the LPC net stream at 1.6kb/s. I hope that makes sense. Should yield very small files for distribution. Great work by the way. Extremely impressive result so far.


jmvalin: (Default)

April 2019

 1234 56

Most Popular Tags

Page Summary

Style Credit

Expand Cut Tags

No cut tags
Page generated Apr. 26th, 2019 04:26 am
Powered by Dreamwidth Studios