jmvalin | Opus quality update

Those who have been following the Opus git repository in the past few weeks probably haven't noticed much work going on. The reason is pretty simple, most of the work has been going on elsewhere in an experimental branch (exp_wip3 names for now) of my private repository. The reason it's in an experimental branch is that its not fully converted to fixed-point and hasn't been tested on any frame size other than 20 ms. Here's an (incomplete) list of changes for now:

Really unconstrained VBR (not trying to keep the same average rate)
Tonality detection to give highly tonal audio a boost in bit-rate
(yet another) rewrite of the transient detection code
New dynamic allocation code that boosts the rate of bands that have significant spectral leakage caused by short blocks

Thanks to these changes, the quality has (as far as we can tell) gone up compared to the current master branch. I invite you to judge for yourself by comparing the audio coded with the current master branch with the audio coded with the new exp_wip3 experimental branch. This is 64 kb/s, so fairly low rate for stereo music. The original is here. Let me know what you think.

S	M	T	W	T	F	S
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

Most Popular Tags

aac - 2 uses
academic scam - 1 use
amazon - 3 uses
aom - 1 use
bugs - 3 uses
c - 1 use
celt - 23 uses
codec2 - 1 use
codecs - 37 uses
conference - 1 use
daala - 4 uses
deep learning - 4 uses
demo - 11 uses
entropy - 1 use
eusipco - 1 use
fixed-point - 2 uses
ghost - 2 uses
hardware - 1 use
ietf - 9 uses
laptop - 1 use
lca - 1 use
memmove - 1 use
mozilla - 18 uses
noise - 3 uses
open access - 1 use
opus - 22 uses
paper - 2 uses
patents - 1 use
quebec - 1 use
rant - 2 uses
renovations - 6 uses
silk - 2 uses
speech - 5 uses
speex - 7 uses
testing - 1 use
type safety - 1 use
ubuntu - 2 uses
underhanded - 1 use
video - 4 uses
vorbis - 3 uses
vp8 - 1 use
webrtc - 5 uses
xiph - 37 uses

Flat | Top-Level Comments Only

From:

prodicus.myopenid.com (from livejournal.com)

I don't have "golden ears"- both of the encodes sound good enough that I figured I could only tell them from the original during the "Tom's Diner" material from the last ten seconds. I did test to make sure and I can definitely ABX them both from the original there.

I have a little more trouble distinguishing between the two Opus encodes there-- I identified them correctly six times in a row but then monotony/ear fatigue started to creep in. It seemed to me that the experimental version encode was slightly worse on Suzanne Vega's voice. The voice sounds less smooth/ "harsher"in both Opus encodes and I think that effect is a little more pronounced in the experimental encode.

jmspeex.livejournal.com

Thanks, I'll see if I can hear the difference you're talking about. In terms of improvements in the experimental branch, the samples where it should be the most obvious are samples 4 (guitar, just after the Dave Matthews Band sample) and 5 (woodblocks, just before Vega). Let me know what you think.

Tried again, this time using AKG studio headphones rather than earbuds (still using my laptop's integrated audio though, my recording equipment is inconvenient to use at the moment). Still struggle to ABX either Opus encode from the original on the guitar or woodblocks.

If you encoded the samples which Opus did the worst at in the public 64kbps multiformat listening test with these two encoders, maybe the difference on those might be more audible. Or you could ask over at hydrogenaudio and see if any "golden ears" type folks there can discern the differences in the samples you posted.

Two more details about the Tom's Diner issue:

-I'd forgotten that abchr randomizes the sample numbers. After doing the test again and reading the saved result file I found that the better of the two encodes was actually the new experimental encoder. Taking a brief break between tests allowed me to reliably ABX the difference between the encodes, so this is a real perceptible improvement.

-I can now better describe what the problem sounds like-- Vega's voice is kind of "breathy," especially on the lowest notes, and it sounds as though Opus (especially the older encoder) is encoding some of this essentially non-harmonic aspect of the sound as false overtones.

Well, one extreme example is the harpsichord sample (#2) from the HA test. The original is here (http://media.xiph.org/audio/HA_2011/sample02r.wav) and you can compare the quality of Opus in the HA test (http://media.xiph.org/audio/HA_2011/Sample02_4.wav) to the experimental branch (http://jmvalin.ca/misc_stuff/sample02_coded64A.wav). Just be aware that the new VBR code actually cranks up the bit-rate to around 100 kb/s on that sample for the experimental branch (the average for many sample is still around 64 kb/s).

As for the Vega sample, I'm not sure I understand what artefact you're describing. Are you talking about noise during voiced segments or tones during unvoiced segments?

blaise potard (from livejournal.com)

Hi Jean-Marc,

I have to say both samples sound very good to me, and I have a hard time distinguishing one from the other.

However, I have a small remark: if you look at a spectrogram of the Suzanne Vega sample at the very end, especially the voiced or fricatives sections, it looks like the whole 7-15kHz frequency band has been transposed to 15-20kHz. I am not familiar at all with the Opus codec, so it may be normal, but it certainly "looks" strange.

It happens on both the experimental and "old" codec, although the old one looks slightly worse.

This being said, I have a hard time actually "hearing" the effect of this (unless I do a high-pass filter first), so the codec may just be exploiting some weaknesses of the human ear.

giodj-cmp.livejournal.com

tengo una pregunta :¿como logro convertir una vos que esta en un formato mp3 o wav a el formato speex o ZGR ? esto resulta porque quiero hacer un espeech como las de fl studio, pero con mi vos.En el programa se arrastra el formato a una plataforma y ,luego ,aparese un cuadro de dialogo donde se puede escribir lo que quiere que diga esa vos . si me ayuda les agradesco :)por cualquier cosa mi correo es :gioandkat@hotmail.com

Jean-Marc Valin

Opus quality update

Opus quality update

Not sure there's improvement

Re: Not sure there's improvement

Re: Not sure there's improvement

Re: Not sure there's improvement

no subject

convertir a formato zgr

Profile

March 2023

Most Popular Tags

Page Summary

Style Credit

Expand Cut Tags