Demo: soundshader.github.io/acf-v2
Image is clickable.
These images are auto-correlation spectrograms of vowels.
ACF(X)=FFT[abs(FFT(X))^2]. It splits input X into amp*cos(freq*t+phi) waves, drops phases phi and squares amplitudes amp. For this reason, ACF(X) is a symmetric function.abs(ACF(X))/max(abs(ACF(X))) to avoid oversaturation. The ACF values aren’t squared and aren’t log10-scaled.FFT[BPF*abs(FFT(X))^2]. The low frequency ACF is rendered with color (12,3,1) and the high frequency ACF - with color (1,3,12). Oversaturation allows to reveal more details without resorting to log10-scaling (which doesn’t look good).Vowel sounds below taken from the IPA table on Wikipedia. Tag “ncnfr” means near-close near-front rounded. Images are clickable.
For comparison, here is a visualization of 63 drum/kick sounds (clickable):
10/2022