/vt/ - Virtual Youtubers

Anonymous Tue Aug 9 05:55:12 2022 No.30413784 [View]
File: 776 KB, 2400x5800, 1649682477141.png [View same] [iqdb] [saucenao] [google]

Pink noise is a type of noise whose power spectral density (PSD) follows the curve 1/f^a (where a = 1, corresponding to a -3db/octave slope), and whose energy is equal across the frequency spectrum. Pink noise appears in many phenomena and systems including tides, electronic devices, pulsars, and EEG recordings of human brains. Thus, it has become a subject of interest among scientists and retarded hippies alike.

The article claims that the frequency spectrum of Hoshimachi Suisei's voice follows a pink noise curve on average. The author does not clearly outline his methodology, choosing only to say that he used the least squares method on the data, and that he used the entire human hearing range from 20 Hz to 20 kHz[1].

We can use the Welch averaging method to estimate PSD. Welch's method involves splitting the signal into overlapping segments and windowing them, after which a PSD estimate, or periodogram, is calculated by taking the squared magnitudes of the discrete Fourier transform on each windowed segment. This method reduces signal noise in exchange for reducing frequency resolution, producing a smoother signal. I used 2048 segments with 50% overlap, hamming windows, 4096 discrete Fourier transforms. All audio files were recorded or downloaded with the minimum standard audio sampling rate of 44.1 kHz. I opted to use the range from 300 Hz to 20 kHz as the data below 300 Hz is not necessary for this analysis.

There are two ways to fit the data gathered from the PSD estimate to the 1/f curve: linearize the data and function and use linear regression, or use nonlinear least squares regression. I never took stats, so I relied on Matlab for both of these.

y=1/x^a can be rewritten as y=x^-a, a power function. It can then be linearized as log(y) = -a*log(x). Performing linear regression with pink noise as the input consistently yields a = 1 with less than 1% error, proving that this method works. However, on the two Suisei acapella recordings, we get a ≈ 5. This is not consistent with pink noise. A Risu acapella gives a ≈ 6, and a well-recorded raw vocal gives a ≈ 5.4. Looking at the graphs, this method seems to ignore data at high frequencies.

Using nonlinear least squares regression, we can fit the data directly to 1/x^a. Fitting real pink noise to this curve consistently yields a = 1 with less than 5% error. The Suisei acapellas give a = 0.865 and a = 0.855. The Risu acapella and raw vocal give a = 0.856 and a = 0.857 respectively.

Averaging the values between the two methods does not give us anything close to a = 1. Using a larger bandwidth lowers a values from 0.85 to 0.7 for the nonlinear method, and makes no significant change for the linear method. Alternatively, we can look at the power per octave of each audio file and see that they barely resemble pink noise, except for over short, insignificant frequency ranges.

Alternatively, use your ears.

[1] J. Retard "星街すいせいの歌声スペクトル解析および1/fゆらぎの確認." https://suichyan-ha-kyoumokawaii.amebaownd.. com/posts/17805948

Anonymous Mon Apr 11 13:07:57 2022 No.22238721 [View]
File: 776 KB, 2400x5800, pink noise.png [View same] [iqdb] [saucenao] [google]

Pink noise is a type of noise whose power spectral density (PSD) follows the curve 1/f^a (where a = 1, corresponding to a -3db/octave slope), and whose energy is equal across the frequency spectrum. Pink noise appears in many phenomena and systems including tides, electronic devices, pulsars, and EEG recordings of human brains. Thus, it has become a subject of interest among scientists and retarded hippies alike.

The article claims that the frequency spectrum of Hoshimachi Suisei's voice follows a pink noise curve on average. The author does not clearly outline his methodology, choosing only to say that he used the least squares method on the data, and that he used the entire human hearing range from 20 Hz to 20 kHz[1].

We can use the Welch averaging method to estimate PSD. Welch's method involves splitting the signal into overlapping segments and windowing them, after which a PSD estimate, or periodogram, is calculated by taking the squared magnitudes of the discrete Fourier transform on each windowed segment. This method reduces signal noise in exchange for reducing frequency resolution, producing a smoother signal. I used 2048 segments with 50% overlap, hamming windows, 4096 discrete Fourier transforms. All audio files were recorded or downloaded with the minimum standard audio sampling rate of 44.1 kHz. I opted to use the range from 300 Hz to 20 kHz as the data below 300 Hz is not necessary for this analysis.

There are two ways to fit the data gathered from the PSD estimate to the 1/f curve: linearize the data and function and use linear regression, or use nonlinear least squares regression. I never took stats, so I relied on Matlab for both of these.

y=1/x^a can be rewritten as y=x^-a, a power function. It can then be linearized as log(y) = -a*log(x). Performing linear regression with pink noise as the input consistently yields a = 1 with less than 1% error, proving that this method works. However, on the two Suisei acapella recordings, we get a ≈ 5. This is not consistent with pink noise. A Risu acapella gives a ≈ 6, and a well-recorded raw vocal gives a ≈ 5.4. Looking at the graphs, this method seems to ignore data at high frequencies.

Using nonlinear least squares regression, we can fit the data directly to 1/x^a. Fitting real pink noise to this curve consistently yields a = 1 with less than 5% error. The Suisei acapellas give a = 0.865 and a = 0.855. The Risu acapella and raw vocal give a = 0.856 and a = 0.857 respectively.

Averaging the values between the two methods does not give us anything close to a = 1. Using a larger bandwidth lowers a values from 0.85 to 0.7 for the nonlinear method, and makes no significant change for the linear method. Alternatively, we can look at the power per octave of each audio file and see that they barely resemble pink noise, except for over short, insignificant frequency ranges.

Alternatively, use your ears.

[1] J. Retard "星街すいせいの歌声スペクトル解析および1/fゆらぎの確認." https://suichyan-ha-kyoumokawaii.amebaownd.. com/posts/17805948

Advanced search
Text to find
Subject [?]Search by post subject. Leave empty for any.
Username [?]Search for user name. Leave empty for any user name.
Tripcode [?]Search for tripcode. Leave empty for any.
Email [?]Search by email. Leave empty for any.
Filename [?]Search by image filename. Leave empty for any.
From Date [?]Enter what date to start searching from. Format is YYYY-MM-DD
To Date [?]Enter what date to start searching until. Format is YYYY-MM-DD
Image hash
Search in	All Posts OPs Only
Deleted posts	Show all posts Show only deleted posts Only show non-deleted posts
Internal posts	Show all posts Show only internal posts Show only archived posts
Order	New posts first Old posts first
Capcode	All Posts Only by Users Only by Mods Only by Admins Only by Developers
Results	Posts Threads
Action	[ Simple ]

Navigation
View posts	[+24]	[+48]	[+96]

/vt/ - Virtual Youtubers

Search: