Overview This project uses PyTorch to implement a CNN model for classifying atarayo, yorushika, yoasobi and zutomayo, which is suitable for beginners who ardently love Japanese music (such as me) in ...
This repository archives the software components used in an acoustic monitoring and identification system. It covers the complete workflow from dataset acquisition, data format conversion, ...
As shown in Figure 4a, the feature extraction using MFCC includes the steps such as pre-emphasis, framing the signal, windowing, Fast Fourier Transform (FFT), Mel-Filter Bank, Logarithm, Discrete ...
Are you working on Neural Audio Codecs or TTS? Then you have probably heard of 𝐔𝐓𝐌𝐎𝐒𝐯𝟏, a MOS predictor widely used for evaluating speech quality. A recent ICASSP 2026 paper showed that UTMOSv1 ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results