MFCC Feature Extraction Python Code

A music classification project for atarayo, yorushika, yoasobi and zutomayo which based on PyTorch.

Overview This project uses PyTorch to implement a CNN model for classifying atarayo, yorushika, yoasobi and zutomayo, which is suitable for beginners who ardently love Japanese music (such as me) in ...

GitHub

Acoustic Monitoring and Identification System

This repository archives the software components used in an acoustic monitoring and identification system. It covers the complete workflow from dataset acquisition, data format conversion, ...

Frontiers

Multi-QuadEmoNet: cat and dog emotion classification model from animal vocalization using multi-stage LSTM-GRU paradigm

As shown in Figure 4a, the feature extraction using MFCC includes the steps such as pre-emphasis, framing the signal, windowing, Fast Fourier Transform (FFT), Mel-Filter Bank, Logarithm, Discrete ...

UTMOSv1 PyTorch Implementation for Neural Audio Codecs

Are you working on Neural Audio Codecs or TTS? Then you have probably heard of 𝐔𝐓𝐌𝐎𝐒𝐯𝟏, a MOS predictor widely used for evaluating speech quality. A recent ICASSP 2026 paper showed that UTMOSv1 ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results