Reduce MFCC output

Question

Reduce MFCC output

274 views Asked by nebula186 At 15 November 2018 at 04:23

I am trying to analyze song audio using a python library, the output is a numpy array, the array is very large in size as the MFCC is calculated for every frame of the audio. When I write this output to a file , each song has an output of about 3-4MB. Is there a way to reduce the N frames of information into a single row of features?

click here]([![MFCC outut )

Original Q&A

There are 1 answers

**Jon Nordby** · Answer 1 · 2018-12-02T02:55:04+00:00

A common practice is to group consecutive frames into sequence windows, compute aggregate statistics on each texture window and then summarize this again using aggregated statistics.

The statistics are computed per input feature (MFCC band in your case). Example statistics functions would be mean, standard deviation, min, max. Texture sizes can be between 1-60 seconds.

See Low-level features and timbre, Juan Pablo Bello, MPATE-GE 2623 Music Information Retrieval

TechQA.

Reduce MFCC output

There are 1 answers

Related Questions in PYTHON

Related Questions in MFCC

Related Questions in AUDIO-ANALYSIS

Popular Questions

Popular Tags

Trending Questions