What is the fastest way of extracting mfcc from audio files in linux (Raspberry Pi in my case). I tried sphinx3 but it was slow for large files (on Raspberry Pi). SFS (speech filing system) was quite fast on windows but i could not install it on linux. Please suggest some way.
Fastest method of MFCC extraction on linux machine
759 views Asked by Ironclad At
2
There are 2 answers
0
Aleksandr Dubinsky
On
Try compiling sphinx-train or HTK with a good optimizing compiler and set the correct options for the instruction set. See, eg, raspberrypi.stackexchange.com/questions/2046/which-cpu-flags-are-suitable-for-gcc-on-raspberry-pi Looks like you have to use the right Linux distro to even have FPU support (very important!)
They don't use something like fftw or hand-coded assembly, but their implementations of FFT etc are efficient.
Related Questions in SIGNAL-PROCESSING
- What kind of ARIMA model would be best fit for this data?
- Find Transfer Function from FFT Plot MATLAB
- How can I calculate the SNR of a curve that has impulse noise added?
- How to decrease too many False Positives I get from a KNN classifier for ECG R-peak detection?
- Constant and inconstant values using NI-DAQmx Python API although not issues with NI SignalExpress 2015
- How to get the frequencies and corresponding amplitudes from the FFT of a signal?
- How to get the correct frequency amplitudes in the FFT of a signal
- Using FFT to sum independent random variables
- Decompose time-series signal into different components
- Cross-talk correction in 2D spectrum using Python
- How to remove constant part of a signal in python?
- Analyzing a Power Spectrum of an Audio File for Patterns
- Matlab Real-Time Audio Simulation Speaker Output, Annoying Clicking Issue
- Spectrogram PNG back to WAV Audio
- Is there a way to (automatically) detect if the channels of a stereo video/audio are out of phase and canceling each other?
Related Questions in SPEECH-RECOGNITION
- How to Avoid Speech Recognition from Recognizing Speaker Playback in Unity
- recognize_google fails with WinError 10060
- React native voice isn't detecting my voice
- Comparing analog signal from Electret mic with samples
- Unable to convert Speech to Text using Azure Speech-to-Text service
- Python Script Not Generating Sync Map Despite Successful Command Line Execution
- Automatic speech recognition from scratch
- google speech transcribe-streaming-audio with single_utterance and time limit
- Azure AI Speech Service - No punctuation on Recognized return
- How to get the microphone to record sound with Google Speech recognition on Raspberry Pi 3?
- How to fix the below mention error in python
- How to increase the time for which the Microsoft Speech Service SDK listens in a single go?
- Make real time prediction with Keras
- AttributeError: module 'speech_recognition' has no attribute 'Microphone'
- Is there any way to do this without writing the file to memory first?
Related Questions in RASPBERRY-PI
- Raspberry Pi Sense Hat with Pyinstaller
- Waiting for several hours before resuming execution
- Using compositor in Gstreamer to merge imagesequence with video/audio stream
- How to make this sensor keep taking readings once its when_in_range function has been activated?
- Emulating an HID device with a raspberry pi 5
- How to stream PCM audio to a speakers both on mac and linux in Node.js?
- I want to use NRF24L01 transceiver with raspberry pi but some GPIO pins are already occupied by 3.5 inch display how do i connect transceiver
- voron 2.4 mainsail kiauh error install crowsnest (solved problem)
- Pi4J minimal example's LED doesn't blink
- How can I connect and send data from react-native to raspberry pi
- Xpt 2046 Display with Pn532 RFID reader
- Google SDK for Raspberry pi
- How to use a RaspberryPi camera with the cv2.videoCapture() function?
- How do I capture audio from the system bus in a node app (or from sox) on a raspberry pi?
- Can't open /dev/mem: Operation not permitted Raspberry Pi
Related Questions in MFCC
- Generate MFCC with good noise for an audio signal of 0.01 seconds
- Problems that arise in learning features of audio files
- What is wrong with this code performing unsupervised segmentation?
- How to compare pronouncation accuracy of a recoreded voice with an audio file already on device and return similarity percentage in android Kotlin
- Is it possible to feed CNN with raw numbers instead of images?
- from python_speech_features import mfcc (Import "python_speech_features" could not be resolved)
- How can praat get formant data from audio?
- Error using trainNetwork . Number of observations in X and Y disagree
- How can i change the the tensor value into normal text data?
- matlab error about the speech recognition by using pitch and mfcc
- Why does Librosa.core.spectrum.stft divide the signal into frames without padding?
- Understanding MFCC output for a simple sine wave
- Is there any way to reverse normalization of mfcc? I have used CMVN of SpeechPy to normalize the mfcc
- Differences in MFCC values
- MFCC plot in real-time audio processing
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
sphinx-trainhas an app to do it. I think it works reasonably hard, I've been using it on an old P4 and didn't take too long. I didn't have that much data, though.You can use HCopy from HTK though I haven't tried it.
If you are a programmer, consider profiling the
sphinx-trainextractor and optimizing it. Make sure to let the authors know if you succeed.Final note: the process does take time. It can't be avoided. Try to set-up automated scripts and leave the extraction running for the night. It's often the best way to approach it.