How to convert human voice into digital format?

4.6k views Asked by Shekhar At 27 February 2011 at 05:49

I am working on a project in which a biometric system is used to secure access. We are planning to use the human voice as the biometric.

The idea is to let a person say some words or sentences, which the system stores in digital format. The next time that person wants to enter the system, he or she has to speak some words, which may or may not be the same as the words used earlier.

We don't want to match the words; we want to match the voice frequency.

I have read some research papers on systems like this, but they don't include any implementation details.

So I would like to know whether there is any software or API that can convert analog voice into digital format and also report the frequency of the voice.

Until now I have worked on ordinary web-based applications, so I know the usual APIs and platforms such as Java EE and C#, but I have no experience with this kind of application.

Please enlighten me!

There are 3 answers

JohnP On 01 March 2011 at 07:53

This is as good a starting point as any: http://marsyas.info/

It's an open source software framework for audio processing. They've listed a number of projects that have used the framework in various ways, so you could probably draw inspiration from them: http://marsyas.info/about/projects. The Teligence project in particular seems closest to your needs, as it used the framework to classify audio by gender: http://marsyas.info/about/projects#5Teligence

yms On 07 March 2011 at 16:27

I believe there are two steps in a project like this one:

The first step is to record the voice from an analog input into a digital format (let's assume WAV/PCM). For this you can use the DirectShow API from C#, or the standard waveIn API as in this project: http://www.codeproject.com/KB/audio-video/cswavrec.aspx. You may want to compress your audio files later on. There are many options for this; on Windows you could consider the Windows Media Format SDK to avoid licensing issues with other formats.
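To make "digital format" concrete, here is a minimal Python sketch of what WAV/PCM storage amounts to: the signal is sampled at a fixed rate, each sample is quantized to a 16-bit integer, and the integers are written to a WAV file. It uses only the standard library. Capturing from a real microphone needs a platform audio API, so a synthetic 150 Hz tone stands in for the recorded voice; the 16 kHz sample rate, the file name, and the helper names (`quantize_16bit`, `write_wav`, `read_wav`) are my own assumptions, not part of any API mentioned above.

```python
import math
import os
import struct
import tempfile
import wave

SAMPLE_RATE = 16000  # samples per second (assumed; a common choice for speech)


def quantize_16bit(samples):
    """Map floats in [-1.0, 1.0] to signed 16-bit PCM integers."""
    return [max(-32768, min(32767, int(round(s * 32767)))) for s in samples]


def write_wav(path, pcm, sample_rate=SAMPLE_RATE):
    """Write mono 16-bit PCM samples to a WAV file."""
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)           # mono
        wf.setsampwidth(2)           # 2 bytes = 16 bits per sample
        wf.setframerate(sample_rate)
        wf.writeframes(struct.pack("<%dh" % len(pcm), *pcm))


def read_wav(path):
    """Read a mono 16-bit WAV file back as (sample_rate, list of ints)."""
    with wave.open(path, "rb") as wf:
        count = wf.getnframes()
        raw = wf.readframes(count)
        return wf.getframerate(), list(struct.unpack("<%dh" % count, raw))


# Stand-in for microphone input: half a second of a 150 Hz tone.
tone = [0.5 * math.sin(2 * math.pi * 150 * n / SAMPLE_RATE)
        for n in range(SAMPLE_RATE // 2)]
pcm = quantize_16bit(tone)

# Hypothetical output location, chosen only for this example.
path = os.path.join(tempfile.gettempdir(), "voice_sample.wav")
write_wav(path, pcm)
```

Reading the file back with `read_wav(path)` should return the same sample rate and the same integer samples, which is all that "stored in digital format" means at this level.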

The second step is to build or use a voice recognition framework. If you want to build one yourself, you will probably need to define a set of "features" for your sound fragments, then select and implement a recognition algorithm. There are many approaches available for this; the IEEE and ACM (acm.org) websites are usually good sources. If you want to use an existing framework, you may want to consider Nuance Recognizer (commercial) or http://cmusphinx.sourceforge.net (open source).
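As an illustration of what one such "feature" could look like, the sketch below estimates the fundamental frequency (pitch) of a short frame of samples using autocorrelation, in plain Python. This is only an assumption about how you might get started: the function name, the 60 to 400 Hz search range, and the 8 kHz test tones are mine, and pitch alone is generally not enough to tell speakers apart, so a real system would likely combine it with richer spectral features.

```python
import math


def estimate_pitch_hz(samples, sample_rate, fmin=60.0, fmax=400.0):
    """Estimate the fundamental frequency of a frame via autocorrelation.

    Tries every lag whose frequency lies between fmin and fmax and
    returns the frequency of the lag with the strongest self-similarity.
    Returns 0.0 if no lag has a positive correlation.
    """
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    best_lag, best_score = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Correlate the frame with a copy of itself shifted by `lag`.
        score = sum(samples[i] * samples[i + lag]
                    for i in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else 0.0


def make_tone(freq_hz, sample_rate, n_samples):
    """Synthetic stand-in for a voiced frame (a pure sine wave)."""
    return [math.sin(2 * math.pi * freq_hz * n / sample_rate)
            for n in range(n_samples)]


RATE = 8000  # assumed sample rate for the test frames
pitch_a = estimate_pitch_hz(make_tone(200.0, RATE, 800), RATE)
pitch_b = estimate_pitch_hz(make_tone(125.0, RATE, 800), RATE)
print(pitch_a, pitch_b)
```

In practice you would run this on many short frames of the decoded PCM samples and keep statistics such as the average pitch per speaker, rather than relying on a single frame.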

Hope this helps.

abmv On 01 March 2011 at 08:23 BEST ANSWER

http://www.loquendo.com/en/products/speaker-verification/
http://www.nuance.com/for-business/by-solution/contact-center-customer-care/cccc-solutions-services/verifier/index.htm
(two links removed due to reported virus content)
http://www.persay.com/products.asp
