How to find count of words in Audio Song

Question

0.00/5 (No votes)

See more:

Hello All,

How to find the count of words in an Audio song file using C++ or Objective C.

Thanks & Regards
Tamilvanan.K

*REMOVED MAIL*

Posted 20-Jan-11 0:48am

K. Tamilvanan

Updated 20-Jan-11 0:58am

JF2015

v2

Add a Solution

Comments

JF2015 20-Jan-11 6:58am

Edited to remove mail address. You will get an automated mail if someone answers your question.

1 solution

Add a Solution

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Andrew Brock · Accepted Answer · 2011-01-20T01:00:00

If this means what I think it means, then don't bother.
Speech recognition is difficult, then you want to be able to do it when the singer is putting their voice all through the spectrum, you have background noises, etc...

Althoug you only want the word count and not the actual words, in song 2 or more words often blend together seamlessly, so if you want it to be accurate you would need it to have some understanding of language.

I'm no pro on audio analysis, but what I would do is start by removing all the frequencies outside of the vocal range perhaps with a FFT[^], then try to apply a speech recognition[^] system over that.