Hi,

I am doing research related to speech recognition. So far I have been able to convert a sample waveform (.wav format) into feature vectors using MFCC, which generates an .mfc file. I can also read the .mfc file, but I don't know how to relate it to the spectrogram. I'm not sure whether this is even possible, but a book I read, "The Voice in the Machine", suggests that the feature vector values can somehow be turned into a matrix that represents the spectrogram of the speech sample. Below is an example of that figure (I cannot post the real image because it is copyrighted, so I created this example based on it):

2.0|2.3|2.1|2.4|4.5|7.3|7.6|7.4|5.6|4.1|1.1|1.3|0.3|0.5|0.1|0.3|0.4|
2.2|2.0|2.2|2.6|3.8|7.2|7.6|7.1|6.2|4.5|1.4|1.5|0.1|0.5|0.8|0.2|0.1|
2.5|2.1|2.6|7.4|6.7|7.4|3.1|2.2|3.4|3.3|2.5|3.0|0.1|0.2|0.5|0.3|0.7|
2.1|2.0|2.4|2.7|5.9|6.3|5.2|5.7|5.9|3.3|3.1|3.7|1.3|0.4|0.9|0.4|0.2|
2.0|2.1|2.1|2.4|6.3|7.4|6.3|6.8|6.5|5.8|5.5|5.4|1.1|0.3|0.3|0.5|0.7|
2.4|2.5|2.2|6.7|6.4|7.1|9.7|9.4|8.5|7.7|6.3|2.2|1.1|0.4|0.5|0.6|0.3|
9.6|8.3|9.2|9.4|9.2|9.8|9.5|9.8|9.7|8.1|6.8|2.3|1.9|0.1|0.4|0.4|0.2|
9.5|8.7|8.6|8.7|9.8|8.5|8.3|8.2|7.5|8.2|8.8|7.4|6.9|7.7|4.3|3.7|2.5|
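
For context on how MFCC vectors might relate to a matrix like the one above: MFCCs are commonly computed as a DCT of the log mel filterbank energies of each frame, so applying the inverse DCT to each vector should give back a smoothed log mel spectrum for that frame. The sketch below is only an illustration under several assumptions that are not confirmed by the .mfc file itself: an orthonormal DCT-II, the zeroth coefficient included, no liftering, and a known number of mel bands. The names `dct2`, `inverse_dct2` and `mfcc_to_log_mel` are hypothetical helpers, not part of any tool.

```python
import math

def dct2(x, n_coeffs):
    """Orthonormal DCT-II of x, truncated to the first n_coeffs coefficients."""
    n = len(x)
    out = []
    for k in range(n_coeffs):
        s = sum(x[j] * math.cos(math.pi * k * (j + 0.5) / n) for j in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out

def inverse_dct2(c, n_bands):
    """Inverse of the orthonormal DCT-II; missing high coefficients count as zero."""
    out = []
    for j in range(n_bands):
        s = c[0] * math.sqrt(1.0 / n_bands)
        for k in range(1, len(c)):
            s += (c[k] * math.sqrt(2.0 / n_bands)
                  * math.cos(math.pi * k * (j + 0.5) / n_bands))
        out.append(s)
    return out

def mfcc_to_log_mel(frames, n_bands):
    """Assumed layout: one MFCC vector per frame in, rows = bands and columns = frames out."""
    per_frame = [inverse_dct2(c, n_bands) for c in frames]
    return [[per_frame[t][b] for t in range(len(frames))] for b in range(n_bands)]
```

With all coefficients kept, `inverse_dct2(dct2(x, len(x)), len(x))` reproduces `x`; with only the first few coefficients, the result is a smoothed version of `x`, which is why a matrix rebuilt from MFCCs would look blurrier along the frequency axis than a spectrogram computed directly from the audio.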

Is it possible to generate this kind of matrix representing the spectrogram? Are there any tools or software that can be used? The values in bold represent the spectrogram intensity level.
I really hope someone can help me. Thank you.
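
To make the kind of matrix being asked about more concrete, here is a minimal sketch that builds one directly from waveform samples rather than from the .mfc file. It is not the book's method: it assumes a plain short-time DFT with a Hann window, equal-width frequency bands instead of mel bands, log energies rescaled to the range 0 to 9.9, and low frequencies placed in the bottom row. The function names and all parameter values are hypothetical.

```python
import cmath
import math

def frame_spectrum(frame):
    """Magnitude spectrum (bins 0..N/2) of one Hann-windowed frame via a direct DFT."""
    n = len(frame)
    windowed = [frame[i] * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
                for i in range(n)]
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(windowed[i] * cmath.exp(-2j * math.pi * k * i / n) for i in range(n))
        mags.append(abs(acc))
    return mags

def spectrogram_matrix(samples, frame_len=64, hop=32, n_bands=8, top=9.9):
    """Return n_bands rows (highest band first), one column per frame, scaled to 0..top."""
    if len(samples) < frame_len:
        raise ValueError("need at least one full frame of samples")
    columns = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        mags = frame_spectrum(samples[start:start + frame_len])
        # Pool DFT bins into equal-width bands (any leftover top bin is ignored)
        # and take the log of each band's energy.
        width = len(mags) // n_bands
        bands = []
        for b in range(n_bands):
            energy = sum(m * m for m in mags[b * width:(b + 1) * width])
            bands.append(math.log10(energy + 1e-12))
        columns.append(bands)
    lo = min(min(c) for c in columns)
    hi = max(max(c) for c in columns)
    span = (hi - lo) or 1.0
    # Rescale to 0..top and flip so that low frequencies end up in the bottom row.
    return [[round(top * (col[b] - lo) / span, 1) for col in columns]
            for b in reversed(range(n_bands))]
```

Each row can then be printed in the same style as the example with `"|".join(f"{v:.1f}" for v in row)`. The samples themselves could come from Python's standard `wave` module, which is not shown here.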

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


