|
Aside: here is a Speech Recognition joke of the previous millennium -
A smart programmer went to a college classroom and proudly claimed that "My speech recognition software is so advanced that it can run voice commands on my DOS machine; you are free to test it now", and ran it. Immediately, a smarter student from the last bench shouted - "FORMAT C COLON ENTER".
It is left to your imagination about what happened next.
modified 19-Nov-23 20:27pm.
|
|
|
|
|
somewhat off topic but perhaps amusing though you may have heard this story previously . re/ language translation early technology English to Russian and back again "The Spirit is willing but the flesh is weak ." -> Russian -> English "The vodka is strong but the meat is rancid ."
|
|
|
|
|
"Out of sight, out of mind" -> Russian -> English "Invisible insanity".
Freedom is the freedom to say that two plus two make four. If that is granted, all else follows.
-- 6079 Smith W.
|
|
|
|
|
Yes. A Professor in the School of Automation at the Indian Institute of Science, Bengaluru, by name Prof. M R Chidambara had told this, sometime in the 1980's. I've heard it directly from him.
|
|
|
|
|
There used to be "services" on the net where you could set up a list of languages, such as English -> Russian -> Greek -> German -> English, and give it a text that would be passed through the specified series of translations. At least one of the services could even iterate the sequence until the result was stable (or in some cases, oscillated between two alternatives).
I saved printouts of a few such iterations in my scrapbook, but didn't save the URL. It most likely would be dead today anyway.
Does anyone know of any such service in existence today?
|
|
|
|
|
For quite a few years (it has been fixed now), Google Translate claimed that Norwegian 'postoppkrav' (charge on delivery) would translate to Swedish 'TORSK' (codfish).
Regardless of source and target language, everything was first translated to English, and then from English to the target language. So the Norwegian word for charge on delivery became COD, an COD in Swedish, maintaining the capitalization, is TORSK. Both steps make perfect sense.
Google also could translate English number to French: Forty - quarante, fortyone - quarante-et-un, fortytwo - 42, fortythree - 43 ... It was a mystery to me why it stopped at 41, and not at some "round" number. Maybe it was because "42" has an iconic value.
But we are sidetracking from the subject "Speech to text".
|
|
|
|
|
The next thing was another person jumping up, yelling "Yes" to answer the question "Are you sure?"
This was regularly claimed to be a "true" story from Microsoft's first demonstration of their text recognition. Lots of people did believe that the story was true. In Norwegian, we have a way of speech that goes "Well, if it ain't true, it sure is a good lie!"
|
|
|
|
|
Amarnath S wrote: A smart programmer went to a college classroom and proudly claimed that "My speech recognition software is so advanced that it can run voice commands on my DOS machine; you are free to test it now", and ran it. Immediately, a smarter student from the last bench shouted - "FORMAT C COLON ENTER". That says nothing of the quality of the medium in which someone delivered that command. It could've been done with a keyboard just as easy, rendering the intended point moot. I thought this dude was supposed to be smart in the example?
Jeremy Falcon
|
|
|
|
|
Obligatory xkcd : xkcd: Listening
"A little song, a little dance, a little seltzer down your pants"
Chuckles the clown
|
|
|
|
|
xkcd: Listening[^]
I have done something similar to this at one place I knew they had Alexa. It didn't work (I suppose I used the wrong formulation or Amazon changed the way to do it), but the owner got ing frightened and almost bans me from the house. The other guests were rofling for half an hour.
M.D.V.
If something has a solution... Why do we have to worry about?. If it has no solution... For what reason do we have to worry about?
Help me to understand what I'm saying, and I'll explain it better to you
Rating helpful answers is nice, but saying thanks can be even nicer.
|
|
|
|
|
If you're determined to not use your hands, you can always have macros/code snippets to handle the parts that a program wouldn't get correct and have voice dictation run those. Then you can use the normal functionality for the parts that will. Technology is a long way away from making this a worthwhile pursuit though. You'd be better off having ChatGPT code your crap and you using text to speech to give it prompts.
Jeremy Falcon
modified 20-Nov-23 10:56am.
|
|
|
|
|
|
Given enough time, anything is possible. Your code or someone else's?
I use a dictionary to validate every word in my text-to-speech program.
I use "markup" to indicate words that need to be spoken via phonetics.
RecognizedWordUnit.Pronunciation Property (System.Speech.Recognition) | Microsoft Learn
"Before entering on an understanding, I have meditated for a long time, and have foreseen what might happen. It is not genius which reveals to me suddenly, secretly, what I have to say or to do in a circumstance unexpected by other people; it is reflection, it is meditation." - Napoleon I
|
|
|
|
|
Gerry Schmitz wrote: I use a dictionary to validate every word in my text-to-speech program. A least that can give a recognition quality comparable to word-by-word translation from one language to another, with no concern about context or grammar.
(I suspect that you intended to write "... in my speech-to-text program". If you really meant text-to-speech, that is a different, although related, problem. Are you then referring to a pronunciation dictionary? How do you handle homographs?)
|
|
|
|
|
Thanks! One of those where you read, but register something else.
Yes; I was thinking "Text to speech".
I use a dictionary to scan 3rd party text for words not in the dictionary (mostly English; including well known proper names). Spelling mistakes. Wrong "title case". Weird punctuation. Initials. "Item numbers". Abbreviations. Things that will cause issues with the speech engine. I run it through my parser until it "speaks" well; patching or adding "markup" as I go.
I have programmed a "speech to menu item".
"Before entering on an understanding, I have meditated for a long time, and have foreseen what might happen. It is not genius which reveals to me suddenly, secretly, what I have to say or to do in a circumstance unexpected by other people; it is reflection, it is meditation." - Napoleon I
|
|
|
|
|
It is possible. Just not very viable if other solutions exist.
So with someone that is mobility limited it can be done.
But for someone that can type, even just with a couple of fingers then that is how they should do it.
I used the following with google and found multiple solutions.
typing with speech to text code writer
|
|
|
|
|
jschell wrote: So with someone that is mobility limited it can be done. I have been doing some work with visually handicapped youth. You would be surprised to see how tolerant they are with their tools, learning very fast which mistakes the tools make. Text that looks gibberish to me makes perfect sense to them. Like "It always confuses 'I scream' and 'ice cream', that's no problem". They have no problems with, say, a course named "LOL Introduction to programming".
Handling zillions of such misinterpretations is like a "survival technique" for them, as way to manage even with mediocre tools. The negative side of it is that they do not care to report the problems to the programmers, so the tools can be improved. When I 'catch' them with such problems and ask them "Don't you want to report it, so it can be fixed?", the answer usually is something in the direction of "Naaah ... I understand what is meant. It isn't necessary."
IF we, as users (whether we are programmers or not), really should be much more eager to report bugs and problems to the developers. That is the best way to have the tools improved.
As programmers we know what information a software developer needs of information, and in which format. Some years ago, I received a Christmas Greeting from one developer of a tool I was using professionally: He wanted to express his gratitude for all the error reports I had delivered through the year: Always clear, to the point, with all unnecessary parts shaved off. He really wished that other users could learn to provide similar error reports
|
|
|
|
|
Was discussing the concept of "skinning"; putting material over a mesh. Or the way EV's have crate engines and anyone can now make a car and "skin" it into a super car.
AI kept rejecting me as a fugitive from The Silence of the Lambs.
"Before entering on an understanding, I have meditated for a long time, and have foreseen what might happen. It is not genius which reveals to me suddenly, secretly, what I have to say or to do in a circumstance unexpected by other people; it is reflection, it is meditation." - Napoleon I
|
|
|
|
|
At my previous job, we often tried to remove faces from the bodies.
CI/CD = Continuous Impediment/Continuous Despair
|
|
|
|
|
Undertaker?
Will Rogers never met me.
|
|
|
|
|
more like Hannibal Lecter
CI/CD = Continuous Impediment/Continuous Despair
|
|
|
|
|
I'm dying for code generator ideas to leverage my improved Slang/Deslang technology.
I've written enough parsers and lexer generators by now.
Does Microsoft already produce a JSON entity framework type thing that gives you typed access to JSON data?
Edit: NVM, I finally found one. Microsoft didn't write it, but it's much more ambitious than anything I'd come up with.
Any other ideas?
Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix
modified 19-Nov-23 12:28pm.
|
|
|
|
|
I'd convert (load) JSON data to entity classes and use Entity Framework.
VS has XML to classes. Haven't looked for "JSON to classes" since I haven't needed it.
Later: Just looked. VS has JSON to classes under Paste Special.
"Before entering on an understanding, I have meditated for a long time, and have foreseen what might happen. It is not genius which reveals to me suddenly, secretly, what I have to say or to do in a circumstance unexpected by other people; it is reflection, it is meditation." - Napoleon I
|
|
|
|
|
God, you bang on about some rubbish
|
|
|
|
|
Gosh, and yet I'm not the one complaining for no reason when I could just keep scrolling.
Or creating an account just to say that. Get a job.
Check out my IoT graphics library here:
https://honeythecodewitch.com/gfx
And my IoT UI/User Experience library here:
https://honeythecodewitch.com/uix
|
|
|
|