Written by TeyeDoubleGuhRrr the 29 Feb 08 at 21:08.
Category: Others.
Related project:
Nothing/Others.
Status: Already implemented
Rationale
Having the ability to control the Ubuntu / Kubuntu interface via voice commands and the ability to dictate into applications would contribute immensely to the accessibility of the product.
Currently the Sphinx project of CMU (http://cmusphinx.sourceforge.net/html/cmusphinx.php) is one of the better known open-source speech recognition projects. Perhaps working with them to enhance their functionality would be beneficial to the Ubuntu meme as a whole.
I voted in favor, even though gnome-voice-control already exists and is available in the ubuntu repositories. However everyone who is interested in this functionality should go to http://www.voxforge.org/ and submit speech.
Without enough (GPL) speech speech recognition on linux will simply not happen even though there are many people out there who'd like to use speech recognition and there are programs out there too.
We need training data!!! I submitted hours myself, but we need many voices to make it 'speaker independent' and lots of speech per speaker helps as well. This is not something ubuntu can do for you without your speech.
It would be really great, when you can use (your own voice) for starting programms like
firefox or internet (and firefox will be starten)
amarok next (and in amarok the next song will be played)
or a nice feature in impress, okular, evience, ...
a presentation will be starten and when i say: next slide the next slide or side will be come.
I`d like to use my voice like shortcards. at the start the programm ask you how to start the programms and the programm also ask you some often used words (like next, ...) and in the programs you can put alternative words for next. like you can make it with shortcards in kde applications.
I`think it will be realy nice, when you have an presentation and when you say next slide the computer "works" for you.
What happens when I want to issue an administrative command that requires the root password via voice control? I don't want to say my password out loud for everyone to hear, is there a way to implement security features into voice recognition so that if someone does hear me say my password or passphrase they won't be able to use it to gain administrative access?
One idea would be to bind a separate voice command string to the root password, i.e when I say "Alpha 8 Gecko" the system will recognize it as the voice version of the root password. So basically you would have 2 different root passwords, one you can type in and one you say.