Latest revision as of 18:07, 18 December 2023
Tools
Name | Dictation? | Voice commands? | Dictate in browser | Dictate in any program | Info |
---|---|---|---|---|---|
Braina | Yes | Yes | Yes | Dictate into third party software and websites, fill web forms and execute vocal commands. | |
Caster | | Yes | No | No | Caster (built on the Dragonfly framework) |
Click by Voice (by mdbridge) | No | Yes | No | No | Chrome extension allowing activating links & other HTML elements w/ voice commands |
Dragon Professional Individual (DPI) | Yes | Yes | Yes | Yes | |
Dragon Professional Anywhere | Yes | Yes | Yes | Yes | |
DragonUtilities | | | | | Add-Ons for Dragon Speech Recognition |
KnowBrainer | No | Yes | No | No | |
LilySpeech | Yes | Limited | Yes | Yes | |
LipSurf | Yes | Yes | Yes | No | Google Chrome extension that adds commands and (if you pay) dictation inside Chrome. An interesting command called ‘Tags’ numbers all clickable UI elements in Chrome so you can click on them by voice, similar to ‘Show numbers’ in KnowBrainer 2017 and a few other programs that offer the same functionality. Seems to use Google’s speech recognition engine. |
Otter Voice Meeting Notes | Yes | No | Yes | No | ‘Otter.ai frees you from taking notes at meetings, by automatically joining selected meetings, and producing live transcriptions that you and other participants can annotate and highlight in real time.’ |
SpeechMagic | Yes | Example | Originally developed by Philips and later acquired by Nuance Communications. Medical industry focus according to Frost & Sullivan. Standalone or embedded. | ||
Speechnotes | Yes | No | Yes | No | |
Speech Productivity (SP) | Yes (needs Dragon installed) | Yes | No | Yes, via special ‘Dictation Box’ (using Dragon) | Offers a much improved Dictation Box. |
SpeechTexter | Yes | Yes | |||
Talon (voice command and dictation software) | Yes | Yes | Yes | Yes | The developer built his own speech engine (based on Facebook technology). His model, called ‘Conformer D + Whisper’, is almost as good as Dragon for regular dictation and much better than Dragon for voice commands. After 10 years of battling with Dragon, I think I may have finally found a Dragon killer! |
Tazti | | Yes | | | Create speech command profiles to play PC games and control applications – programs. Create speech commands to open files, folders, webpages, applications. Windows 7, Windows 8 and Windows 8.1 versions. |
Vocola / Unimacro | No | Yes | No | No | Open source alternative to KnowBrainer 2017 (see above). Commands stored as easy-to-edit text. |
VoiceAttack | No | Yes | No | No | |
VoiceBot | |||||
VoiceComputer | | | | | Add-on for Dragon |
VoiceMacro | No | Yes | No | No | |
Windows Speech Recognition (WSR) / Cortana | Yes | Limited | Yes | Yes | Crap, as usual, from Microsoft. |
Microsoft's new Voice access and Voice typing | Yes | Limited | Yes | Yes | Finally, Microsoft’s built-in speech recognition is improving. Microsoft recently bought Nuance (makers of Dragon), and I suspect the new ‘Voice Typing’ in Windows is based on Nuance technology, as the quality is much better than the old WSR engine. Commands are extremely limited, and it stops listening the second you finish speaking, but the recognition is basically already as good as Dragon, and there is no need to install anything on your computer. |
External links
- https://en.wikipedia.org/wiki/List_of_speech_recognition_software
- https://www.knowbrainer.com/forums/forum/ (One of the best places to learn about dictation, voice commands, etc.)
- Voice Recognition: a Brief Comparison of Dictation Extensions for Google Chrome
Still processing
- https://github.com/mozilla/DeepSpeech
- https://kaldi-asr.org/
- https://github.com/julius-speech/julius
- https://github.com/facebookresearch/wav2letter
- https://github.com/PaddlePaddle/DeepSpeech
- https://github.com/NVIDIA/OpenSeq2Seq
- https://github.com/pytorch/fairseq
- https://alphacephei.com/vosk/
- https://github.com/athena-team/athena
- https://espnet.github.io/espnet/
- https://fosspost.org/open-source-speech-recognition-2020/