Dictation and voice command tools: Difference between revisions

Revision as of 15:34, 18 December 2023

Please note that this page is a bit of a mess and needs updating.

Tools

Name	Dictation?	Voice commands?	Dictate in browser	Dictate in any program	Info
Braina	Yes	Yes	Yes		Dictate into third party software and websites, fill web forms and execute vocal commands.
Caster		Yes	No	No	Built on the Dragonfly framework.
Click by Voice (by mdbridge)	No	Yes	No	No	Chrome extension for activating links and other HTML elements with voice commands.
Dragon Professional Individual (DPI)	Yes	Yes	Yes	Yes
Dragon Professional Anywhere	Yes	Yes	Yes	Yes
DragonUtilities					Add-Ons for Dragon Speech Recognition
KnowBrainer	No	Yes	No	No
LilySpeech	Yes	Limited	Yes	Yes
LipSurf	Yes	Yes	Yes	No	Google Chrome extension that adds commands and (if you pay) dictation inside Chrome. Has an interesting command called ‘Tags’, which numbers all clickable UI elements in Chrome so you can click on them by voice. Similar to ‘Show numbers’ in KnowBrainer 2017 and a few other programs that offer the same functionality. Seems to use Google’s speech recognition engine.
Otter Voice Meeting Notes	Yes	No	Yes	No	‘Otter.ai frees you from taking notes at meetings, by automatically joining selected meetings, and producing live transcriptions that you and other participants can annotate and highlight in real time.’
SpeechMagic	Yes		Example		Formerly owned by Philips; acquired by Nuance Communications. Medical industry focus, according to Frost & Sullivan. Standalone or embedded.
Speechnotes	Yes	No	Yes	No
Speech Productivity (SP)	Yes (needs Dragon installed)	Yes	No	Yes, via special ‘Dictation Box’ (using Dragon)	Offers a much improved Dictation Box.
SpeechTexter	Yes		Yes
Talon (voice command and dictation software)	Yes	Yes	Yes	Yes	The developer has built his own speech engine (based on Facebook technology). His model, called ‘Conformer D + Whisper’, is almost as good as Dragon for regular dictation, and much better than Dragon for voice commands. After 10 years of battling with Dragon, I think I may have finally found a Dragon killer!
Tazti		Yes			Create speech command profiles to play PC games and control applications and programs. Create speech commands to open files, folders, web pages, and applications. Available for Windows 7, Windows 8, and Windows 8.1.
Vocola / Unimacro	No	Yes	No	No	Open source alternative to KnowBrainer 2017 (see above). Commands stored as easy-to-edit text.
VoiceAttack	No	Yes	No	No
VoiceBot
VoiceComputer					add-on for Dragon
VoiceMacro	No	Yes	No	No
Windows Speech Recognition (WSR) / Cortana	Yes	Limited	Yes	Yes	Crap, as usual, from Microsoft.
Windows Voice Access (in Windows 11)	Yes	Limited	Yes	Yes	Finally, Microsoft’s built-in speech recognition is improving. Microsoft recently bought Nuance (makers of Dragon), and I suspect the new ‘Voice Typing’ in Windows is based on this, as the quality is much better than the old WSR engine. The commands are extremely limited, and it stops listening the second you finish speaking, but the recognition is basically already as good as Dragon, and there is no need to install anything on your computer.

External links

https://en.wikipedia.org/wiki/List_of_speech_recognition_software
https://www.knowbrainer.com/forums/forum/ (Best place to learn about dictation, voice commands, etc.)
Voice Recognition: a Brief Comparison of Dictation Extensions for Google Chrome

Still processing:
https://github.com/mozilla/DeepSpeech
https://kaldi-asr.org/
https://github.com/julius-speech/julius
https://github.com/facebookresearch/wav2letter
https://github.com/PaddlePaddle/DeepSpeech
https://github.com/NVIDIA/OpenSeq2Seq
https://github.com/pytorch/fairseq
https://alphacephei.com/vosk/
https://github.com/athena-team/athena
https://espnet.github.io/espnet/
https://fosspost.org/open-source-speech-recognition-2020/

@@ Line 1: / Line 1: @@
 __TOC__
   '''Back to''': [[Software]]
-== Tools ==
+* Please note that this page is a bit of a mess and needs updating.
+=== Tools ===
 {| class="wikitable"
 !Name
@@ Line 10: / Line 13: @@
 !Info
 |-
-|Braina
+|[https://www.brainasoft.com/braina/ Braina]
 |Yes
 |Yes
@@ Line 17: / Line 20: @@
 |Dictate into third party software and websites, fill web forms and execute vocal commands.
 |-
-|Caster
+|[https://caster.readthedocs.io/en/latest/ Caster]
 |
 |Yes
@@ Line 24: / Line 27: @@
 |Caster (built on the Dragonfly framework)
 |-
-|Click by Voice (by mdbridge)
+|[https://chromewebstore.google.com/detail/click-by-voice/dleiijbbjajmfcaiiiadgjpgfjmfdfen Click by Voice] (by mdbridge)
 |No
 |Yes
@@ Line 31: / Line 34: @@
 |Chrome extension allowing activating links & other HTML elements w/ voice commands
 |-
-|Dragon Professional Individual (DPI)
+|[https://www.nuance.com/en-gb/dragon/business-solutions/dragon-professional.html Dragon Professional Individual] (DPI)
 |Yes
 |Yes
@@ Line 52: / Line 55: @@
 |Add-Ons for Dragon Speech Recognition
 |-
-|KnowBrainer 2017 Command Utility
+|[https://www.knowbrainer.com/ KnowBrainer]
 |No
 |Yes
@@ Line 59: / Line 62: @@
 |
 |-
-|LilySpeech
+|[https://lilyspeech.com/ LilySpeech]
 |Yes
 |Limited
@@ Line 66: / Line 69: @@
 |
 |-
-|LipSurf
+|[https://www.lipsurf.com/ LipSurf]
 |Yes
 |Yes
@@ Line 73: / Line 76: @@
 |Google Chrome extension that adds commands and (if you pay) dictation inside Chrome. Has an interesting command called ‘Tags’, which numbers all clickable UI elements in Chrome so you can click on them by voice. Similar to ‘Show numbers’ in KnowBrainer 2017 and a few other programs that offer the same functionality. Seems to use Google’s speech recognition engine.
 |-
-|Otter Voice Meeting Notes
+|[https://otter.ai/ Otter Voice Meeting Notes]
 |Yes
 |No
@@ Line 87: / Line 90: @@
 |Formerly owned by Philips; acquired by Nuance Communications. Medical industry focus, according to Frost & Sullivan. Standalone or embedded.
 |-
-|Speechnotes
+|[https://speechnotes.co/ Speechnotes]
 |Yes
 |No
@@ Line 94: / Line 97: @@
 |
 |-
-|Speech Productivity (SP)
+|[https://www.speechproductivity.eu/ Speech Productivity] (SP)
 |Yes (needs Dragon installed)
 |Yes
@@ Line 115: / Line 118: @@
 |The developer has developed his own speech engine (based on Facebook technology). His model, called ‘Conformed D + Whisper’ is almost as good as Dragon for regular dictation, and much better than Dragon for voice commands. After 10 years of battling with Dragon, I think I may have finally found a '''Dragon killer'''!
 |-
-|Tazti
+|[https://www.tazti.com/ Tazti]
 |
 |Yes
@@ Line 122: / Line 125: @@
 |Create speech command profiles to play PC games and control applications – programs. Create speech commands to open files, folders, webpages, applications. Windows 7, Windows 8 and Windows 8.1 versions.
 |-
-|Vocola / Unimacro
+|[https://vocola.net/ Vocola] / [https://vocola.net/v2/UnimacroActions Unimacro]
 |No
 |Yes
@@ Line 129: / Line 132: @@
 |Open source alternative to KnowBrainer 2017 (see above). Commands stored as easy-to-edit text.
 |-
-|VoiceAttack
+|[https://voiceattack.com/ VoiceAttack]
 |No
 |Yes
@@ Line 136: / Line 139: @@
 |
 |-
-|VoiceBot
+|[https://www.voicebot.net/ VoiceBot]
 |
 |
@@ Line 143: / Line 146: @@
 |
 |-
-|VoiceComputer
+|[https://voicecomputer.com/ VoiceComputer]
 |
 |
@@ Line 150: / Line 153: @@
 |add-on for Dragon
 |-
-|VoiceMacro
+|[https://www.voicemacro.net/ VoiceMacro]
 |No
 |Yes
@@ Line 164: / Line 167: @@
 |Crap, as usual, from Microsoft.
 |-
-|Windows Voice Typing
+|[https://support.microsoft.com/en-us/topic/get-started-with-voice-access-bd2aa2dc-46c2-486c-93ae-3d75f7d053a4#:~:text=Voice%20access%20is%20a%20new,author%20email%20using%20your%20voice. Windows Voice Access] (in Windows 11)
 |Yes
 |Limited