Windows 2 Apples Episode 24
Mar 14th, 2008 by inetsynch
Audio player at end of text.
MacSpeech Dictate, Nuance NaturallySpeaking
In past episodes, I bemoaned the lack of good speech to text recognition software for the Apple. At the time I began writing this blog and podcast the only product available for Intel Macintosh computers was iListen produced and marketed by MacSpeech. I had been accustomed to using Dragon NaturallySpeaking by Nuance and the excellent speech-recognition and translation software bundled with all flavors of Vista. In contrast, I saw iListen as a flawed product, which in my mind, should have been discontinued years ago.
In January 2008, MacSpeech announced a licensing agreement with Nuance to market a port of the Nuance speech engine to Intel Macintosh computers.MacSpeech offered many of its iListen customers what they referred to as a cross grade from iListen to Dictate for $79. The list price for MacSpeech Dictate is $200 US. Given the very sad performance of iListen, I felt that MacSpeech should have made the “Cross Grade” available at no charge. I suspect significant royalty obligations to Nuance, may have made it impossible for MacSpeech to do the right thing and absorb the cost of providing free updates to iListen customers.
MacSpeech Dictate is shipped with two CDs, one to install the actual application and the other to import the appropriate vocabulary tailored to your selected language and country. Installation was typically Mac, in that once the CD had been recognized by my computer, I simply moved the MacSpeech icon to the applications folder. After the application is installed a pop-up requests the insertion of the data disk.
I realized of course that when I tried to insert the data desk that the program disk had not been automatically ejected. I felt a bit of a blush from my embarrassment but realized that my instinctive move to insert the second required disk was born out of habit in working with Windows machines that typically eject disks before requesting the insertion of the additional CDs or DVDs.
After the software and data have been installed I was prompted to create a user profile which requires me to calibrate my USB interfaced headset and read approximately 5 minutes of text. Although the Windows Dragon NaturallySpeaking application can actually be used without any training at all the five minutes of training required by Dictate is innocuous and a huge improvement over the two week training period foisted upon iListen users.
Once installed and “trained”, I immediately noted accuracy had been dramatically improved over that seen in iListen and the equal of what I am accustomed to when using the Windows version of Dragon NaturallySpeaking.
The Dictate Quick-start flyer repeatedly refers to the “amazing accuracy” you should expect when using the product. Those who have struggled with trying to make iListen work will be blown away. However, those using other products such as Dragon and Vista Speech Recognition will be less impressed but breathe a sigh of relief that Nuance finally ported their speech engine to the Macintosh.
The only documentation shipped with Dictate is a three page flyer detailing installation and listing some common voice commands. A separate pop-up window displaying additional commands is launched when Dictate is activated. Given the complexity and novelty of speech to text software the lack of comprehensive documentation is a glaring omission that I hope is addressed quickly.
There is essentially no usable correction scheme which in my opinion severely limits its usefulness and certainly makes it impractical for many who are physically disabled. It seems to have some of the same quirks I noted in iListen. The Do Select, Do Delete, Insert before, etc., commands become useless if the text is manually edited. Manual edits seem to confuse Dictate and it loses track of dictated text. Several times complete paragraphs were deleted by Dictate when I was seeking to modify one word.
Additionally, correcting text manually seems to cause previously deleted text to randomly reappear when dictation is resumed. Once, after manually modifying text, the application stopped responding. The Dictate application showed that my speech was being monitored but no text was displayed. The only way I was able to resume dictation was by shutting down Dictate and restarting.
Interestingly, Chuck Rogers, the Chief Evangelist at MacSpeech blames the Mac operating system for their difficulties in creating a correction scheme that works at least as well as that available in Windows products. He goes on to suggest that Nuance has had similar problems in developing a workable correction scheme for Windows but have been able to resolve them because of the many additional years to iron out the kinks. Is he suggesting Windows is indeed a more mature OS offering solutions not currently available to Apple users? This would make for an interesting Windows versus Apple commercial if Microsoft were so inclined.
David Pogue, in his last review of Dictate, suggested several bugs would need to be squashed before shipping, however, it appears that more work needs to be done. Although accuracy it is greatly improved a workable correction scheme needs to be implemented before it can truly compete with the Windows version.
All of this said, Dictate is light years ahead of iListen and with improved correction options and the extermination of a few bugs, Dictate may finally offer Mac users a speech to text technology competitive with that we have grown accustomed to in the Windows world.
Link to youtube demo of MacSpeech Demo illustrating limited edit commands and extraneous text bug.
Dictated using MacSpeech Dictate, recorded using M-Audio and MixCraft 3 and tags edited with JetAudio.
NOTE:
Chuck Rogers disagrees with my characterization of his comments concerning issues in implementing speech technology on the Apple platform. He disclaims any allusion to Mac OS X as being “less robust or functional than Windows”. Please see his comment to get his insight into the technical issues facing MacSpeech.
Standard Podcasts [8:39m]: Play Now | Play in Popup | Download | Embeddable Player | Hits (592)















Sam:
I certainly didn’t “blame” the Mac OS and in no way did I infer Mac OS X is somehow less robust or less functional than Windows. In fact, quite the opposite is true. It is much easier to develop for Mac OS than Windows. You totally misunderstood my comments. Here’s the deal:
Windows hasn’t changed much in the way it handles text since Windows 95/NT, which happened about 14 years ago. Dragon has been around that entire time and has had all those years to work out how to deal with all the different methodologies used by developers of SELECTED applications.
MacSpeech Dictate, by contrast, is in its infancy. The same methods used by Dragon for manipulating text in an application don’t apply to Mac OS X. It is in no way because Mac OS X is in any way not as capable. It is simply that the methods used to manipulate in text under Windows simply don’t work with Mac OS X, which means all of that has to be written from scratch.
Chuck Rogers, Chief Evangelist MacSpeech, Inc.
Wow, what an incredible write-up! In my experiences with iListen, Dragon Naturally Speaking, and Dictate, I agree with you that
“Once installed and “trained”, I immediately noted accuracy had been dramatically improved over that seen in iListen and the equal of what I am accustomed to when using the Windows version of Dragon NaturallySpeaking.”
I also agree with your comment about it not having a useable correction scheme limiting the usefulness of the program. In my experiences working with Dictate, I still find myself occasionally at the keyboard, fixing leading space errors, or with my mouse selecting text for Dictate to correct. Hopefully they will come out with the correction features like Dragon in the not too distant future.
Sam,
What an excellent video. I commented on Youtube on how much I enjoyed it and linked to it at my site. Concerning the deleted or cut text bug, you are correct. If using Dictate to move around a document and/or cut and paste text by voice command, this is certainly a problem. While the voice recognition feature of Dictate is great, its ability to do the above could pose serious problems for users unable to do the above by mouse or keyboard. Hopefully this will soon be corrected in a future release. Great work!
Thanks for your kind comments. We are going to follow up with a few more and perhaps one or two comparing Vista Speech with MacSpeech Dictate.
Sam Windows 2 Apples
Once again, a great show. I wish more product were as thoghtful and upfront as yours. Too often, people (myself included) let personal opinion get in the way of a true product review. I’m a little in replying, but just wanted to share my thoughts with you.
I have played arount a bit more Vista’s speech recognition. It works well, but have found that it is less accurate when you have a fan running about two feet from your microphone. Oh well.
Keep up the good work!
Thanks for the kind remarks. I still find the Vista Speech engine a well kept secret. It has a few major issues I hope Microsoft will address posthaste. The most pressing is the lack of support for all Microsoft 2007 applications.
The glaring lack of support for Messenger and Excel make Vista a less attractive option for disabled. I understand Bill Gates is committed to Voice Control and conversion technologies and have fingers crossed the issues with some key Microsoft Applications will be addressed soon.
Sam
I had not realized there was such a compatibility problem with Office 2007. That is disappointing, especially considering how tight they are otherwise. I guess they spent more time making it look pretty ‘Vista-like’ than they did in other areas. Thanks for the heads up.
Microsoft’s Live Search for Windows Mobile has excellent voice recognition. I have a Motorola Q with WinMo 5 and installed the live search. I used it on a trip recently and the accuracy was pretty impressive.