Can you hear me now?

Improvements prep voice-recognition technology for a wider range of uses

The computer on the original "Star Trek" TV series set the bar awfully high for voice recognition. It not only understood human speech — even slang — but also replied clearly and with personality. Although earthbound voice-recognition technology is rapidly improving and is now useful for many office tasks, it has not yet attained the standard the starship Enterprise set.

Indeed, the technology will need vastly improved artificial intelligence and a more sophisticated speech-recognition engine before it can match the performance of the USS Enterprise's system.

The good news is that the latest versions of Dragon NaturallySpeaking and IBM's ViaVoice do a pretty good job of figuring out what you're saying. As those products improve, they are finding a broader range of uses.

Transcription for the medical and legal fields continues to be one of the most frequent applications of voice-recognition technology. Enabling accessibility for users with disabilities runs a close second. Although most organizations that use a computer-assisted transcription process won't totally replace manual labor, they often use employees in quality assurance and editing roles rather than as professional transcribers.

For this review we looked at Dragon NaturallySpeaking Professional Version 8 and IBM ViaVoice Pro USB Edition Version 10. Both products come with a high-quality noise-canceling headset from Andrea Electronics, although they use different models. Dragon NaturallySpeaking comes with Model 91, while ViaVoice includes Model 61.

Both products offer similar speech-to-text capabilities, although their target markets clearly differ. Dragon NaturallySpeaking comes with a number of features specifically for enterprises, including the ability to store voice profiles on a central server and to transcribe audio files from digital recorders or any handheld device that supports the Microsoft PocketPC operating system. ViaVoice focuses more on individual users, providing most of the same functions as Dragon NaturallySpeaking without the enterprise extras.

Getting started

During the setup process, both packages require you to configure the software to match the hardware, such as headsets or microphones, and specific users. During the first step, you speak into the microphone to set the audio level.

Then you must train the algorithms to match your speech. To complete this learning process, you read large portions of text aloud so the software can learn to recognize your speech patterns. Lastly, the program searches your computer for text files or e-mail messages, which helps the software learn your writing style.

I first tried to train Dragon NaturallySpeaking in a room with a high level of ambient noise. The first step of the calibration process adjusts the volume level, and the second step adjusts for the noise level. I was able to get past the first step but not the second in that room. Moving to a quieter room made all the difference, and the process proceeded without incident.

Dragon NaturallySpeaking also supports input from external recording devices, including PocketPCs. Recognition of audio from one of those devices is not as accurate as recognition of audio from a good noise-canceling headset. To alleviate this problem, you can spend 15 minutes reading a long passage from one of eight literary works when you set up the PocketPC as an input device. Be careful which passage you select: I had trouble focusing, and not laughing, when reading "Dogbert's Top Secret Management Handbook."

IBM's ViaVoice product uses a similar setup process. I had no problem completing the configuration steps in a quiet room. I tried both products in the noisier room — noise created by a window air conditioner and occasionally by a high-speed server fan — after the training session and both performed acceptably.

In use

Both products instruct you to speak in your normal tone of voice and at the pace you would typically use. They also encourage you to pronounce your words clearly and distinctly to help the recognition process.

You need to become accustomed to watching text appear on the screen while speaking. Depending on your configuration and how fast you talk, you could speak an entire sentence before anything shows up on the screen.

To test the speech-recognition software, I used a second-grade grammar textbook and read a paragraph with a number of homonyms in it.

Both programs did a pretty good job of recognizing the difference between words such as "see" and "sea" using the context of the sentence. Dragon NaturallySpeaking couldn't seem to understand the word "homonym" while ViaVoice picked it right up. For other words, ViaVoice had problems while Dragon NaturallySpeaking got them right.

If the software misunderstands a word or phrase, you can correct the mistake so that it won't happen again. ViaVoice uses a correction pop-up menu activated by selecting the wrong word and speaking "Correct < text >." The menu then presents a list of possible replacements. If you find the right word in the list, you say, "Pick < n >" to select that word. You can also type in the correct word if the program can't figure it out.
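For readers who think in code, the "Correct"/"Pick" exchange described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not ViaVoice's actual implementation: the function name `apply_correction`, the word-list representation and the 1-based numbering of the replacement list are all hypothetical.

```python
def apply_correction(words, wrong, alternatives, pick):
    """Replace the first occurrence of `wrong` with the chosen alternative.

    Hypothetical sketch: `alternatives` stands in for the pop-up list of
    possible replacements, and `pick` is assumed to be 1-based to mirror
    a spoken "Pick <n>".
    """
    if not 1 <= pick <= len(alternatives):
        raise ValueError("pick is outside the list of alternatives")
    index = words.index(wrong)  # raises ValueError if the word is absent
    # Build a new list so the original dictation is left untouched.
    return words[:index] + [alternatives[pick - 1]] + words[index + 1:]


if __name__ == "__main__":
    dictated = ["I", "can", "sea", "the", "shore"]
    fixed = apply_correction(dictated, "sea", ["see", "sea", "seat"], 1)
    print(" ".join(fixed))
```

The sketch covers only the one-time replacement; how either product remembers a correction so the mistake doesn't recur is not described here and is left out.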

You should remember to add punctuation to your speech when dictating. Both programs recognize keywords such as "period" to mean end the sentence and insert a period. Other phrases such as "new paragraph" instruct the software to end the sentence and start a new paragraph.

To get the software to recognize a keyword as text, you must speak the word as part of a sentence without pausing. There's also a spell mode that lets you spell out license plate numbers or proper names with multiple capital letters, for example.
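The keyword behavior described above can be sketched in a few lines. This is a simplified illustration, not either vendor's recognition logic: it assumes the recognizer hands over a list of utterances separated by pauses, and it treats a keyword as a command only when it is spoken on its own. The function name `render_dictation` is hypothetical, and capitalization is ignored.

```python
# Hypothetical sketch; only the two keywords mentioned in the review are modeled.
PUNCTUATION = {"period": "."}


def render_dictation(phrases):
    """Turn pause-separated utterances into text.

    A keyword spoken alone acts as a command; the same word inside a
    longer utterance is kept as ordinary text.
    """
    paragraphs = [""]
    for phrase in phrases:
        key = phrase.strip().lower()
        if key in PUNCTUATION:
            # End the sentence by attaching the punctuation mark.
            paragraphs[-1] = paragraphs[-1].rstrip() + PUNCTUATION[key]
        elif key == "new paragraph":
            # End the current sentence if needed, then start a new paragraph.
            current = paragraphs[-1].rstrip()
            if current and current[-1] not in ".?!":
                current += "."
            paragraphs[-1] = current
            paragraphs.append("")
        else:
            separator = " " if paragraphs[-1] else ""
            paragraphs[-1] += separator + phrase.strip()
    return "\n\n".join(p for p in paragraphs if p)


if __name__ == "__main__":
    spoken = [
        "the meeting starts at noon",
        "period",
        "bring the report",
        "new paragraph",
        "the sales period ended",
    ]
    print(render_dictation(spoken))
```

In the last utterance, "period" is spoken as part of a sentence without a pause, so the sketch leaves it as a word rather than inserting punctuation.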

Dragon NaturallySpeaking includes a set of tools under the Accuracy Center to add words to your vocabulary or perform additional training. You can add individual words, or you can have the program analyze a document and then add the new words it finds to the software's vocabulary in bulk. Accuracy Center also lets you adjust your microphone settings in case you change environments or hardware.

Both programs use a toolbar that loads at the top of your screen by default. The Dragon NaturallySpeaking toolbar displays a number of color-coded menu items along with the name of the current user and the default input device. ViaVoice uses the toolbar to display what it thinks you said and to communicate error messages if it doesn't understand you.

ViaVoice includes a macro command feature to define new commands to insert special text or automate a particular function. One feature exclusive to ViaVoice is the ability to create a macro template form that you can fill out later.

ViaVoice's documentation uses the example of a form for a doctor's office that always includes patient information, symptoms and diagnoses. Both programs allow you to import and export those custom commands or macros for other users or computers.

Enterprise attention

Dragon NaturallySpeaking includes a number of features intended for enterprise users. For example, it can store user profile information on a server for access from more than one computer.

The professional version of Dragon NaturallySpeaking also includes software for personal digital assistants and digital dictation devices. I tried the software on a Hewlett-Packard iPaq hx2415 and found it to be more than adequate.

The product also supports multiple dictation sources for specific users. But you still must train each dictation device. Once you train the new device, you simply add it as another input device for a specific user. Dragon NaturallySpeaking automatically backs up user speech files after every fifth update, but you can change the frequency.

A Manage Users dialog box lets you choose options for setting backup and restore functions, importing/exporting custom commands, and selecting multiple dictation devices. You set those preferences at an individual computer used by multiple users or on a central file server for roaming users.

Accessibility options

Both programs make it possible to operate a computer virtually hands-free for individuals with physical challenges. The user manuals for Dragon NaturallySpeaking and ViaVoice show how to verbally execute basic Microsoft Windows functions, such as moving the cursor on the screen and clicking the mouse. They also include basic operating instructions for the most popular productivity applications.

Options for the visually impaired are limited to reading text from within a word-processing program or the scratch pad application. Both programs provide a simple scratch pad application that allows you to dictate text, copy it and then paste it into another application.

Bottom line

Don't expect to see "Star Trek"-level speech recognition anytime soon. Although some users have adopted voice recognition to cope with carpal tunnel syndrome or other physical limitations, you won't find a headset or microphone on most desks.

Curiously, this lack of general acceptance seems to have little to do with the technology's performance, as I found in reviewing these two products. Rather, user perception and lack of motivation rank as the two biggest challenges to widespread adoption.

Many people get along fine with the way they use the computer now and don't want or need another input device that has some limitations and takes some customization.

Dragon NaturallySpeaking works well at what it does: text transcription and dictation support. Although it costs more than ViaVoice, it also offers more features and functions to justify the price difference.

Ferrill, based in Lancaster, Calif., has been writing about computers and software for more than 15 years. He can be reached at paul.ferrill@verizon.net.

Dragon NaturallySpeaking 8

ScanSoft
(781) 565-5000
www.scansoft.com

Features: *****
Performance: ****
Ease of use: ****
Price: ***

Price: The government prices for Dragon NaturallySpeaking 8 Professional are $674.60 for single copies and $487.15 each for 150 licenses.

Pros: Dragon NaturallySpeaking provides a good overall experience and lots of features for enterprise deployment. Additional flexibility comes with support for after-the-fact transcription for recordings made with handheld digital dictation devices and Microsoft PocketPCs.

Cons: Users need some training to use the product, and they will have to tweak the voice-recognition engine to customize it. NaturallySpeaking lacks support for text-to-speech other than reading from document files.

IBM ViaVoice Pro USB Edition 10

ScanSoft
(781) 565-5000
www.scansoft.com

Features: ****
Performance: ****
Ease of use: ****
Price: ****

Price: ViaVoice costs $189.99 for one copy.

Pros: ViaVoice did a good job of recognizing the majority of our speech tests, and it delivers all the functions at the individual level of the more expensive Dragon NaturallySpeaking. Other features, such as template fill-in, could be useful in smaller offices.

Cons: ViaVoice doesn't have enterprise features and doesn't support text-to-speech for anything other than documents.
