Showing posts sorted by relevance for query Voice Recognition. Sort by date Show all posts
Showing posts sorted by relevance for query Voice Recognition. Sort by date Show all posts

Friday, June 11, 2010

Talk to Me

I am often asked about “talking computers.” It took me a while before I understood the question was not so much about computers that talk to you (voice out) – but about computers you talk to (voice in).

The concept is pretty fabulous; just tell the computer what to do. Forget about the screen, and the keyboard you can’t see, and pay no attention to that pesky mouse. This is what we all want, isn’t it, to give a command and have it carried out?

It is voice recognition that allows us to tell our phones who to call, ask 411 to give us a number, get schedule and fare information for trains or buses. The Mac OS lets us use some voice commands, Windows has built in a speech recognition feature, and Dragon programs are perceived as pure magic. They all require serious set up, and if you have trouble reading the screen, or operating a computer, you won’t get very far.

There is one person I know – only one --who uses a computer strictly with speech recognition. His name is Ben. He is not able to see the screen and was not previously a computer user. Yes, pretty remarkable, I would agree. What he doesn’t have in visual acuity he compensates for with pure tenacity.

Ben explored the Mac, and the PC, and Dragon – only to learn that they were not solutions for Ben. He came across assistive, third-party software called Guide and with a LOT of help from his family and friends, he began using it successfully to dictate emails and get to some web pages.

While he’s been happy to join the email generation and take a peek into the World Wide Web, Guide is no nirvana. It possesses the key characteristics of assistive, third-party programs – very glitchy and very pricey.

Ben dictated an email to me yesterday, telling me that he is getting ready to replace his computer and wanted to know if there is anything new that would do a better job for him. He thought we should show Guide to Apple and perhaps we could inspire their developers, who have proven to be the best of the best, to build us a comprehensive program that is as accessible, and as simple to operate as Voice Control in the iPhone.

I told Ben everybody wants exactly what he wants, and we will get it…soon. I’m sure Apple, Microsoft, IBM and many, many others are working diligently to make our wishes come true. It’s just not quite ready…yet.

There must be someone else out there, other than Ben, who has successfully integrated accessible speech recognition into their computing.

Talk to me ---tell me what you know.

Friday, February 26, 2010

Use Your Voice 2

I turned my TV on one night last week to settle in for an evening at the Olympics. As I pressed 4 on my remote, to change from NY1 to NBC, I heard a slight little “pop” and my picture was gone. Ugh! I held my breath, hoping it was just a temporary cable, or network, problem that would rectify itself.

I started changing channels, the cable guide at the bottom of the screen, which I can barely read with my nose pressed up against it, was changing to reflect that I was changing channels. The only thing on the screen was the channel guide – no picture, no sound. Ugh!

Lately, I’ve been noticing that many people are afraid of their TV. I’ve seen mothers completely freaked out by the sight of the clicker in the hands of their child, fearing one wrong touch of a button and the TV may be out for days. I’ve seen grown men cry when they accidentally knock the cable off line during a football game and have to call the cable company and have the nice lady walk them through the reconnect. I’ve been told by a friend, in her own home, that she did not know how to turn on “that TV”, then her husband came home and he couldn’t get it on either.

It’s hilarious…when it’s not happening to you.

So there I was turning the TV off and on, over and over, as if to purge the demons from it. When that did nothing, I turned the DVD player on and off a few times – for no good reason, really. Then I looked at the remote up close in my video magnifier, hoping to find a button labeled “fix it,” to no avail.

I had to break down and call the cable lady myself and I was not looking forward to chatting with her. She always asks questions beginning in “Can you see? Can you see the serial number on your cable box? Can you see the teeny tiny symbols in grey on your silver remote? Can you see the television…at all? Ugh.

As I dialed her number, I tried to visualize success, the way our Olympic athletes do. Keep your eye on the prize – TV on! My dread of the cable lady’s questions prompted me to choose the “automated system” for technical help (an option I almost never pick).

Once again, it reminded me how amazing voice recognition programs are when they work. This one, much to my surprise, was like magic. The automated voice never asked, “Can you see?” instead it somehow just took me through the steps one by one, asking a few questions, but mostly made it easy and pleasant to refresh my cable. No impatience, no emotion, no judgment – exactly what you want from a machine.

Within minutes I was reunited with Apolo Anton Ohno, thank goodness.

Monday, February 21, 2011

Bing Go

I was a little devastated, in November, when I learned that my favorite free directory assistance, GOOG 411. was about to be discontinued. What? How could this be? I stuck with GOOG for almost two years and I'd grown to love it.

Turns out GOOG 411 had a hidden agenda. Google was using my 'Long Island accent' to develop other, more advanced, speech recognition programs like Voice Search, Voice Input, and Voice Actions for mobile phones. In order to build a robust speech model they needed to collect thousands of pronunciations, syllables, intonations, and inflections.

The discovery that I was a participant in Google's ambitious project to move voice activated computing to a new level turned my devastation into delight. Among other things, maybe this massive collection of utterances will minimize the number of times I hear-- "Sorry I didn't get that," because the program is not processing my manner of speaking.

Directory assistance is quickly becoming a thing of the past -- it's demise accelerated buy the ridiculous $2 charge per call. Our phones are smart and the options for free information are abundant. A simple Internet search will deliver the phone number or address you need in less time than it takes to ask.

Just as I was learning to live without GOOG one of my best informants hooked me up with Bing 411 (compliments of Microsoft). I immediately got into it. Bing 411 will search by city for business listings and can connect to other directories for weather, movies, sports, news, time, travel, driving directions and even cheap gas. I like it.

Try it, you might like it too...800-BING 411 (800-246-4411).

Tuesday, May 5, 2009

What's It All About

For a while, I have been collecting items in a virtual folder labeled “Easy On The Eyes,” which I often pull from and share with people I encounter everyday. Over the last few years, I have come to realize that this resource filed in my head has relevance that reaches far beyond the population of people, who like me, have a serious vision impairment, well into the general population.

Most of us experience the same degree of difficulty in reading the text on a cell phone screen, or a Blackberry, or an iPod (until recently). Who isn’t reaching for the readers and the pocket magnifiers?

Happy to report, these issues are not going unnoticed. More and more mainstream corporations and makers of widely used electronic and technology devices are stepping into the arena of accessibility (simply means making things easy to deal with). After all, who can ignore a market 100 million strong and growing?

I am delighted to have the opportunity to launch this blog, and share with you the many wonderful developments in ‘vision-friendly technology.’ A good many things that make life easier for me, I've learned about from others, word of mouth, and I am pleased to be able to pass it along. So here we go, blogging our way to many, many times the numbers that good old word-of-mouth can reach.

There is a lot to cover already…and so much to come. Great news like crossing over of the iPod nano and shuffle with voice over for spoken menus) and the Kindle 2’s six font sizes, text-to-speech option and (soon) a big screen version, over-the-counter cell phones with voice recognition and audio output, talking ATMs and Metro Card Vending machines; and how Disney is making theme parks accessible with descriptive narration and GPS, and how IBM sees personal technology developing in the near term. And, more, more, more, more!
Remember, the fact is, when it’s good for people with impaired vision, it’s actually better for everyone!

Friday, January 22, 2010

Make It So

I spoke with a Microsoft executive last year about the inaccessible nature of Speech Recognition in Windows Vista. I was excited by the prospects presented by Microsoft – “You can dictate documents and emails in mainstream applications use voice commands to start and switch between applications, control the operating system, and even fill out forms on the Web.”

I expected it to work. Seemed like a natural for people with impaired vision. Unfortunately, it was not up to the task, and my friend at Microsoft told me that I was expecting “Star Trek Technology” and they were not quite there yet.

All I could think of were the words used by Captain Jean-Luc Picard of Star Trek, when commanding his crew to execute an order --- “Make it so!”
I know, we are not on the Starship Enterprise, but it is 2010 and there are a few things that cannot wait for the next generation:

Speech Recognition is still at the top of my list. I want an easy to access program built right into my computer. We can, after all, get movie tickets, conduct banking transactions, and pay our bills, at this point, just by speaking into the phone. Ford put Sync in cars to dial the phone or find our favorite music. It’s high time our computer can do the same.

The Kindle should make those menus talk!
In order to enjoy the text-to-speech feature, I must find the publication I want to read, but the print is not big enough, so I need it to speak. I can live with the Authors Guild restrictions, but I cannot use a Kindle without spoken menus.

Mobile phones should make their menus talk too! Not such a leap, if Apple could make it happen on a touch screen iPhone, it’s time to make it a build in option in all phones.

Give me access to the GPS that is apparently in mobile phones, at the touch of a button, I want to hear my exact location: You are on the northeast corner of Seventh Avenue at 24th Street.

The DVR (Digital Video Recorder) needs a voice as well. I want to DVR my favorite programs, like everyone else. Then again, will Web TV make the DVR mute?

I want to hear from the bar code scanners, too, in the grocery store, the drug store, the department store. It should just be standard…like the talking ATM.

Make it so.

Friday, February 19, 2010

Use Your Voice

Voice (or speech) recognition, when it works, is a very beautiful thing. It’s all around us, more and more.

I regularly pay my American Express bill by phone (800-IPAY-AXP), verbally giving my account number and stating the amount I want to pay, and which account I would like it deducted from. I find it much easier than writing a check. Voila…bill paid.

The easiest way to get schedules and fares for the Metro North and Long Island Railroad is their telephone information lines. All you do is answer the “lady’s” questions: What station are you leaving from? Going to? When? She can’t yet sell you a ticket, or tell you what track, but she is getting smarter all the time.

I love Fandango for the movies (phone
800-FANDANG). It’s all there for the asking: Do you want to find a theater? Find a movie? Get show times? Buy tickets? I give my AMX number to Fandango, with my voice, and purchase the tickets. At the theater I swipe my card and the tickets are mine.

Could it be simpler? Maybe…but only if they could read my mind.