You are here : Home » IndyBlogs Home

 Subscribe to RSS

« Keen on New Media: Weighing up plebisictory democracy | Main | Today in Politics: More trouble with the banks »

Thursday, 20 November 2008

Cyberclinic: Computer says "Sorry, didn't catch that"

By Rhodri Marsden

In the past few days, British iPhone users have been bellowing into their phones in a range of timbres and accents that make them sound like they're auditioning for a place at Jon Culshaw's Elementary School of Impressionism. That's because Google have just launched a Voice Search facility for their Google Mobile App; it senses when you lift the iPhone to your head, it beeps, awaits for search data to emerge from your mouth - like, I dunno, maybe "BNP postcode search" and then presents the results on the screen for you. That theoretically saves you approximately 10 seconds that you would have spent keying stuff in on-screen, although in practice it only works if you adopt the kind of ludicrous transatlantic accent deployed by TV chef Robert Irvine (oblique reference there, I know, but if you don't indulge in TV cooking shows as much as I do, just replace "Robert Irvine" with "Dick van Dyke".)

It's the latest development in a long and rocky relationship between our voices and technology. Macintosh computers have had some kind of elementary voice recognition software built in since 1993; Windows users had to wait another ten years or so, as while the functionality was included with Windows XP, to use it you had to jump through hoops and install a special SDK that wasn't officially supported. But neither of them were highly developed enough for obvious applications such as dictation; the Mac version did originally allow you to operate menus by holding down a key, then loudly and deliberately saying "Open" or "Copy", but it was such a resource hog that this was abandoned, and only reappeared about 4 years ago with the "Speakable Items" facility in OS 10.3.

Fact is, though, it wasn't really missed by the vast majority of people. We're used to interacting with our computers by pointing an clicking in blissful silence, and while it's briefly amusing to watch them respond to orders, it's only ever going to feel natural if they respond accurately to a casual murmur, regardless of any background noise. Barking loudly at an erratic machine that's unable to discern between accents isn't only stressful and unproductive, it has the capacity to turn the office environment into a bizarre cross between the noisy carriage on a commuter train and an army training ground. Of course, voice recognition technology is developing all the time - driven not least by US defence departments, who appear to place more faith in a computer's ability to activate weapon release systems via a voice command than a pilot's ability to press a button marked "fire".

So at some point we'll undoubtedly be freed from the tyranny of the keyboard and mouse as an input system (with services like SpinVox currently showing the way forward); while those who'd resent turning their music down to write an email probably couldn't care less, there'll be several thousand people with carpal tunnel or RSI who'll be punching the air.

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d8341d0e8d53ef0105360b1c10970c

Listed below are links to weblogs that reference Cyberclinic: Computer says "Sorry, didn't catch that":

Comments

Reading this article has just made me so aware that we have come a long way from the mobile phone to the mobile presence device.

This blog have good information. These information are very lovely and useful.

Sometimes such technology promotes laziness.

Haha I just had to say that the reference to BNP postcode search made me laugh, literally. I like the thought of dictation software but don't have the time to play around with getting one to listen to me properly.

Magnetic is the industry-leader in auto-stereoscopic 3D LCD displays, 3D content creation, 3D digital signage solutions and 3D software. See 3D without the glasses with Magnetic’s Enabl3d 3D displays.

It's really great to post my comments on such a blog. I would like to appreciate the great work done by the web master and would like to tell everyone that they should post their interesting comments and should make this blog interesting. Once again I would like to say keep it up to blog owner!!!! http://www.naturalherbalsinc.com

Fear not that the life shall come to an end, but rather fear that it shall never have a beginning.

I've been using iPhone for quite a while and i think That the sound feature is really helpful in saving our time . Though , it should be working perfectly without much errors.

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment