Speech and Sound: The Next "Killer Paradigm Shift"..?
Speech recognition could impact the business at a variety of levels

There was a time, not so very long ago, when IT directors and chief information officers dismissed the Internet as something of a passing fad. Somehow, though, things took off pretty well with the whole web thing, didn't they? Mobile telephony has also grown to a level of dominance that we could never have predicted when it first started appearing around 30 years ago.

Then came the tablet... just another fad, right? Well, the first few were, but then "Magic Steve" produced the tablet we all love and cherish, didn't he? (OK yes - I know Android is doing well in this space too, you don't need to write in)... so what's coming next?

What Is Our Next Killer Paradigm?
Many believe that "sound" will be the next killer element of "social computing" in terms of information sharing. After all, we share text in various forms, images and video all the time. Shouldn't this mean that "sound" is our next most logically interesting data-share element?

What kind of sound? Our own spoken voice, recorded speech, random commentary, music, environmental recordings -- it's a long list and you can certainly add at least one of your own if you give it a moment's thought. Yes we can link to each other's podcasts already, but we are talking about a level beyond that.

The next tier for sound is allied to its close first cousin "speech" and both could (arguably) be about to move from the playground to the boardroom and therefore potentially move into the CIO's line of sight.

The Speech Steeplechase
The problem is that in its early years, speech/voice recognition technology was something of a novelty. But look at the facts: fingerprint recognition biometrics only surfaced toward the end of the last millennium, and now we have "secure USB flash drives" that work by a finger-swipe. The development curve for highly user-facing technologies has been in overdrive for the last decade, if not more.

Speech recognition companies such as Nuance, which produces the Dragon NaturallySpeaking off-the-shelf product, see a future in several corporate deployment scenarios for their technology, which is grounded in individual user suitability. The company is something of a market leader, with manufacturers from HP to Apple to IBM all working with its technology.

According to Nuance, the human voice is described as an "incredibly rich, natural and efficient means of communication" - and the industry is now working to build solutions that enable computers, phones, tablets, automobiles, TVs and consumer electronics to understand the human voice, providing a "natural interface" between man and machine.

Speech recognition could impact the business at a variety of levels:

  • Speech is used in CRM analytics inside call center deployment scenarios so that customer conversations can be analyzed and filtered in order to discover what keywords customers are using.
  • Healthcare CIOs will already know that CLU (Clinical Language Understanding) technology has a huge role to play in helping healthcare enterprises overcome the challenges of "Big Data" and the ensuing difficulty of collecting, processing, interpreting and then utilizing information.
  • Nuance is not alone...  Google is also said to be attempting to "pioneer" technology that will ultimately enable users to search by the spoken word. Microsoft has similar plans with Bing.
  • Mobile applications (at the consumer and enterprise level too) will have a large number of opportunities for speech recognition to be leveraged. From simple voice commands used to control smartphones, to more powerful voice-driven in-car entertainment and/or so-called "infotainment systems," speech arguably has a strong new role to play.

How Does It Work? Nuance Explains...

  1. A user speaks a command into a microphone
  2. The system converts the sound input into a digital signal
  3. The signal is analyzed and chopped into component speech sounds called "phonemes"
  4. Each phoneme is examined in context with those around it, and statistical probability algorithms are used to determine the intended word from a stored list. This happens for each word
  5. Each word is examined in context with those around it, and statistical probability algorithms are used to determine the intended command
  6. The appropriate response for the command is triggered
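
To make steps 4 to 6 a little more concrete, here is a toy sketch in Python. It is not Nuance's implementation: the word list, the ARPAbet-style phoneme labels, the prior probabilities, the error probability and the command names are all invented for illustration, and it assumes steps 1 to 3 have already turned the audio into one group of phonemes per word. Each candidate word is scored by how common it is and by how many phoneme edits separate it from what was heard, which is a crude stand-in for the statistical models a real recognizer would use.

```python
import math

# Hypothetical stored word list: word -> (pronunciation, prior probability).
LEXICON = {
    "call": (("K", "AO", "L"), 0.4),
    "cold": (("K", "OW", "L", "D"), 0.1),
    "home": (("HH", "OW", "M"), 0.3),
    "hum": (("HH", "AH", "M"), 0.2),
}

# Hypothetical command table: word sequence -> name of the response to trigger.
COMMANDS = {
    ("call", "home"): "dial_home_number",
    ("hum",): "play_tune",
}

# Assumed probability that any single phoneme was misheard, added or dropped.
P_ERROR = 0.1


def edit_distance(heard, expected):
    """Count the substitutions, insertions and deletions between two sequences."""
    prev = list(range(len(expected) + 1))
    for i, x in enumerate(heard, start=1):
        curr = [i]
        for j, y in enumerate(expected, start=1):
            curr.append(min(prev[j] + 1,              # drop an item from heard
                            curr[j - 1] + 1,          # add an item to heard
                            prev[j - 1] + (x != y)))  # match or substitute
        prev = curr
    return prev[-1]


def decode_word(phonemes):
    """Step 4: pick the most probable word from the stored list."""
    def score(word):
        pronunciation, prior = LEXICON[word]
        errors = edit_distance(phonemes, pronunciation)
        # Log-probability: prior of the word plus a penalty per phoneme error.
        return math.log(prior) + errors * math.log(P_ERROR)
    return max(LEXICON, key=score)


def recognize(phoneme_groups):
    """Steps 4 to 6: decode each word, then pick the closest known command."""
    words = [decode_word(group) for group in phoneme_groups]
    best = min(COMMANDS, key=lambda command: edit_distance(words, command))
    return words, COMMANDS[best]


if __name__ == "__main__":
    # The final phoneme of "home" is misheard as "N" instead of "M".
    heard = [("K", "AO", "L"), ("HH", "OW", "N")]
    print(recognize(heard))
```

With this made-up data, the slightly misheard input still decodes to the words "call" and "home" and triggers the "dial_home_number" response, because "home" remains the most probable word in the list despite the one wrong phoneme. A production system would weigh each phoneme and word against its neighbors with far richer models, but the shape of the decision is the same: score every candidate and keep the most likely one.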

The CIO's Central Message
It seems that many real-world scenarios could be using not only speech recognition technologies, but also their sister disciplines, i.e., text-to-speech technology, document imaging and electronic dictation services, which do of course throw up their own data storage challenges.

Nuance VP Peter Mahoney has suggested that really robust industrial-grade speech recognition in the space-age style as depicted in Hollywood movies (or to give it its proper name - "robust natural language" technology) is not far off at all - and that we should see six to ten languages fully supported by this technology as soon as the end of this year.

It's not Star Trek quite yet, but we're close!

•   •   •

This post was first published on the Enterprise CIO Forum.

About Adrian Bridgwater
Adrian Bridgwater is a freelance journalist and corporate content creation specialist focusing on cross-platform software application development as well as all related aspects of software engineering, project management and technology as a whole.
