AT&T’s Watson Answers THE CALL
| By Masterson, Michele | |
| Proquest LLC |
AT&T and
Both companies are committed to reshaping the future of speech-enabled interactions
If you ask many technology enthusiasts to explain Watson technology, they'll likely point to
In addition to their name, the technologies share other similarities: Both were built at very targe and wellknown companies. Both technologies are speechenabled. And both products have the potential to significantly contribute to user interactions, customer experiences, and the Internet of Things. However, there are fundamental differences, too, such as their technological maturity levels and what they can contribute to your organization today. Read on to find out more. -By
Spanning decades, AT&T's Watson engine continues to evolve and entice developers By
"
At a basic level, AT&T's Watson is a speech and natural language engine that processes and analyzes speech input, performs at least one service, and returns a result in real time.
AT&T's Watson is frequently confused with
"The way we have architected AT&T Watson is that we realized that to build an application or a service, which is the ultimate goal here, there are many different technologies that need to be working synchronously. This is how you get a seamless, ubiquitous experience," Gilbert adds.
To that end, AT&T's Watson can combine speech with other modalities, such as touch screens (e.g., Find the closest
AT&T Launches Watson APIs
While AT&T's Watson has been used in IVRs for more than 20 years, it wasn't until
With these APIs, developers don't have to be specifically skilled in creating speech apps; they can send AT&T audio and it can return text of what an end user has said. "We're doing this so people don't have to reinvent the wheel," Gilbert said in an earlier interview with Speech Technology.
The AT&T Developer Platform provides access to software developer kits and code samples for several environments, such as
"The whole world of mobile development is so complicated now because of all the different options," says
Dahl offers that when it comes to mobile intelligent virtual assistants, the vocabulary in Nuance's Nina is the most customizable application. However, she maintains that AT&T's Watson offers a variety of contexts. Currently, there are nine speech contexts available: gaming, business search, social media, TV, Web search, general purpose or generic, voicemail to text, SMS, and question and answer.
As an example, the business search context is trained on tens of millions of local business entries, and lets users transcribe search queries. The question-and-answer context is trained on 10 million questions and enables users to transcribe questions and have the correct answer returned. "AT&T's option is a good balance between price, ease of use, and flexibility as far as customizing it for your own application," Dahl says. "The things that stand out to me are the levels of customization and the number of environments."
"The combination of these modalities is key to providing a ubiquitous experience because, given these different environments, such as sitting in your car, you want to have a very different experience than using it at home," Dahl says. "In some cases, more than one modality is required to fulfill an action. I'm not aware of a platform that brings all this rich technology into a single framework."
The API program has proved to be popular; currently there are 43,000 developers using the platform. AT&T offers a free 90-day trial period and a yearly subscription for
With QNX, Watson's speech engine analyzes words spoken by a driver and fits them into known patterns. What has been said is then routed from the cloud to the car. The in-vehicle intent engine from QNX performs the rest of the speech analysis to figure out how to act.
"Sharing the workload across client and server offers automotive manufacturers and end users the best of both worlds," said
The Heavy Hitters
While AT&T Watson-based APIs provide access for developers not trained to use speech apps or who work in different platforms, the company has also worked with developers who are using the fullfledged Watson engine.
"They're not just using Watson as an API; they want to customize and personalize and have proactive analytics and do sophisticated technologies like speaker verification," Gilbert says. "These developers understand the technology and want to create something unique in the market. Our enterprise customers are not looking for the technology, they're looking for an end-to-end service where speech and natural language is part of that service."
A third way AT&T works with customers is by licensing technology through a joint strategic agreement. "There's the idea of combining one plus one and [coming up with] more than two, and we can go after a market together," Gilbert says. "We can't do it ourselves, they can't do it themselves, and we join forces to create either a speech, virtual assistant biometric, or translation application for different verticals."
Interacting with Interactions
Interactions is a company in the natural language processing business that typically helps large organizations, such as
Enterprises can use intelligent virtual assistants in a variety of applications in both customer care and sales settings, serving to route or self-serve customers, explains
For example, Interactions completes
Iacobucci says the company has designed its operations with a human element that uses the same protocols as a speech recognition engine. It is a way of understanding extremely complex human dialogue where speech recognition historically experiences trouble, including open-ended sentences, alphanumeric data, out-of-grammar responses, and scenarios with background noise or accents.
The company's technology is focused on using the combination of humanassisted understanding and an automatic speech recognition (ASR) engine to create conversational, or humanlike, systems. "The application really doesn't know where the understanding is coming from," Iacobucci says. "Our technology determines where the understanding should come from and has a great level of understanding of languages and accents far beyond what's capable from speech recognition alone."
Iacobucci says the company designs its back-end recognition technology as part of its offering assuming that there will be a level of ASR to work in tandem with Human Assisted Understanding (HAU). ASR, he explains, takes on the parts of the conversation that can be easily automated, and HAU takes on tasks that are more difficult and not suited for ASR.
In
Developers need such sophisticated virtual assistants to recognize an open vocabulary, and not one that is suited just for retail or music, for example, Gilbert says. "You want natural language to go beyond recognizing entities, attributes, and intents," he says. "You want it to recognize context of words and phrases given a particular transaction. You have to include things like different kinds of parsing. These are all intelligence that you have to do as part of natural language."
Iacobucci says the company has found its partnership with AT&T is not only strong technically, but also that AT&T has been very supportive of its endeavors.
"This is really far more than a technical partnership," Iacobucci says. "Together we're building some really sophisticated, high-performing applications and delivering on the promise of speech recognition systems that are providing a great customer experience, and we're doing that hundreds of millions of times a year."
SpeakTolt
SpeakTolt is the creator of an intelligent virtual assistant for smartphones and tablets, and employs a team of 25 linguists and software engineers. The company's technology uses human-machine interfaces based on natural language interaction and predictive assistance. Its mobile application, SpeakTolt Assistant, has been downloaded more than 9 million times, most of which have been for
SpeakToIt's intelligent virtual assistant can update statuses on social networking sites such as Twitter and Facebook; access content from sites such as Google, Trip Advisor, Yelp, or Foursquare; shop on sites such as Amazon; and update calendars and notes using
While the SpeakTolt Assistant sounds a lot like Apple's
According to
Gelfenbeyn says that Watson offers the company the capability to do customer grammars and adjust the recognizers dynamically. "It allows us to do pretty neat things for personalization," he says. "We are looking forward to getting more options for recognizers in terms of languages and also accents.
"For us, it's a great solution in terms of flexibility," Gelfenbeyn says. "Our experience was extremely positive with the support the AT&T speech team and the API team has provided."
The End Game
Between its strategic partnerships, licensing agreements, and developer platform, it looks like the use of Watson's platform will continue to grow, especially considering that using speech is still a relatively novel concept in the world of business, aside from the call center.
"A lot of companies are still in the stage where they're trying to decide what they want to do with speech in a mobile environment," Dahl says. "They might think, 'Well, this is great, but how can it apply to my company?"'
If Gilbert has his way, that question will be answered from the thousands of developers using AT&T's APIs.
"I'm very excited [about] where we are," Gilbert says. "We're investing heavily into all these different areas. We're not trying to be the best speech company in the world, but we're using our partners and our assets to [better] position those that are in high, rapid-growth businesses." El
"The things that stand out...are the levels of customization and the number of environments."
"You want natural language to go beyond recognizing entities, attributes, and intents."
Staff Writer
| Copyright: | (c) 2014 Information Today, Inc. |
| Wordcount: | 2375 |



Erie pharmacy celebrates 60 years
IBM’s Watson BRINGS Cognitive Computing TO CUSTOMER ENGAGEMENT
Advisor News
- Rising healthcare costs impact 401(k) accounts
- What advisors think about pooled employer plans, alternative investments
- AI, stablecoins and private market expansion may reshape financial services by 2030
- Cheers to summer, and planning for what comes next
- Why seniors fear spending their own retirement wealth
More Advisor NewsAnnuity News
- AuguStar Retirement launches StarStream Variable Annuity
- Prismic Life Announces Completion of Oversubscribed Capital Raise
- Guaranteed income streams help preserve assets later in retirement
- MassMutual turns 175, Marking Generations of Delivering on its Commitments
- ALIRT Insurance Research: U.S. Life Insurance Industry In Transition
More Annuity NewsHealth/Employee Benefits News
- SEN. POORE EXPANDS COVERAGE FOR MENOPAUSE AND PERIMENOPAUSE CARE
- PA HOUSE FINANCE COMMITTEE ADDRESSES HEALTHCARE ACCESS AND AFFORDABILITY FOR WORKING PENNSYLVANIANS
- Providence to end most health insurance plans, forcing hundreds of thousands in Oregon to switch
- Flemington-Raritan Seeking Assistance From State Regarding Rising Health Insurance Costs
- Mandela Barnes proposes blocking use of AI to boost consumer prices
More Health/Employee Benefits NewsLife Insurance News
- Fitch Ratings revises EquiTrust’s outlook to Negative
- AI, stablecoins and private market expansion may reshape financial services by 2030
- Transgender plaintiffs win preliminary victories in three gender-affirming care lawsuits
- AM Best Upgrades Issuer Credit Rating of Southern Farm Bureau Life Insurance Company
- Industry Innovator Scores New High-Water Mark: Reliance Matrix Logs 8 Millionth Employee Benefit/Absence Claim
More Life Insurance News