SYS-CON MEDIA Authors: Yeshim Deniz, Elizabeth White, Peter Silva, Liz McMillan, Pat Romanski

News Feed Item

The International Computer Science Institute (ICSI) Leads Team Researching Ways to Build Speech Recognition Systems for New Languages Under Severe Data and Time Constraints

The International Computer Science Institute (ICSI) is leading a research team under the IARPA Babel Program that is focused on building speech recognition solutions with self-imposed time and data limitations for a variety of languages. The work aims to better understand fundamental challenges and discover new methods for development of speech models for languages that could emerge as important in the future.

“The goal of the Babel program is to rapidly build speech recognition systems to support effective keyword search for new languages using limited amounts of transcribed speech recorded in real-world conditions,” said Mary Harper, the IARPA Program Manager in charge of the Babel program.

Using only a fraction of the training data usually required, the team aims to build speech recognition systems for several languages in just one week by the end of the program.

“ICSI excels at intellectual challenges and unique approaches to research. This is an intriguing project that puts significant constraints on our researchers as a means to discover better ways to develop automatic speech recognition systems,” said Roberto Pieraccini, director and president of ICSI.

By working on a variety of languages with time and data restrictions, the team will research basic principles of speech technology rather than incremental improvements to existing technology. In addition, this research will be useful in enabling keyword-search systems for those languages that do not have large amounts of transcribed audio.

“The speech recognition systems we’ve built in the past have the curse of being reasonably good, particularly for a few languages and speech recorded in good acoustic conditions, which has often reduced the impetus to significantly change the technology,” said Professor Nelson Morgan, deputy director and leader of the Speech Group at ICSI. “This project strongly pushes us to solve fundamental problems in speech recognition to address the Babel challenge."

In each of the four periods of the project, the team will be given a set of languages and will be tasked with developing methods to quickly build a system. Speech recognition systems are typically trained on thousands of hours of transcribed audio. In this project, the team was initially given only 80 hours of conversational speech for each language, and in each succeeding period a smaller fraction of the audio is transcribed. At the end of each period, the team will be given a new language to build a system – initially in four weeks, but by the end of the program down to just one week.

In addition to Morgan, the leaders of the team are Steven Wegmann of ICSI, Professor Mari Ostendorf of the University of Washington, Professor Janet Pierrehumbert of Northwestern University, Professor Eric Fosler-Lussier of The Ohio State University, and Professor Dan Ellis of Columbia University. Morgan says an important element of the project is that these team leaders have had strong previous research ties with one another in research topics that are essential to the Babel problem.

The project is funded by the Intelligence Advanced Research Projects Activity (IARPA), a research arm of the Office of the Director of National Intelligence, which invests in high-risk/high-payoff research programs.

About ICSI

The International Computer Science Institute (ICSI) is a leading center for research in computer science and one of the few independent, nonprofit research institutes in the United States. With its unique focus on international collaboration and its affiliation with the University of California at Berkeley, ICSI brings together the most influential U.S. scientists and experts from around the world in areas such as computer networking and security, speech and language processing, algorithms, bioinformatics, computer architecture, computer vision, and artificial intelligence. For more information, check ICSI out on the Web:

www.ICSI.berkeley.EDU | http://twitter.com/ICSIatBerkeley | http://blog.ICSI.berkeley.EDU

www.facebook.com/ICSIatBerkeley | www.youtube.com/ICSIatBerkeley

More Stories By Business Wire

Copyright © 2009 Business Wire. All rights reserved. Republication or redistribution of Business Wire content is expressly prohibited without the prior written consent of Business Wire. Business Wire shall not be liable for any errors or delays in the content, or for any actions taken in reliance thereon.

Latest Stories
SYS-CON Events announced today that Open Data Centers (ODC), a carrier-neutral colocation provider, will exhibit at SYS-CON's 16th International Cloud Expo®, which will take place June 9-11, 2015, at the Javits Center in New York City, NY. Open Data Centers is a carrier-neutral data center operator in New Jersey and New York City offering alternative connectivity options for carriers, service providers and enterprise customers.
Skeuomorphism usually means retaining existing design cues in something new that doesn’t actually need them. However, the concept of skeuomorphism can be thought of as relating more broadly to applying existing patterns to new technologies that, in fact, cry out for new approaches. In his session at DevOps Summit, Gordon Haff, Senior Cloud Strategy Marketing and Evangelism Manager at Red Hat, will discuss why containers should be paired with new architectural practices such as microservices ra...
Thanks to Docker, it becomes very easy to leverage containers to build, ship, and run any Linux application on any kind of infrastructure. Docker is particularly helpful for microservice architectures because their successful implementation relies on a fast, efficient deployment mechanism – which is precisely one of the features of Docker. Microservice architectures are therefore becoming more popular, and are increasingly seen as an interesting option even for smaller projects, instead of bein...
SYS-CON Events announced today that Dyn, the worldwide leader in Internet Performance, will exhibit at SYS-CON's 16th International Cloud Expo®, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY. Dyn is a cloud-based Internet Performance company. Dyn helps companies monitor, control, and optimize online infrastructure for an exceptional end-user experience. Through a world-class network and unrivaled, objective intelligence into Internet conditions, Dyn ensures...
SYS-CON Events announced today that Blue Box has been named “Bronze Sponsor” of SYS-CON's DevOps Summit New York, which will take place June 9-11, 2015, at the Javits Center in New York City, NY. Blue Box delivers Private Cloud as a Service (PCaaS) to a worldwide customer base. Built on a technology platform leveraging decades of operational expertise in cloud and distributed systems, Blue Box Cloud is a managed private cloud product available in both hosted and on-prem versions. Each Blue Box ...
SYS-CON Events announced today that Vicom Computer Services, Inc., a provider of technology and service solutions, will exhibit at SYS-CON's 16th International Cloud Expo®, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY. They are located at booth #427. Vicom Computer Services, Inc. is a progressive leader in the technology industry for over 30 years. Headquartered in the NY Metropolitan area. Vicom provides products and services based on today’s requirements...
This builds on Puppet Labs' first class Windows support, including native .MSI packages for x32 and x64 operating systems, modules to extend common Windows server management tools, including Powershell, and integrations with Microsoft Azure and Visual Studio. By automating common Windows administration tasks, Puppet Labs is enabling users to adopt DevOps practices, thereby reducing the time needed to deploy applications from weeks to hours.
DevOps tends to focus on the relationship between Dev and Ops, putting an emphasis on the ops and application infrastructure. But that’s changing with microservices architectures. In her session at DevOps Summit, Lori MacVittie, Evangelist for F5 Networks, will focus on how microservices are changing the underlying architectures needed to scale, secure and deliver applications based on highly distributed (micro) services and why that means an expansion into “the network” for DevOps.
BlueBox bridge the chasm between development and infrastructure. Hosting providers are taking standardization and automation too far. For many app developers it does nothing but spawn mayhem and more work. They have to figure out how their creations live on a pre-fab infrastructure solution full of constraints. Operations-as-a-Service is what BlueBox does. BlueBox utilizes development tools such as OpenStack, EMC Razor, Opscode’s Chef and BlueBox's proprietary tools give the power to do the unor...
Application metrics, logs, and business KPIs are a goldmine. It’s easy to get started with the ELK stack (Elasticsearch, Logstash and Kibana) – you can see lots of people coming up with impressive dashboards, in less than a day, with no previous experience. Going from proof-of-concept to production tends to be a bit more difficult, unfortunately, and it tends to gobble up our attention, time, and money. In his session at DevOps Summit, Otis Gospodnetić, co-author of Lucene in Action and founder...
What’s inside the cloud? Hard work. Cloud operators know the world inside the datacenter is gritty. Vendor marketing speak and cloudwashing quickly melt in the heat of SLAs, uptime guarantees, and users who want it now. In his session at DevOps Summit, Hernan Alvarez, Chief Product Officer at Blue Box Group, will deliver an unvarnished look inside the world of cloud operators, from the perspective of someone who lives it. Attendees get a front-row look into the toolkits and processes that enabl...
Application metrics, logs, and business KPIs are a goldmine. It’s easy to get started with the ELK stack (Elasticsearch, Logstash and Kibana) – you can see lots of people coming up with impressive dashboards, in less than a day, with no previous experience. Going from proof-of-concept to production tends to be a bit more difficult, unfortunately, and it tends to gobble up our attention, time, and money. In his session at DevOps Summit, Otis Gospodnetić, co-author of Lucene in Action and founder...
We are all here because we are sold on the transformative promise of The Cloud. But what good is all of this ephemeral, on-demand infrastructure if your usage doesn't actually improve the agility and speed of your business? How must Operations adapt in order to avoid stifling your Cloud initiative? In his session at DevOps Summit, Damon Edwards, co-founder and managing partner of the DTO Solutions, will highlight the successful organizational, process, and tooling patterns of high-performing c...
The 3rd International @ThingsExpo, co-located with the 16th International Cloud Expo – to be held June 9-11, 2015, at the Javits Center in New York City, NY – is now accepting Hackathon proposals. Hackathon sponsorship benefits include general brand exposure and increasing engagement with the developer ecosystem. At Cloud Expo 2014 Silicon Valley, IBM held the Bluemix Developer Playground on November 5 and ElasticBox held the DevOps Hackathon on November 6. Both events took place on the expo fl...
Physical, virtual, containers. Private cloud, public cloud, hybrid cloud. IaaS, PaaS, SaaS. Windows, Linux, Mac. These are just some of the choices faced when architecting a datacenter of today. And the choice is not one or the other; instead, it is often a combination of many of these. HashiCorp builds software to ease these decisions by presenting solutions that bridge the gaps. HashiCorp's tools manage both physical machines and virtual machines, Windows, and Linux, SaaS and IaaS, etc. The co...