Microsoft's live speech translator, your voice in other languages

Catwell
14 Nov 2012

image

Microsoft translation demonstration, screen capture. (via Microsoft)

Computer user interfaces have done exceptionally well with typed words and haptic gestures, but the technology has lagged behind when it comes to voice interaction. In 1979, hidden Markov modeling provided a better method of matching the waveforms of spoken words to recordings for speech-to-text recognition. With this method the technology improved slowly but eventually reached a plateau: even at its best, it misrecognized 20% to 25% of words. In the meantime, significant improvements in the translation of typed text gave rise to services like Google Translate and Bing Translator that can convert words, phrases and web pages from one language to another.

Microsoft has now improved on both of these technologies using machine learning systems based on neural networks. These improve speech recognition and enable speech-to-text-to-speech translation, producing output in the user's own voice with matching cadence. So far, the program translates full sentences from English to Mandarin in just a few seconds.

The speech recognition was improved using a machine learning technique called deep neural networks (DNNs), a refinement of artificial neural networks, which are mathematical models loosely based on the low-level circuits of the brain. The technique was developed by researchers at Microsoft and the University of Toronto. Using DNNs, Microsoft researchers reduced the error rate of speech recognition software to between 12.5% and 14% of words. Accuracy is expected to improve as more data is fed into the system, giving the DNN more to learn from. More accurate speech recognition in turn gives the Bing translation software a cleaner input.
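
To make the error figures quoted above (20% to 25% of words, then 12.5% to 14%) concrete, here is a minimal sketch of how a word error rate is commonly computed: the word-level edit distance between a reference transcript and the recognizer's output, divided by the number of reference words. This is an assumption about the metric; the article does not say exactly how Microsoft measured its figures, and the function name and example sentences are made up for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (classic Levenshtein, row by row).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from output
            insertion = curr[j - 1] + 1  # extra word in the output
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one wrong word out of eight.
reference = "the quick brown fox jumps over the dog"
hypothesis = "the quick brown box jumps over the dog"
print(f"{word_error_rate(reference, hypothesis):.1%}")  # prints "12.5%"
```

One misrecognized word in eight gives 12.5%, the low end of the range reported for the DNN-based recognizer; one in four or five corresponds to the older 20% to 25% plateau.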

Translation then happens in two stages. First, each English word is translated to its Mandarin equivalent. Second, the translated words are reordered to fit Mandarin sentence structure.
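
The two stages can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the five-word lexicon, the part-of-speech tags, and the single reordering rule (move a trailing prepositional phrase in front of the verb) are hypothetical and far simpler than whatever the Bing translation software actually does, which the article does not describe.

```python
# Toy lexicon: English word -> (Mandarin word, part-of-speech tag).
# Hypothetical data, just enough for one example sentence.
LEXICON = {
    "i": ("我", "PRON"),
    "study": ("学习", "VERB"),
    "chinese": ("中文", "NOUN"),
    "in": ("在", "PREP"),
    "beijing": ("北京", "NOUN"),
}


def translate_words(sentence: str) -> list[tuple[str, str]]:
    """Stage 1: replace each English word with its Mandarin equivalent."""
    # Raises KeyError for words outside the toy lexicon.
    return [LEXICON[word] for word in sentence.lower().split()]


def reorder(tagged: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Stage 2: move a prepositional phrase that follows the verb
    to the position just before the verb."""
    tags = [tag for _, tag in tagged]
    if "VERB" not in tags or "PREP" not in tags:
        return tagged
    verb_at = tags.index("VERB")
    prep_at = tags.index("PREP")
    if prep_at < verb_at:
        return tagged  # already in the expected order
    return tagged[:verb_at] + tagged[prep_at:] + tagged[verb_at:prep_at]


def translate(sentence: str) -> str:
    """Run both stages and join the Mandarin words into one string."""
    return "".join(word for word, _ in reorder(translate_words(sentence)))


print(translate("I study Chinese in Beijing"))  # prints "我在北京学习中文"
```

Stage one alone would give 我学习中文在北京, which keeps the English word order; the reordering step places 在北京 ("in Beijing") before the verb, as Mandarin expects.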

After translation comes the task of speaking the result in the user's own voice, including the inflections that help convey the meaning. To do this, the computer learns from an hour-long session with the user and then manipulates stock recordings, also made by the user, to render the translated text as speech. To achieve proper cadence, the software was built with the help of hours of speech recorded by a native Mandarin speaker. This is the first software to personalize speech-to-speech translations in this manner.

Rick Rashid, Microsoft’s chief research officer, demonstrated the translator to thunderous applause at Microsoft Research Asia’s 21st Century Computing event in Tianjin, China, in late October. Rashid stated that the software has not yet been used to translate any conversation outside Microsoft offices, and commented, “We don’t yet know the limits on accuracy of this technology; it is really too new. As we continue to ‘train’ the system with more data, it appears to do better and better.”

 


 

Cabe

http://twitter.com/Cabe_e14

  • DAB over 12 years ago

    Anything that improves good communication is good.

    I wonder how many wars were the result of a bad translation or just a simple word choice.

     

    Makes you wonder too, doesn't it.

     

    Just a thought,

    DAB

  • Catwell over 12 years ago in reply to DAB

    Makes me fear for the future, when translation software like this will be trusted to the point of ignorance.

     

    I once tried communicating with a person from Italy, if I recall. Not being able to speak Italian, I used Google Translate, and so did the other party.

    Miscommunication then riddled our correspondence. The fellow eventually found an English-speaking friend to help.

     

    C
