element14 Community
Design for a Cause - Design Challenge
Author: pranjalranjan299
Date created: 14 Sep 2018 5:55 PM
Audio4Vision #10 - TTS: Text to Speech library

pranjalranjan299
14 Sep 2018

Welcome to our 10th blog post! In this post, we'll be talking about a Python library called TTS (Text-to-Speech).

 

Image captioning is all well and good until we realize that a caption is just a string - it still needs eyes to be read! A visually impaired person gets no information from such data, as they can't see it in the first place. Clearly, we need to switch the mode of communication from visual to auditory - and this is where the TTS library comes in.

 

TTS (Text-to-Speech), also known as speech synthesis, is the artificial production of human speech. A TTS system takes a string as input and outputs an audio file of a "person" speaking that string out loud. This is ideal for our use case.

We use this library to implement TTS in our system. The idea is to send an image from the MKR to a server, have that server generate a caption for it, feed that caption to a TTS service, and send the resulting audio file back to the MKR.
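The flow described above can be sketched as a small pipeline: image bytes in, caption string in the middle, WAV bytes out. This is only an illustration under stated assumptions, not our actual server code. `placeholder_caption`, `placeholder_tts`, and `image_to_speech` are hypothetical names, and the TTS stand-in emits a plain tone rather than real speech so that the sketch runs with the Python standard library alone.

```python
import io
import math
import struct
import wave
from typing import Callable, Tuple


def placeholder_caption(image_bytes: bytes) -> str:
    """Hypothetical stand-in for the captioning model."""
    return f"an image of {len(image_bytes)} bytes"


def placeholder_tts(text: str, sample_rate: int = 8000) -> bytes:
    """Hypothetical stand-in for the TTS call; returns WAV bytes.

    It produces a 440 Hz tone lasting 1/20 s per word, NOT real speech.
    """
    n_frames = sample_rate * max(len(text.split()), 1) // 20
    frames = b"".join(
        struct.pack("<h", int(12000 * math.sin(2 * math.pi * 440 * i / sample_rate)))
        for i in range(n_frames)
    )
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)        # mono
        wav.setsampwidth(2)        # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(frames)
    return buffer.getvalue()


def image_to_speech(
    image_bytes: bytes,
    caption_fn: Callable[[bytes], str] = placeholder_caption,
    tts_fn: Callable[[str], bytes] = placeholder_tts,
) -> Tuple[str, bytes]:
    """Run the image -> caption -> audio steps and return both results."""
    caption = caption_fn(image_bytes)
    audio = tts_fn(caption)
    return caption, audio


if __name__ == "__main__":
    caption, audio = image_to_speech(b"\x00\x01\x02")
    print(caption, len(audio), "bytes of WAV data")
```

Passing the captioning and TTS steps in as functions keeps the sketch independent of any particular model or TTS library; a real caption model and TTS call would replace the two placeholders.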

 

We test out the library:

image

 

A file named audio.wav is created in the demo folder:

image
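One way to confirm that a generated file such as audio.wav really is a valid WAV file is to open it with Python's standard `wave` module and read its header. The sketch below is an assumption-laden example rather than part of our project: `describe_wav` and `write_silent_wav` are hypothetical helpers, and the silent file only stands in for real TTS output so that the example is self-contained.

```python
import os
import tempfile
import wave


def write_silent_wav(path: str, seconds: int = 1, sample_rate: int = 16000) -> None:
    """Write 16-bit mono silence; a stand-in for a file produced by TTS."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * (sample_rate * seconds))


def describe_wav(path: str) -> dict:
    """Return basic properties of a WAV file.

    wave.open raises wave.Error if the file is not a valid WAV file.
    """
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": rate,
            "duration_s": frames / rate,
        }


if __name__ == "__main__":
    demo_path = os.path.join(tempfile.mkdtemp(), "audio.wav")
    write_silent_wav(demo_path)
    print(describe_wav(demo_path))
```

Checking the duration is a quick sanity test: a caption of a few words should yield a clip of a few seconds, not an empty file.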

Here is the audio:

 

[embedded audio clip]

 

Now, let's test it on an actual image, using our Flask server:

image

 

The audio file generated for this image is:

https://drive.google.com/open?id=1b6MKq6OV-XX62gxyC8_mCgLAVcJAE5Zu

 

The MKR then fetches this audio file with a GET request and plays it through the speaker described in the previous blog post.
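To make the GET step concrete, here is a minimal sketch of a server that hands out audio bytes and a client that fetches them. Our server uses Flask; this example swaps in the standard library's `http.server` so it runs without extra packages. The `/audio.wav` path and the names `make_audio_handler` and `fetch_audio_once` are assumptions for illustration, and the Python client only stands in for the request the MKR would make.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer


def make_audio_handler(audio_bytes: bytes):
    """Build a request handler that serves audio_bytes at /audio.wav."""

    class AudioHandler(BaseHTTPRequestHandler):
        def do_GET(self):
            if self.path != "/audio.wav":
                self.send_error(404, "Not found")
                return
            self.send_response(200)
            self.send_header("Content-Type", "audio/wav")
            self.send_header("Content-Length", str(len(audio_bytes)))
            self.end_headers()
            self.wfile.write(audio_bytes)

        def log_message(self, format, *args):
            pass  # keep the demo quiet; no per-request logging

    return AudioHandler


def fetch_audio_once(audio_bytes: bytes):
    """Serve audio_bytes on a free local port, GET it once, and shut down.

    Returns (status code, Content-Type header, response body).
    """
    server = HTTPServer(("127.0.0.1", 0), make_audio_handler(audio_bytes))
    thread = threading.Thread(target=server.serve_forever, daemon=True)
    thread.start()
    try:
        conn = http.client.HTTPConnection("127.0.0.1", server.server_port, timeout=5)
        conn.request("GET", "/audio.wav")
        response = conn.getresponse()
        body = response.read()
        content_type = response.getheader("Content-Type")
        conn.close()
        return response.status, content_type, body
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    status, content_type, body = fetch_audio_once(b"RIFF-demo-bytes")
    print(status, content_type, len(body))
```

Sending an explicit `Content-Length` header tells a small client exactly how many bytes to expect before it starts buffering the file for playback.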

 

That's it for this post, which focused on the server side. The next one will be more Arduino-oriented: we'll talk about the primary functions and the POST/GET requests on the Arduino side.

Thanks for reading! The next post is right around the corner.
