What is Speech Synthesis?

By Kristina Choi
Updated Feb 04, 2024

Speech synthesis is a process where verbal communication is replicated through an artificial device. A computer that converts text to speech is one kind of speech synthesizer.

The earliest forms of speech synthesis were implemented through machines designed to function like the human vocal tract. The speaking machine created by Wolfgang von Kempelen in the 1700s is an example. With this device, speech was produced through a kitchen bellows, a bagpipe reed and a clarinet bell. The bellows was designed to act like a lung, while the glottis (the area of the vocal cords) was represented by the bagpipe reed. The clarinet bell served as the mouth.

Operation of the device was completely manual. The right hand controlled a series of levers while the left hand manipulated the clarinet bell (mouth). There was also the option of plugging the ‘nostrils’ to produce a less nasal sound. Either way, as long as the basic controls were used properly, the machine received airflow, and this airflow determined the types of sounds that would be produced.

Subsequent speaking machines throughout the 18th and 19th centuries maintained this setup, though there were improvements. For example, in the mid-1800s, Joseph Faber created a speaking machine that could receive input through a keyboard and a pedal. The design was also theatrical, as the sound came out through an artificial ‘face.’

When the 20th century came around, innovations in electronics allowed speech synthesis to take an even more powerful direction. Although the premise of imitating the human vocal tract was still the same, early 20th century speaking machines could produce better sounds since the input was more precise.

However, it wasn’t until the advent of computers that speech synthesis could actually be used outside of the entertainment arena. This is mainly because speech synthesizers could be implemented in software instead of as a separate machine. Additionally, with computers as an aid, speech synthesis could take on a different form: using human voices as the main source for sound.

This form of speech synthesis is known as concatenative. The process works by connecting various recordings of human speech. The resulting sound is much more natural and pleasing to the ear. This is in contrast to programs that use articulatory synthesis, where speech is replicated through a computerized model of the vocal tract.
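
The idea of connecting recordings can be illustrated with a minimal Python sketch. It is not how any particular commercial synthesizer works. Short sine tones stand in for recorded speech units, the labels and the `INVENTORY` table are made up for illustration, and the seams are smoothed with a simple linear crossfade, which is an assumption of this sketch rather than something the article specifies.

```python
import math

SAMPLE_RATE = 8000  # samples per second (assumed for this sketch)


def make_unit(freq_hz, duration_ms=50):
    """Stand-in for one recorded speech unit: a short sine tone."""
    n = SAMPLE_RATE * duration_ms // 1000
    return [math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


def concatenate(units, overlap=40):
    """Join units end to end, crossfading `overlap` samples at each seam."""
    if not units:
        return []
    out = list(units[0])
    for unit in units[1:]:
        k = min(overlap, len(out), len(unit))
        start = len(out) - k  # first sample of the overlapping region
        for i in range(k):
            w = (i + 1) / (k + 1)  # weight of the incoming unit rises across the seam
            out[start + i] = (1 - w) * out[start + i] + w * unit[i]
        out.extend(unit[k:])  # append the part of the unit that was not blended
    return out


# Hypothetical inventory: a real system would store recorded human speech here.
INVENTORY = {
    "h": make_unit(220),
    "e": make_unit(330),
    "l": make_unit(262),
    "o": make_unit(294),
}


def synthesize(labels):
    """Look up each label's unit and connect the units into one waveform."""
    return concatenate([INVENTORY[label] for label in labels])


samples = synthesize(["h", "e", "l", "l", "o"])
```

Here `samples` is a single list of floating-point values that could be written to an audio file. The crossfade is there because abrupt joins between recordings tend to produce audible clicks.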

Commercial speech synthesizers can employ either concatenative or articulatory methods, but both achieve the same objective: giving people an opportunity to hear text. This is especially helpful in situations where reading is impractical or impossible.

In the business world, such situations are very common, especially for telephone transactions. Without text-to-speech (TTS) alternatives, business owners would have to spend money hiring even more customer service personnel. Synthesized solutions avoid this problem, since everything is done by a computer, not a human being.

Synthesized speech also plays a role in daily life, especially for individuals with disabilities. Talking clocks, dictionaries and other devices can make things easier for people who have trouble seeing or reading. Synthesized speech can even give a voice to individuals who cannot speak at all. Stephen Hawking, the famous physicist, was a prominent example. After Lou Gehrig’s disease left him unable to speak, Hawking used a voice synthesizer to communicate with people.

There are also TTS applications available to assist people with various computer activities. To obtain these types of applications, most users will have to buy separate software or download patches. The latter option is usually free, depending on the operating system or word processing program being used. However, a person who decides to buy separate software could have access to a higher-quality system. Examples include Natural Reader 7 and Text Aloud 2.

Ultimately, speech synthesis is a technology that has revolutionized how mankind communicates. In a sense, it gives text a life of its own. It also gives the world an opportunity to hear the thoughts of brilliant individuals who would otherwise have been voiceless.
