I am looking for some easy to install text to speech software for Ubuntu that sounds natural. I've installed Festival
, Gespeaker
, etc., but nothing sounds very natural. All very synthetic and hard to understand.
Any recommendations out there?
I am looking for some easy to install text to speech software for Ubuntu that sounds natural. I've installed Festival
, Gespeaker
, etc., but nothing sounds very natural. All very synthetic and hard to understand.
Any recommendations out there?
SVOX pico2wave
A very minimalistic TTS, a better sounding than espeak or mbrola (to my mind). Some information here.
I don't understand why pico2wave is, compared to espeak or mbrola, rarely discussed. It's small, but sounds really good (natural). Without modification you'll hear a natural sounding female voice.
AND ... compared to Mbrola, it recognise Units and speaks it the right way!
For example:
After installation I use it in a script:
Then run it with the desired text:
or read the contents of an entire file:
That's all to have a lightweight, stable working TTS on Ubuntu.
Pico and espeak are fun and easy to get to work, but they're not all that good. The default Festival voices are also not that good. However, Festival is a scheme-based speech framework, where a number of researchers have built much better plug-in voices. You can easily surpass the pico2wave quality on stock Ubuntu, because one of those voices is available as a ready-made package.
To make Festival sound natural, here's what to do:
You can do it from the command line by using
-b
(or--batch
) and putting each command into single quotes:You can get other quite good voices from the Nitech repository, but installing them is finicky, and the default paths changed so the file name references in the bundled scheme files may need to be manually edited to work on stock Ubuntu.
SpeakIt!
I believe Ive found the best TTS software for free using a Google Chrome extension called "SpeakIt". This only works in the Chrome browser for me on Ubuntu. It doesnt work with Chromium for some reason. SpeakIt comes with two female voices which both sound very realistic compared to everything else out there. There are at least four more male & female voices listed s Chrome extensions if you search the Chrome Web Store using "TTS" as your query.
Usage: For use on a website. you highlight the text you want to be read and either right click and "SpeakIt" or click the SpeakIt icon docked on the Chrome top bar.
Firefox users also have two options. Within Firefox addons, do a search for TTS and you should find "Click Speak" and also "Text to Voice". The voices are not as good as the Chrome SpeakIt voices, but are definitely usable.
The SpeakIt extension uses iSpeech technology and for a price of $20 a year, the site can convert text to MP3 audio files. You can input text, URLs, RSS feeds, as well as documents such as TXT, DOC, and PDF and output to MP3. You can make podcast, embed audio, etc. Here is a link, and a sample of their audio (don't know how long the link will last).
Simple Google™ TTS
Update from project page (2016): This project is currently unmaintained and will remain so for the foreseeable future.
Because of the lack of a better alternative I wrote a bash script that interfaces with a perl script by Michal Fapso to provide TTS via Google Translate. From the project description:
The main features are:
Installation and usage are documented on the project page.
I'd be glad if you gave it a try. Bug reports and any other feedback are welcome!
I have looked high and low for text to speech for Ubuntu that is high quality. There is none. My vocal cords are paralyzed so I needed TTS to add voice instructions to my Ubuntu videos. You can get commercial high quality Linux text to speech software here. It's just really expensive. I ended up buying Natural Reader for Windows (doesn't work in Ubuntu under Wine) for $40. Maybe later I will get the Linux one.
I have been conducting research on the best sounding and easily tuned text to speech voices. Below is a listing of what I thought were the top 5 products in order of sound quality. Most of the websites associated with these product have an interactive demo that will allow for you to make your own determination.
gTTS, Google Text-to-Speech
Cons: CLI-only. Need to be online as it requires to request to Google public open endpoint.
Usage
Documentation and more examples
Others
Some were already mentioned
Mimic. Installation:
eSpeak + Gespeaker (GUI) (Gespeaker source code)
Cons: Old and ugly
Firefox
Chromium/Brave/Chrome
I find Nitech HTS voices on festival very natural and comforting over any other voices I have heard. See this link on how to set up Nitech and other sounds with festival. I have not found a good gui which I can use to configure those voices but setting them via festival.scm still works. That post is very old and you might want to find the actual installation directory using "locate festival" command
Combine SVOX tools (pico) with LibreOffice:
SVOX (pico) tools are easy to install and brings good quality voices in Ubuntu. Install it:
You can use LibreOffice in combination with SVOX (pico) tools by install the "Read Text" extension and you obtain a "GUI" for this excellent TTS software:
Set up Read Text Extension's options with Tools - Add-ons - Read selection.... Use /usr/bin/python as the external program. Select a command line option that includes the token (PICO_READ_TEXT_PY), you may want to experiment some of them.
Now you only have to select some text in LO Writer, Calc, Impress or Draw and clic on the icon added as a tool bar (a happy face with a ballon).
Here is what I did to have pure natural speech for pdf and other text files(other solutions are not natural or they're just paid services). This is actually a work around using chromium or chrome but works fast and easy.
There's also ways to open other files like .doc and .txt in chrome and do the same. There's other extensions for chrome that view pdf files, check if it fits you better. Besides you can upload all kind of texts in Google Drive and use SpeakIt! to read it for you. Another extension called 'Speak text' works the same way and has natural speech.