Microsoft text-to-speech voices

The Microsoft text-to-speech voices are speech synthesizers provided for use with applications that use the Microsoft Speech API (SAPI) or the Microsoft Speech Server Platform. There are client, server, and mobile versions of the voices. Client voices ship with Windows operating systems. Server voices are available for download for use with server applications such as Speech Server and Lync, on both Windows client and server platforms. Mobile voices ship with more recent versions of Windows.
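As an illustration of how an application might discover voices through SAPI, the following is a minimal Python sketch, not an official Microsoft sample. It assumes a Windows machine with the third-party pywin32 package installed and uses the SAPI 5 automation object SAPI.SpVoice. The helper names voice_names and installed_sapi5_voices are hypothetical. Only voices registered with SAPI 5 would be listed; voices registered elsewhere (for example, Speech Platform voices) may not appear.

```python
import sys


def voice_names(sp_voice):
    """Return the description of each voice token exposed by an SpVoice-like object."""
    tokens = sp_voice.GetVoices()
    return [tokens.Item(i).GetDescription() for i in range(tokens.Count)]


def installed_sapi5_voices():
    """List installed SAPI 5 voices; return an empty list where SAPI is unavailable."""
    if sys.platform != "win32":
        return []
    try:
        # Assumption: the third-party pywin32 package is installed.
        import win32com.client
    except ImportError:
        return []
    return voice_names(win32com.client.Dispatch("SAPI.SpVoice"))


if __name__ == "__main__":
    for name in installed_sapi5_voices():
        print(name)
```

On systems without SAPI or pywin32 the sketch prints nothing rather than raising an error.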

Voices

Windows 2000 and Windows XP

A speech sample of Microsoft Sam.
The first part uses a variation of the "The quick brown fox jumps over the lazy dog" pangram, while the second part showcases the "soi" glitch associated with Sam.

Microsoft Sam is the default male text-to-speech voice in Microsoft Windows 2000 and Windows XP. It is used by Narrator, the screen reader program built into the operating system.

Microsoft Mike and Microsoft Mary are optional male and female voices respectively, available for download from the Microsoft website. Michael and Michelle are also optional male and female voices licensed by Microsoft from Lernout & Hauspie, and are available through Microsoft Office XP and Microsoft Office 2003 or Microsoft Reader.

There are both SAPI 4 and SAPI 5 versions of these text-to-speech voices. SAPI 4 voices are available on Windows 2000 and later Windows NT-based operating systems, and could also be downloaded for Windows 9x operating systems. While SAPI 5 versions of Microsoft Mike and Microsoft Mary are only downloadable as a Merge Module,[1] the installable versions may be installed on end users' systems by speech applications such as Microsoft Reader. SAPI 4 redistributable versions were downloadable for Windows 9x; however, they are no longer offered on the Microsoft website.

The SAPI 4 versions of Microsoft Sam, Microsoft Mike and Microsoft Mary can be used on Windows XP, Vista and later if a third-party program that supports these operating systems (such as Speakonia or TTSReader) is installed; however, the speech patterns differ from those of the SAPI 5 versions of these voices. In addition, the Lernout & Hauspie voices Michael and Michelle will work on Windows Vista and later if the SAPI 4 versions of the voices in British English are downloaded and used with a third-party program such as Speakonia (these voices are also compatible with Windows XP and earlier).

Windows Vista and Windows 7

In Windows Vista and Windows 7, Microsoft Anna is the default English voice. It is a SAPI 5-only female voice and is designed to sound more natural than Microsoft Sam.[2] Microsoft Streets & Trips 2006 and later install the Microsoft Anna voice on Windows XP systems for the voice-prompt direction feature. No male voices ship with Windows Vista or Windows 7, and neither Microsoft Mike nor Microsoft Mary will work on Windows 7.

A female voice called Microsoft Lili, which replaces the earlier male SAPI 5 voice "Microsoft Simplified Chinese", is available in Chinese versions of Windows Vista and Windows 7. It can also be obtained in non-Chinese versions of Windows Vista or Windows 7 by installing the Chinese language pack.

In 2010, Microsoft released newer Speech Platform-compatible voices for speech recognition and text-to-speech, for use with client and server applications. These voices are available in 26 languages[3] and can be installed on Windows client and server operating systems. Speech Platform voices, unlike SAPI 5 voices, are female-only; no male voices were ever released.

Windows 8 and Windows 8.1

In Windows 8, there are three new client (desktop) voices: Microsoft David (US male), Hazel (UK female) and Zira (US female), which are intended to sound more natural than Microsoft Anna. The server versions of these voices are available via the above-mentioned Speech Platform for operating systems earlier than Windows 8. Other voices are available for specific language versions of either Windows 8 or Windows 8.1.[4]

Unlike in Windows Vista or Windows 7, Microsoft Anna cannot be used with any third-party program, because no Anna voice is available for download (and there was never a SAPI 4 version of Microsoft Anna).

Windows 10

In Windows 10, Microsoft Hazel was removed from the US English language pack, and the Microsoft mobile (phone/tablet) voices, Microsoft Mark and Microsoft Zira, became available. These are the same voices found on Windows Phone 8, Windows Phone 8.1 and Windows 10 Mobile.

Language packs that provide a variety of additional voices are also available, as in Windows 8 and 8.1. None of these voices match the Cortana text-to-speech voice, which can be found on Windows Phone 8.1, Windows 10, and Windows 10 Mobile.

As part of Microsoft's attempt to unify its software with Windows 10, all of its current platforms use the same text-to-speech voices, except for Microsoft David and a few others.

Mobile

Every mobile voice package includes both a male and a female voice, while most desktop voice packages have only female voices. All mobile voices have been made universal, and any user who downloads a language pack gains one additional male and one additional female voice from that package.

Windows 10 contains a hidden text-to-speech voice called Microsoft Eva Mobile. Users can download a pre-packaged registry file from the windowsreport.com website to enable it. Microsoft Eva is believed to have been an early voice for Cortana, before Microsoft replaced it with the voice of Jen Taylor in most areas.

These voices are updated along with Windows and, in later retail builds of Windows 10, sound more natural than in the original release.

Windows 11

Windows 11 introduced three new "natural voices" starting with version 22H2: Microsoft Aria, Jenny, and Guy.[5] These natural voices are currently available only through Narrator and cannot be used by any other program, whether first-party or third-party, even though the voices themselves are taken directly from Microsoft's Azure cloud computing platform.

The voices from Windows 10 were reclassified as "legacy voices"; however, David remained the default for the desktop client.

In popular culture

In the mid-2000s, the Microsoft Sam voice became associated with a popular internet meme called "ROFLcopter". ROFLcopter is a slang term derived from "ROFL" (Rolling On the Floor Laughing) and is a portmanteau of the words "ROFL" and "helicopter". The ROFLcopter was commonly depicted as an ASCII helicopter with the words "LOL" and "ROFL" serving as its blades. The meme predates its association with the Microsoft Sam voice: the first instances of "ROFLcopter" appeared in 2004, while the voice was not used with it until mid-2006. The sound associated with the meme comes from Sam's pronunciation of "soy" (and more commonly "soi"): if a user types "soi" or similar, the voice produces sounds resembling a helicopter. This is due to a glitch in how Sam's speech patterns pronounce these words.[6]

References

  1. "Speech SDK 5.1".
  2. Chambers, Rob (August 29, 2006). "Microsoft Anna - The new TTS voice in Vista". MSDN Blogs. Microsoft. Retrieved June 26, 2015.
  3. "Microsoft Speech Platform". 20 January 2015.
  4. "Free text-to-speech (TTS) or speech synthesizers in Microsoft Windows".
  5. "Windows 11's Narrator Is Getting Better Voices". How-To Geek. 27 January 2022.
  6. "ROFLcopter". Know Your Meme. 12 December 2008. Retrieved August 27, 2023.