audio

Text to Speech

Hosted neural TTS — turns text into a natural-sounding MP3 in seconds. Chain it after Summarize, Translate, OCR, or Redact to get audio out of any tool that produces text. Uses 2 credits per run.

Checking access…

About Text to Speech

Text to Speech turns written text into a natural-sounding MP3 in seconds using a hosted neural TTS model. Reach for it when you want to listen to something instead of read it, or to add a voiceover to content you've produced with other tools. It's a Pro tool that runs on hosted infrastructure and uses 2 credits per run.

Category
audio
Input
Accepts: text/plain.
Output
Outputs: audio/mpeg.
Cost
Credit-metered
Memory
low
Privacy: Text to Speech runs entirely on your device. Files you provide never leave your browser — no uploads, no server, no tracking. The page works offline once loaded.

Common uses

  • Chain it after Summarize to turn a long article into a short audio briefing for your commute
  • Voice a translated paragraph from Translate so you can hear how a phrase sounds in another language
  • Convert OCR'd text from a scanned page into spoken audio for hands-free review
  • Generate narration for a slideshow or product demo from a written script
  • Produce an MP3 of redacted notes so a teammate can listen without seeing the original document
  • Create an audio version of a blog post for accessibility or a podcast feed

Frequently asked questions

What format is the output?

An MP3 audio file (audio/mpeg) generated from your plain-text input.

How much does it cost?

It's a Pro tool that uses 2 credits per run. The free in-browser tools remain free; anything backed by a hosted model is Pro.

Does my text get sent to a server?

Yes. Unlike the fully in-browser tools, this one sends your text to a hosted neural TTS model to synthesize the audio. It's processed for the request and the MP3 is returned to you.

What can feed into it?

Any plain text. It's designed to chain after tools that output text — Summarize, Translate, OCR, or Redact — so you can get audio out of nearly any pipeline.

How long can the text be?

It handles typical paragraphs and articles comfortably. Very long inputs are best split into sections, both for cost control and to keep each clip a manageable length.

Keywords

  • tts
  • text-to-speech
  • audio
  • voice
  • mp3
  • pro
  • hosted

Try next