Skip to content
Heads up — these docs cover Voice Monkey API v3, the current version. If you signed up before the v3 launch, your account is still on API v2 for a limited transition period and the examples below will not work against your account.

Media

Voice Monkey supports three kinds of media — audio, video and images — in announcements and Flow nodes. Anywhere a node or Announcement API parameter takes a URL (audio, image, video, background_audio, website_url), you can use any of the sources below.

You have three options:

  1. Self-hosted URL — paste any public HTTPS URL. Common choices: S3/R2 buckets, Dropbox public links, Google Drive direct links.
  2. Upload to Voice Monkey (Hobby and Ultimate) — upload via the console at app.voicemonkey.io/media. Each uploaded file gets a public https://media.voicemonkey.io/… URL — copy it from the Copy URL button on the file card and paste it into any node or API call. Uploaded files are capped at 100 MB each.
  3. Curated library — pre-made soundscapes, sounds, ambient videos and images that ship with Voice Monkey. See below.

In Flow nodes and the Playground, the Browse button on every URL field opens a media picker that lets you pick from any of these sources without copying URLs by hand.

Voice Monkey ships a library of pre-made media — Halloween soundscapes, sleep sounds, ambient videos for Echo Show, themed background images and more. Items are referenced with the short tag syntax:

media://<slug>

For example:

{
"audio": "media://halloween-creepy-piano",
"image": "media://winter-fireplace"
}

The API resolves the tag to the real underlying URL at request time. The real URL is intentionally hidden — it can change without breaking your saved flows, and library content can’t be hotlinked from outside Voice Monkey.

Browse the curated library by hitting Browse on any media URL field in a Flow node or in the Playground, then switching to the Library tab. The picker writes the right media://<slug> value for you — no need to remember slugs.

  • Displayed on screen-enabled Echo devices (Echo Show, Echo Show 15).
  • Supported formats: JPG, PNG. Keep size under 5 MB for best reliability.
  • Use the image parameter on the Announcement API or an Image action in a Flow.
  • Displayed on Echo Shows only.
  • Supported formats: MP4 (H.264). Aim for ≤ 30 MB for fast startup.
  • Use the video parameter on the Announcement API or a Video action in a Flow.
  • Ambient videos loop indefinitely — add a Wait + Stop Media if you want a time limit.
  • Supported formats: MP3 (recommended), MP4. 44.1 kHz / 48 kHz, mono or stereo.
  • Longer clips (up to ~240s) are supported, but short stingers perform best.
  • Use the audio parameter on the Announcement API or an Audio action in a Flow.
  • Most pre-built soundscapes live in the curated library — reference them with media://<slug>.
FeatureFreeHobbyUltimate
Reference external URLsyesyesyes
Use the curated libraryyesyesyes
Upload your own filesnoyesyes