Media
Voice Monkey supports three kinds of media — audio, video and images — in announcements and Flow nodes. Anywhere a node or Announcement API parameter takes a URL (audio, image, video, background_audio, website_url), you can use any of the sources below.
Where the media comes from
Section titled “Where the media comes from”You have three options:
- Self-hosted URL — paste any public HTTPS URL. Common choices: S3/R2 buckets, Dropbox public links, Google Drive direct links.
- Upload to Voice Monkey (Hobby and Ultimate) — upload via the console at app.voicemonkey.io/media. Each uploaded file gets a public
https://media.voicemonkey.io/…URL — copy it from the Copy URL button on the file card and paste it into any node or API call. Uploaded files are capped at 100 MB each. - Curated library — pre-made soundscapes, sounds, ambient videos and images that ship with Voice Monkey. See below.
In Flow nodes and the Playground, the Browse button on every URL field opens a media picker that lets you pick from any of these sources without copying URLs by hand.
Curated library
Section titled “Curated library”Voice Monkey ships a library of pre-made media — Halloween soundscapes, sleep sounds, ambient videos for Echo Show, themed background images and more. Items are referenced with the short tag syntax:
media://<slug>For example:
{ "audio": "media://halloween-creepy-piano", "image": "media://winter-fireplace"}The API resolves the tag to the real underlying URL at request time. The real URL is intentionally hidden — it can change without breaking your saved flows, and library content can’t be hotlinked from outside Voice Monkey.
Browse the curated library by hitting Browse on any media URL field in a Flow node or in the Playground, then switching to the Library tab. The picker writes the right media://<slug> value for you — no need to remember slugs.
Using images
Section titled “Using images”- Displayed on screen-enabled Echo devices (Echo Show, Echo Show 15).
- Supported formats: JPG, PNG. Keep size under 5 MB for best reliability.
- Use the
imageparameter on the Announcement API or an Image action in a Flow.
Using videos
Section titled “Using videos”- Displayed on Echo Shows only.
- Supported formats: MP4 (H.264). Aim for ≤ 30 MB for fast startup.
- Use the
videoparameter on the Announcement API or a Video action in a Flow. - Ambient videos loop indefinitely — add a Wait + Stop Media if you want a time limit.
Using audio
Section titled “Using audio”- Supported formats: MP3 (recommended), MP4. 44.1 kHz / 48 kHz, mono or stereo.
- Longer clips (up to ~240s) are supported, but short stingers perform best.
- Use the
audioparameter on the Announcement API or an Audio action in a Flow. - Most pre-built soundscapes live in the curated library — reference them with
media://<slug>.
Plan limits
Section titled “Plan limits”| Feature | Free | Hobby | Ultimate |
|---|---|---|---|
| Reference external URLs | yes | yes | yes |
| Use the curated library | yes | yes | yes |
| Upload your own files | no | yes | yes |