Skip to main content
The AI Audio node uses the AI to convert text to speech. This is useful for creating audio files from text. It uses ElevenLabs under the hood.

Common options

Inputs

Modes

These are the modes that you can choose from:
1

Text to Speech

Converts text to speech using AI via ElevenLabs
2

Sound Effect

Converts text to sound effect using AI via Elevenlabs

Text to Speech

Here are the settings that you can use to customize your text to speech audio:
1

Text to Speech Script

The actual text that will get spoken by the AI.
2

Voice to use

The voice to use when speaking the text.

Sound Effect

1

Describe the sound effect

The description of the sound effect you want to generate.

Output

The output of this node is a URL to the audio file that was generated. Here’s an example.
https://milkylabs.s3.amazonaws.com/f-af400b06-101b-46db-864b-ab86ac27ddcc-Ztl0RB-1744552978537.mp3

Video Walkthrough

Below is a video walkthrough of how to use the AI Audio node.