Skip to content
  • There are no suggestions because the search field is empty.

Speech API Reference

Speech

Converts the given audio file to text.

Azure OpenAI

Request

POST https://api.core42.ai/v1/audio/speech

OpenAI

Request

POST https://api.core42.ai/v1/audio/speech

Request Parameters 

Name

Required

Type

Description

input

true

file

The text to generate audio for. The maximum length is 4096 characters.

model / deployment-id

true

string

Model ID to use for the request. The available TTS models are gpt-4o-mini-tts.

Voice

true

string

The voice to use when generating the audio. Supported voices are alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, and verse. Previews of the voices are available in the Text to speech guide.

instructions

false

string

Control the voice of your generated audio with additional instructions.

prompt

false

string

Optional text to guide the model's style or continue a previous audio segment. Ensure the prompt matches the audio language.

response_format

false

string

Format of the audio output. Supported output formats: mp3, opus, aac, flac, wav, and pcm. Default output is mp3.

Speed

false

number

The speed of the generated audio. Select a value from 0.25 to 4.0. Default is 1.0.

stream_format

false

string

The format to stream the audio in. Supported formats are sse and audio. Default stream value is audio.