Speech API Reference
Speech
Generates audio from the given input text.
Azure OpenAI
Request
POST https://api.core42.ai/v1/audio/speech
OpenAI
Request
POST https://api.core42.ai/v1/audio/speech
Request Parameters
Name | Required | Type | Description |
---|---|---|---|
input | true | string | The text to generate audio for. The maximum length is 4096 characters. |
model / deployment-id | true | string | Model ID to use for the request. The available TTS model is gpt-4o-mini-tts. |
voice | true | string | The voice to use when generating the audio. Supported voices are |
instructions | false | string | Control the voice of your generated audio with additional instructions. |
prompt | false | string | Optional text to guide the model's style or continue a previous audio segment. Ensure the prompt matches the audio language. |
response_format | false | string | Format of the audio output. Supported formats: mp3, opus, aac, flac, wav, and pcm. Default is mp3. |
speed | false | number | The speed of the generated audio. Select a value from 0.25 to 4.0. Default is 1.0. |
stream_format | false | string | The format in which to stream the audio. Supported formats are sse and audio. Default is audio. |
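
The following is a minimal Python sketch of how a request to this endpoint might be assembled from the parameters above. It is an illustration under stated assumptions, not a definitive client: the helper names `build_speech_payload` and `synthesize` are hypothetical, and it assumes that the endpoint accepts a JSON body, authenticates with a Bearer token in the `Authorization` header, and returns raw audio bytes. None of these details are confirmed by this page. The voice value is left to the caller because the list of supported voices is not given here.

```python
import json
import urllib.request

# Endpoint as listed in the Request section.
SPEECH_URL = "https://api.core42.ai/v1/audio/speech"

# Limits and allowed values taken from the parameter table.
MAX_INPUT_CHARS = 4096
SUPPORTED_FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}
SUPPORTED_STREAM_FORMATS = {"sse", "audio"}


def build_speech_payload(text, voice, model="gpt-4o-mini-tts",
                         instructions=None, response_format="mp3",
                         speed=1.0, stream_format="audio"):
    """Check arguments against the documented limits and return the request body.

    Hypothetical helper: the JSON field names mirror the parameter table.
    """
    if not text:
        raise ValueError("input text must not be empty")
    if len(text) > MAX_INPUT_CHARS:
        raise ValueError(f"input text exceeds {MAX_INPUT_CHARS} characters")
    if response_format not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported response_format: {response_format}")
    if not 0.25 <= speed <= 4.0:
        raise ValueError("speed must be between 0.25 and 4.0")
    if stream_format not in SUPPORTED_STREAM_FORMATS:
        raise ValueError(f"unsupported stream_format: {stream_format}")

    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
        "speed": speed,
        "stream_format": stream_format,
    }
    # Optional parameter: only sent when the caller provides it.
    if instructions is not None:
        payload["instructions"] = instructions
    return payload


def synthesize(api_key, payload, out_path):
    """POST the payload and write the response body to out_path.

    Assumption: Bearer authentication and a binary audio response body.
    """
    request = urllib.request.Request(
        SPEECH_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as out_file:
        out_file.write(response.read())
```

A call such as `synthesize(api_key, build_speech_payload("Hello", voice=my_voice), "hello.mp3")` would then send the request, where `my_voice` stands for one of the supported voice names. Validating the payload locally before sending keeps out-of-range values, such as a `speed` of 5.0, from reaching the service.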