Transcription model. Currently: whisper-1.
Audio file to transcribe. Supported: mp3, mp4, mpeg, mpga, m4a, wav, webm. Max 25MB.
ISO-639-1 language code (e.g. en, ja, fr, de, es). Providing this improves accuracy and speed.
Optional text to guide the model’s style or continue a previous segment.
Output format. Options: json, text, srt (subtitles), vtt (web subtitles), verbose_json.
Sampling temperature 0–1. 0 for deterministic output.
curl https://www.samuraiapi.in/v1/audio/transcriptions \
-H "Authorization: Bearer $SAMURAI_API_KEY" \
-F model="whisper-1" \
-F file="@podcast.mp3" \
-F language="en" \
-F response_format="text"
{
"text": "Welcome to Samurai AI. Today we're going to talk about building AI-powered applications using our unified API gateway."
}