Skip to main content
POST
/
audio
/
transcriptions
curl https://www.samuraiapi.in/v1/audio/transcriptions \
  -H "Authorization: Bearer $SAMURAI_API_KEY" \
  -F model="whisper-1" \
  -F file="@podcast.mp3" \
  -F language="en" \
  -F response_format="text"
{
  "text": "Welcome to Samurai AI. Today we're going to talk about building AI-powered applications using our unified API gateway."
}
model
string
required
Transcription model. Currently: whisper-1.
file
file
required
Audio file to transcribe. Supported: mp3, mp4, mpeg, mpga, m4a, wav, webm. Max 25MB.
language
string
ISO-639-1 language code (e.g. en, ja, fr, de, es). Providing this improves accuracy and speed.
prompt
string
Optional text to guide the model’s style or continue a previous segment.
response_format
string
default:"json"
Output format. Options: json, text, srt (subtitles), vtt (web subtitles), verbose_json.
temperature
number
default:"0"
Sampling temperature 01. 0 for deterministic output.
curl https://www.samuraiapi.in/v1/audio/transcriptions \
  -H "Authorization: Bearer $SAMURAI_API_KEY" \
  -F model="whisper-1" \
  -F file="@podcast.mp3" \
  -F language="en" \
  -F response_format="text"
{
  "text": "Welcome to Samurai AI. Today we're going to talk about building AI-powered applications using our unified API gateway."
}