Simple pass-through to enclave audio transcription endpoint with encrypted payload. Rvenc = raw volatile encrypted. Accepts an encrypted audio file and inference parameters. Transcribes audio into the input language. Only provides authentication, security checks, and rate limiting. No file storage or custom features.
| Option | Default | Description |
|---|---|---|
file | required | Audio file (ReadStream, Buffer, Blob, etc.) |
model | required | Model ID (e.g., 'openai/whisper-large-v3') |
language | auto-detect | ISO-639-1 language code (e.g., 'en', 'de', 'fr') |
prompt | optional | Text prompt to guide the model |
response_format | 'json' | Format: 'json', 'text', or 'verbose_json' |
temperature | 0 | Sampling temperature (0-1) |
timestamp_granularities | optional | Array of 'word' and/or 'segment' for timestamps |
Note: openai/whisper-large-v3 model supports non-streaming only.
Send your access token as header Authorization: Bearer {accessToken}
Your API key that starts with sk_live or sk_test. You can create yours at go.prem.io/api-keys.
Request body for rvenc (raw volatile encrypted) audio transcription. Contains an encrypted payload with cryptographic materials needed for decryption.
Encrypted JSON string containing all audio transcription parameters. When decrypted, this string must match the structure shown in the expandable _decryptedInference property below (reference only - do not send this property).
Cipher text for shared secret generation (ECDH key exchange)
Nonce used for encrypting the inference payload
The encrypted audio file name. The file name is encrypted using XChaCha20-Poly1305 with the shared secret and fileNameNonce.
Nonce used for encrypting the encrypted audio file name
The encrypted audio file data. The audio content is encrypted using XChaCha20-Poly1305 with the shared secret and fileNonce.
Nonce used for encrypting the encrypted audio file
Encrypted audio transcription response. The response contains the encrypted transcription result which must be decrypted using the shared secret derived from the request's cipherText, along with the response nonce.
Status code of the response
200, 201, 202 Encrypted audio transcription response. Contains the encrypted transcription result that must be decrypted using the shared secret derived from the request's cipherText.
{
"encryptedResponse": "a1b2c3d4e5f67890abcdef1234567890abcdef123456789012345678901234567890abcdef...",
"nonce": "f6e5d4c3b2a19876543210fedcba9876"
}Message of the response, human readable
"Resource created successfully"
API environment
development, production Error message of the response, human readable
"Invalid email address"
Useful informaiton, not always present, to debug the response
{ "request_id": "req_1234567890" }"Some pertinent log message"
Validator response object, each key is the field name and value is the error message
{
"email": "Invalid email address",
"password": "Password is required"
}Support ID linked to the response, used to identify it when talking with our team
"support_uuidv7-something-else"