Crear traducción

curl -X POST "https://api.tokenlab.sh/v1/audio/translations" \
  -H "Authorization: Bearer sk-your-api-key" \
  -F "file=@german_audio.mp3" \
  -F "model=whisper-1"

from openai import OpenAI

client = OpenAI(
    api_key="sk-your-api-key",
    base_url="https://api.tokenlab.sh/v1"
)

with open("german_audio.mp3", "rb") as audio_file:
    response = client.audio.translations.create(
        model="whisper-1",
        file=audio_file
    )

print(response.text)

import OpenAI from 'openai';
import fs from 'fs';

const client = new OpenAI({
  apiKey: 'sk-your-api-key',
  baseURL: 'https://api.tokenlab.sh/v1'
});

const response = await client.audio.translations.create({
  model: 'whisper-1',
  file: fs.createReadStream('german_audio.mp3')
});

console.log(response.text);

<?php
$ch = curl_init('https://api.tokenlab.sh/v1/audio/translations');

$file = new CURLFile('german_audio.mp3', 'audio/mpeg', 'german_audio.mp3');

curl_setopt_array($ch, [
    CURLOPT_RETURNTRANSFER => true,
    CURLOPT_POST => true,
    CURLOPT_HTTPHEADER => [
        'Authorization: Bearer sk-your-api-key'
    ],
    CURLOPT_POSTFIELDS => [
        'file' => $file,
        'model' => 'whisper-1'
    ]
]);

$response = curl_exec($ch);
curl_close($ch);

$data = json_decode($response, true);
echo $data['text'];

{
  "text": "Hello, my name is Wolfgang and I come from Germany. Where are you from?"
}

{
  "task": "translate",
  "language": "german",
  "duration": 8.470000267028809,
  "text": "Hello, my name is Wolfgang and I come from Germany. Where are you from?",
  "segments": [
    {
      "id": 0,
      "seek": 0,
      "start": 0.0,
      "end": 4.0,
      "text": " Hello, my name is Wolfgang and I come from Germany.",
      "tokens": [50364, 2425, 11, 452, 1315, 307, 25329, 293, 286, 808, 490, 5765, 13, 50564],
      "temperature": 0.0,
      "avg_logprob": -0.45,
      "compression_ratio": 1.0,
      "no_speech_prob": 0.0
    }
  ]
}

Resumen

Traduce audio en cualquier idioma compatible a texto en inglés. A diferencia de la transcripción, este endpoint siempre devuelve texto en inglés independientemente del idioma de entrada.

Cuerpo de la solicitud

Tiempo de espera de solicitudes síncronas: este endpoint no-chat espera a que el modelo enrutado termine. Entradas grandes, audio largo o lotes grandes pueden superar los valores predeterminados habituales de 30s del cliente, así que configura el timeout de tu cliente HTTP en al menos 120s.

file

requerido

El archivo de audio a traducir. Formatos compatibles: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm. El tamaño máximo del archivo es de 25 MB.

model

string

predeterminado:"whisper-1"

El modelo que se utilizará. Actualmente, solo se admite whisper-1.

prompt

string

Un texto opcional para guiar el estilo del modelo o continuar un segmento anterior. Debe estar en inglés.

response_format

string

predeterminado:"json"

El formato de la salida. Opciones: json, text, srt, verbose_json, vtt.

temperature

number

La temperatura de muestreo, entre 0 y 1. Los valores más altos, como 0.8, producen una salida más aleatoria, mientras que los valores más bajos, como 0.2, hacen que la salida sea más enfocada y determinista.

Respuesta

text

string

El texto traducido en inglés.

Para el formato verbose_json, la respuesta también incluye:

language

string

El idioma detectado del audio de entrada.

duration

number

La duración del audio de entrada en segundos.

segments

array

Segmentos del texto traducido con marcas de tiempo.

curl -X POST "https://api.tokenlab.sh/v1/audio/translations" \
  -H "Authorization: Bearer sk-your-api-key" \
  -F "file=@german_audio.mp3" \
  -F "model=whisper-1"

from openai import OpenAI

client = OpenAI(
    api_key="sk-your-api-key",
    base_url="https://api.tokenlab.sh/v1"
)

with open("german_audio.mp3", "rb") as audio_file:
    response = client.audio.translations.create(
        model="whisper-1",
        file=audio_file
    )

print(response.text)

import OpenAI from 'openai';
import fs from 'fs';

const client = new OpenAI({
  apiKey: 'sk-your-api-key',
  baseURL: 'https://api.tokenlab.sh/v1'
});

const response = await client.audio.translations.create({
  model: 'whisper-1',
  file: fs.createReadStream('german_audio.mp3')
});

console.log(response.text);

<?php
$ch = curl_init('https://api.tokenlab.sh/v1/audio/translations');

$file = new CURLFile('german_audio.mp3', 'audio/mpeg', 'german_audio.mp3');

curl_setopt_array($ch, [
    CURLOPT_RETURNTRANSFER => true,
    CURLOPT_POST => true,
    CURLOPT_HTTPHEADER => [
        'Authorization: Bearer sk-your-api-key'
    ],
    CURLOPT_POSTFIELDS => [
        'file' => $file,
        'model' => 'whisper-1'
    ]
]);

$response = curl_exec($ch);
curl_close($ch);

$data = json_decode($response, true);
echo $data['text'];

{
  "text": "Hello, my name is Wolfgang and I come from Germany. Where are you from?"
}

{
  "task": "translate",
  "language": "german",
  "duration": 8.470000267028809,
  "text": "Hello, my name is Wolfgang and I come from Germany. Where are you from?",
  "segments": [
    {
      "id": 0,
      "seek": 0,
      "start": 0.0,
      "end": 4.0,
      "text": " Hello, my name is Wolfgang and I come from Germany.",
      "tokens": [50364, 2425, 11, 452, 1315, 307, 25329, 293, 286, 808, 490, 5765, 13, 50564],
      "temperature": 0.0,
      "avg_logprob": -0.45,
      "compression_ratio": 1.0,
      "no_speech_prob": 0.0
    }
  ]
}

Traducción vs Transcripción

Característica	Traducción	Transcripción
Idioma de salida	Siempre inglés	Igual que la entrada
Caso de uso	Convertir audio en otro idioma a inglés	Preservar el idioma original
Parámetro de idioma	No aplicable	Pista opcional

El endpoint de traducción detecta automáticamente el idioma de origen y traduce al inglés. El parámetro language de la transcripción se ignora.

Crear transcripción Realtime WebSocket

​Resumen

​Cuerpo de la solicitud

​Respuesta

​Traducción vs Transcripción

Resumen

Cuerpo de la solicitud

Respuesta

Traducción vs Transcripción