
OpenAI Audio Speech

Text-to-speech endpoint that generates an audio file from input text.

Endpoint information

Item            Value
Method          POST
Path            /v1/audio/speech
Full URL        https://api.gemai.cc/v1/audio/speech
Authentication  Authorization: Bearer sk-your-token

Request body example

json
{
  "model": "tts-1",
  "voice": "alloy",
  "input": "你好,欢迎使用哈基米 API。",
  "response_format": "mp3"
}

Code examples

bash
curl --location 'https://api.gemai.cc/v1/audio/speech' \
  --header "Authorization: Bearer $GEMAI_API_KEY" \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "tts-1",
    "voice": "alloy",
    "input": "你好,欢迎使用哈基米 API。",
    "response_format": "mp3"
  }' \
  --output speech.mp3

javascript
import fs from 'node:fs/promises'

const response = await fetch('https://api.gemai.cc/v1/audio/speech', {
  method: 'POST',
  headers: {
    Authorization: `Bearer ${process.env.GEMAI_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: 'tts-1',
    voice: 'alloy',
    input: '你好,欢迎使用哈基米 API。',
    response_format: 'mp3',
  }),
})

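// Stop on HTTP errors so an error body is not written out as audio.
if (!response.ok) {
  throw new Error(`Request failed: ${response.status} ${await response.text()}`)
}

// The body is binary audio; write it to disk unchanged.
const buffer = Buffer.from(await response.arrayBuffer())
await fs.writeFile('speech.mp3', buffer)

python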
import os
import requests

response = requests.post(
    'https://api.gemai.cc/v1/audio/speech',
    headers={
        'Authorization': f"Bearer {os.environ['GEMAI_API_KEY']}",
        'Content-Type': 'application/json',
    },
    json={
        'model': 'tts-1',
        'voice': 'alloy',
        'input': '你好,欢迎使用哈基米 API。',
        'response_format': 'mp3',
    },
    timeout=120,
)

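# Raise on HTTP errors so an error body is not written out as audio.
response.raise_for_status()
# The body is binary audio; write it in binary mode.
with open('speech.mp3', 'wb') as file:
    file.write(response.content)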

Response

On success, the endpoint typically returns binary audio content. Save it as a file; do not parse it as JSON.
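The handling described above could be sketched as follows. This is a minimal illustration, not part of the documented API: the helper name `save_speech` is hypothetical, and it assumes that failed requests come back with a JSON `Content-Type`, which this page does not confirm.

```python
import json
from pathlib import Path


def save_speech(content_type: str, body: bytes, path: str) -> int:
    """Write audio bytes to `path` and return the number of bytes written.

    Assumption: error responses carry a JSON Content-Type. The page only
    states that successful responses are binary audio.
    """
    if content_type.lower().startswith("application/json"):
        # Decode only to build an error message; audio is never parsed as JSON.
        try:
            detail = json.loads(body.decode("utf-8"))
        except (UnicodeDecodeError, json.JSONDecodeError):
            detail = "<unreadable error body>"
        raise RuntimeError(f"speech request failed: {detail}")
    # write_bytes opens the file in binary mode, so the audio is left unmodified.
    return Path(path).write_bytes(body)
```

With the `requests` example above, this might be called as `save_speech(response.headers.get('Content-Type', ''), response.content, 'speech.mp3')`.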

Notes

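  • Available output formats and voices depend on what the model actually supports.
  • For long text, generate the audio in segments so that no single request is too long.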
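One way to segment long text before sending it is sketched below. The function name `split_text` and the default `max_chars=1000` are assumptions chosen for illustration; this page does not state the actual per-request limit, so adjust the value to whatever the model accepts.

```python
import re


def split_text(text: str, max_chars: int = 1000) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence ends.

    max_chars is a hypothetical limit, not a documented one.
    """
    if max_chars < 1:
        raise ValueError("max_chars must be positive")
    # Break after sentence-ending punctuation (Chinese and Latin).
    sentences = [s for s in re.split(r"(?<=[。!?.!?])", text) if s.strip()]
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # Hard-split any single sentence that exceeds the limit on its own.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        # Start a new chunk when this sentence would overflow the current one.
        if len(current) + len(sentence) > max_chars:
            chunks.append(current)
            current = ""
        current += sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be sent as the `input` of its own request; how the resulting audio files are joined or played back is left to the caller.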