Gladia I Audio Transcription API

Accelerate your roadmap with top-tier models for speech recognition and analysis

Transcribe calls in milliseconds

Gladia’s speech-to-text engine converts calls and meetings into text in real time or asynchronously, making it easy to integrate conversational features, advanced note-taking and search into your platform.

Key information and insights with no errors

Leveraging our audio intelligence add-ons, you can retrieve key information and insights in real time for meeting notes, CRM enrichment and other LLM-powered capabilities of your product. 100% accuracy where it matters the most.

Integrate advanced real-time guidance features

Our real-time transcription API is optimized to provide next-best-action recommendations to customer support and sales agents while on-call. Compatible with SIP and WebSockets.

COMING SOON

Build AI conversational call agents and bots

Gain access to the best ASR models and tools to create autonomous AI voice agents capable of understanding speech and handling more complex customer queries in real-time.

Trusted by leading organizations worldwide
‍Here's what they're saying

"There’s a lot more than one can get out of audio than just transcription, and Gladia understood that. Feature rollouts are proactive, and anticipate our needs as a platform. Their API performs very well with noisy telephony and stereo audio and does an excellent job with languages."

Alexandre Bouju

CTO Deputy Manager

"The quality of the output from our platform, everything that we do based on this transcription became better after we switched to Gladia."

Valentin van Gastel

VP of Product & Engineering

"We are 100% benchmark and evaluation driven. Gladia was one of the best providers selected on merit to transcribe user videos, especially for non-English languages. Their reactive customer support and data compliance make their offer really compelling."

Kojo Hinson

Group Engineering Manager

"We initially attempted to host Whisper Al, which required significant effort to scale. Switching to Gladia's transcription service brought a welcome change."

Robin Lambert

CPO

"Having tried numerous speech-to-text solutions, I can confidently say: Gladia's API outshines the rest. Their balance of accuracy, speed, and precise word timings is unparalleled."

Jean Patry

Co-founder

"Gladia has a clear-cut advantage when it comes to European languages. With their API, we acquired new users in countries like Finland and Sweden, who say it's the best transcription they've ever tried."

Lazare Rossillon

CEO

"It's the first time we've been able to transcribe video with such accuracy and speed - including when the conversation is technical. Whatever the language or accent, the quality is always there."

Robin Bonduelle

CEO

Alexandre Bouju

CTO Deputy Manager

"The quality of the output from our platform, everything that we do based on this transcription became better after we switched to Gladia."

Valentin van Gastel

VP of Product & Engineering

Kojo Hinson

Group Engineering Manager

"We initially attempted to host Whisper Al, which required significant effort to scale. Switching to Gladia's transcription service brought a welcome change."

Robin Lambert

CPO

"Having tried numerous speech-to-text solutions, I can confidently say: Gladia's API outshines the rest. Their balance of accuracy, speed, and precise word timings is unparalleled."

Jean Patry

Co-founder

Lazare Rossillon

CEO

"It's the first time we've been able to transcribe video with such accuracy and speed - including when the conversation is technical. Whatever the language or accent, the quality is always there."

Robin Bonduelle

CEO

High precision and instant results at no deployment cost

Built for developers

"In less than a day of dev work we were able to release a state-of-the-art speech-to-text engine!"

Xavier G

CTO at ScorePlay

 async function makeFetchRequest(url: string, options: any) {
  const response = await fetch(url, options);
  return response.json();
}
 
async function pollForResult(resultUrl: string, headers: any) {
  while (true) {
    console.log("Polling for results...");
    const pollResponse = await makeFetchRequest(resultUrl, { headers });
 
    if (pollResponse.status === "done") {
      console.log("- Transcription done: \n ");
      console.log(pollResponse.result.transcription.full_transcript);
      break;
    } else {
      console.log("Transcription status : ", pollResponse.status);
      await new Promise((resolve) => setTimeout(resolve, 1000));
    }
  }
}
 
async function startTranscription() {
  const gladiaKey = "YOUR_GLADIA_API_TOKEN";
  const requestData = {
    audio_url:
      "YOUR_AUDIO_URL",
  };
  const gladiaUrl = "https://api.gladia.io/v2/transcription/";
  const headers = {
    "x-gladia-key": gladiaKey,
    "Content-Type": "application/json",
  };
 
  console.log("- Sending initial request to Gladia API...");
  const initialResponse = await makeFetchRequest(gladiaUrl, {
    method: "POST",
    headers,
    body: JSON.stringify(requestData),
  });
 
  console.log("Initial response with Transcription ID :", initialResponse);
 
  if (initialResponse.result_url) {
    await pollForResult(initialResponse.result_url, headers);
  }
}
 
startTranscription();

 import requests
import time
 
def make_fetch_request(url, headers, method='GET', data=None):
    if method == 'POST':
        response = requests.post(url, headers=headers, json=data)
    else:
        response = requests.get(url, headers=headers)
    return response.json()
 
gladia_key = "YOUR_GLADIA_API_TOKEN"
request_data = {"audio_url": "YOUR_AUDIO_URL"}
gladia_url = "https://api.gladia.io/v2/transcription/"
 
headers = {
    "x-gladia-key": gladia_key,
    "Content-Type": "application/json"
}
 
print("- Sending initial request to Gladia API...")
initial_response = make_fetch_request(gladia_url, headers, 'POST', request_data)
 
print("Initial response with Transcription ID:", initial_response)
result_url = initial_response.get("result_url")
 
if result_url:
    while True:
        print("Polling for results...")
        poll_response = make_fetch_request(result_url, headers)
        
        if poll_response.get("status") == "done":
            print("- Transcription done: \n")
            print(poll_response.get("result", {}).get("transcription", {}).get("full_transcript"))
            break
        else:
            print("Transcription status:", poll_response.get("status"))
        time.sleep(1)

ss Lower AI infrastructure costs. We leverage a proprietary know-how to fit more AI on less hardware — without compromising on quality and performance.

ss Technical edge. With Gladia, you get access to an optimized version of the most sophisticated ASR models and regular software upgrades at no extra cost.

ss Reduced time-to-market. By embedding advanced AI into your applications directly, your users can derive full value from your product from day one.

ss Easy-to-scale. Increase your processing capacity easily with our pay-as-you go system. Our enterprise-grade API is built to adapt to your ever-growing needs.

All your questions. Answered.

What are the key features of Gladia’s audio transcription API?

What languages does Gladia’s speech-to-text API support?

How can I get started with implementing Gladia’s API in my product?

How does Gladia’s Speech-to-Text API work?

Do you offer support for multiple programming languages?

What audio formats does Gladia support?

What type of companies use Gladia’s audio transcription API?

Is Gladia secure?

	async function makeFetchRequest(url: string, options: any) {
	const response = await fetch(url, options);
	return response.json();
	}

	async function pollForResult(resultUrl: string, headers: any) {
	while (true) {
	console.log("Polling for results...");
	const pollResponse = await makeFetchRequest(resultUrl, { headers });

	if (pollResponse.status === "done") {
	console.log("- Transcription done: \n ");
	console.log(pollResponse.result.transcription.full_transcript);
	break;
	} else {
	console.log("Transcription status : ", pollResponse.status);
	await new Promise((resolve) => setTimeout(resolve, 1000));
	}
	}
	}

	async function startTranscription() {
	const gladiaKey = "YOUR_GLADIA_API_TOKEN";
	const requestData = {
	audio_url:
	"YOUR_AUDIO_URL",
	};
	const gladiaUrl = "https://api.gladia.io/v2/transcription/";
	const headers = {
	"x-gladia-key": gladiaKey,
	"Content-Type": "application/json",
	};

	console.log("- Sending initial request to Gladia API...");
	const initialResponse = await makeFetchRequest(gladiaUrl, {
	method: "POST",
	headers,
	body: JSON.stringify(requestData),
	});

	console.log("Initial response with Transcription ID :", initialResponse);

	if (initialResponse.result_url) {
	await pollForResult(initialResponse.result_url, headers);
	}
	}

	startTranscription();

	import requests
	import time

	def make_fetch_request(url, headers, method='GET', data=None):
	if method == 'POST':
	response = requests.post(url, headers=headers, json=data)
	else:
	response = requests.get(url, headers=headers)
	return response.json()

	gladia_key = "YOUR_GLADIA_API_TOKEN"
	request_data = {"audio_url": "YOUR_AUDIO_URL"}
	gladia_url = "https://api.gladia.io/v2/transcription/"

	headers = {
	"x-gladia-key": gladia_key,
	"Content-Type": "application/json"
	}

	print("- Sending initial request to Gladia API...")
	initial_response = make_fetch_request(gladia_url, headers, 'POST', request_data)

	print("Initial response with Transcription ID:", initial_response)
	result_url = initial_response.get("result_url")

	if result_url:
	while True:
	print("Polling for results...")
	poll_response = make_fetch_request(result_url, headers)

	if poll_response.get("status") == "done":
	print("- Transcription done: \n")
	print(poll_response.get("result", {}).get("transcription", {}).get("full_transcript"))
	break
	else:
	print("Transcription status:", poll_response.get("status"))
	time.sleep(1)

Audio infrastructure

to transform

note-taking

customer support

sales assistance

user experience

note-taking

Audio infrastructure

to transform

note-taking

customer support

sales assistance

user experience

note-taking

Audio infrastructure

to transform

note-taking

customer support

sales assistance

user experience

note-taking

Everything starts with reliable transcription. From async to live streaming, our API empowers your platform with accurate, multilingual speech-to-text and actionable insights.

Accelerate your roadmap with top-tier models for speech recognition and analysis

Transcribe calls in milliseconds

Key information and insights with no errors

Integrate advanced real-time guidance features

Build AI conversational call agents and bots

Trusted by leading organizations worldwide‍Here's what they're saying

High precision and instant results at no deployment cost

High precision and instant results at no deployment cost

High precision and instant results at no deployment cost

Latency

Accuracy

Language support

Quick integration

High security at scale

Optimized for enterprise use cases

Customer experience

Sales enablement

Meeting assistants

Content and media

Built for developers

From audio to knowledge

Subscribe to receive latest news, product updates and curated AI content.

All your questions. Answered.

Trusted by leading organizations worldwide
‍Here's what they're saying