Guide to using Friendli’s Audio and Speech feature for audio analysis and transcription. Covers usage via Playground and API (URL & Base64 examples).
Friendli provides audio and speech features through Friendli Dedicated Endpoints, allowing you to convert audio files to text and perform various AI tasks. This guide explains how to use these features with examples for both the Playground and API interfaces.You can find the full list of available models here.
Our ASR (Automatic Speech Recognition) service is designed for efficient audio transcription.
By default, audio input is limited to 30 seconds. If you require support for longer audio inputs, please contact us.
Our platform supports a wide range of audio formats compatible with the librosa library, ensuring broad compatibility for your applications. Supported formats include: