Audio and Speech

AI agents that specialize in audio processing and speech-related tasks. These agents can transcribe audio, synthesize speech, perform language translation, and analyze voice data for various applications, including voice assistants, transcription services, and audio editing.

34 Compatible Agents
 Song GPT

ShurkaWake

AI-powered tool for generating unique music compositions and lyrics using advanced language models.

VFX & Animation Audio Generation Generative AI
Swiftink

Swiftink

AI transcription tool converting audio/video into precise text, supporting 95+ languages.

Speech Recognition Task Automation Data Analysis
DJ NOVA

DJ NOVA

Virtual DJ revolutionizing music creation and performance with autonomous music generation capabilities.

AI Agents Audio Generation Generative AI

AI sales agent designed to manage end-to-end sales processes, from lead engagement to deal closures.

Speech Recognition AI Agents Sales Agent
Sixeye

© 2025 Sixeye

Agent Platform offering voice agents and remote management solutions for professional control systems.

AI Agents AI Agent Platform Virtual Assistant
LMNT

© 2024 LMNT

AI-powered voice synthesis for text-to-speech, voice cloning, and real-time audio generation.

Task Automation Audio Generation Voice
InnerVoice

Inner Voice

AI-driven mental wellness platform offering emotion analysis, CBT tools, and personalized support.

Chatbots AI Agents Healthcare
Digitar AI

Digitar AI Inc.

Real-time voice AI solutions, enhancing customer and employee interactions with intelligent speech-to-speech.

Speech Recognition Audio Generation Customer Service Agent
Deepgram

Deepgram Inc.

Deepgram provides advanced speech-to-text, text-to-speech, and language intelligence capabilities.

Speech Recognition Data Analysis Audio Generation
Ultravox AI

2024 Fixie

Ultravox.ai offers AI-powered voice solutions for transcription, audio generation, and more.

Speech Recognition Audio Generation Customer Service Agent
Choruz AI

Choruz AI

Choruz AI is the first Web3 music platform powered by AI, revolutionizing music creation and distribution.

Task Automation VFX & Animation Generative AI
Papercup Ai

Papercup Ltd.

An AI-powered dubbing platform that translates and localizes video content into multiple languages using voice

AI Agents Audio Generation Voice
Lovo Ai

Lovo Inc.

An AI-driven platform offering realistic text-to-speech voice generation and video editing tools.

Video Editing Audio Generation Text Generation
Murf Ai

Murf Ai

AI that converts text into lifelike voiceovers, offering over 120 voices in 20+ languages for diverse apps.

AI Agents Audio Generation Productivity
AI Voice Agents

AI Voice Agents Platform

Platform for creating, licensing, and monetizing personalized voice models.

AI Agents Audio Generation AI Agent Platform
Krisp

Krisp Technologies

AI-powered noise cancellation tool for clear audio during calls and recordings.

AI Agents Task Automation Audio Generation
Siri

Apple

Voice-activated assistant for Apple devices with a focus on privacy and integration.

AI Agents Task Automation Decision Support
ElevenLabs

ElevenLabs

Best text-to-speech models and conversational AI platform.

Speech Recognition AI Agents Voice
Rask AI

Rask AI

AI-powered translation and dubbing tool for videos and audio.

Video Generation Audio Generation Generative AI
Adauris

Adauris ai

AI-powered text-to-audio conversion for content creators and publications.

Audio Generation Generative AI Text Generation
Resemble AI

Resemble AI

AI-powered voice generation platform for creating personalized, synthetic voices.

AI Agents Audio Generation Generative AI
Uberduck

Uberduck.ai

AI platform for voice cloning and text-to-speech generation.

Image Generation Generative AI Voice
Speechify

Speechify Inc.

AI-powered Text-to-Speech and Voice Cloning platform.

Audio Generation Generative AI Productivity
VoiceSpin

VoiceSpin

AI-powered contact center solutions for efficient customer service and streamlined quality assurance.

AI Agents Task Automation Sales Agent
Speechly

Speechly AI 2024

Real-time speech recognition for building voice-enabled apps.

Speech Recognition Audio Generation Voice

Build robust AI voice and multimodal agents with LiveKit's framework, tailored for real-time interaction.

Speech Recognition AI Agents Task Automation
xgager.com

Tom van den Bogaart

Engage Better on X

Tools Library