The Cartesia MCP server enables AI clients like Cursor, Claude Desktop, and OpenAI agents to interact with Cartesia's API for speech localization, text-to-audio conversion, voice infilling, and voice transformation capabilities.
List all available voices from the Cartesia API.
Convert text to audio using a specified voice from Cartesia.
Localize a voice to a different language using Cartesia's API.
Infill audio between two specified audio files using Cartesia.
Modify an audio file to use a different voice via Cartesia.
{
"mcpServers": {
"cartesia-mcp": {
"command": "<absolute-path-to-executable>",
"env": {
"CARTESIA_API_KEY": "<insert-your-api-key-here>",
"OUTPUT_DIRECTORY": "// directory to store generated files (optional)"
}
}
}
}