This workflow is ideal for businesses or developers wanting to integrate voice-based chat applications with dynamic responses and conversational memory.
It enables AI-powered voice conversations that maintain context between sessions, automatically converting speech-to-text and text-to-speech for a seamless experience.
The workflow receives audio input, transcribes it using OpenAI, processes the conversation with the Google Gemini Chat Model (or alternatively, OpenAI Chat Model), and converts responses back to speech via ElevenLabs.
You'll need API keys for:
Google AI Studio
)"Path"
parameter to (voice_message)
, which will act as the parameter name for the voice message in the HTTP Post request.Your connected stack awaits to automate AI workflows with 24-7 uptime performance and engagement
"I highly recommend Rantir, they are a great dev team with quick turn around on all projects and requests. We recently worked with them on updating our website and any changes, updates or modifications I needed were always taken care of quickly!"
"The team at Rantir has lived up to every definition of the word "partner". They're adaptive, fast, and flexible (all the things you'd hope for). We're so thrilled with what we've accomplished so far and look forward to working alongside them in the future."
"Working with the Rantir team was a pleasure. They guided us through the whole process from design to implementation, creating a great site on a tight deadline. They were responsive and adaptable throughout, and we'd be happy to work with them again in the future."
"Working with the Rantir team early on made combined design and development with early conversations to implement AI within Onder. We were happy to work together to help bring no-code, with code and AI."
Rantir University for learning how to build powerful AI Agents & Software you own.