Real-time multimodal API capabilities announced by OpenAI

02 Oct 2024

OpenAI recently released a new set of APIs in beta that return text and audio responses in real time over WebSockets. Towards the end, the announcement also covers third-party integrations with popular communication platforms such as Twilio, which makes this an interesting development.

A documentation guide to get you started can be found here, along with a sample project codebase on GitHub.
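To give a rough idea of what talking to the API over a WebSocket looks like, here is a minimal sketch of the message handling only, with no network code. The endpoint URL, the `OpenAI-Beta` header, and the event names (`response.create`, `response.text.delta`, `response.audio_transcript.delta`, `response.audio.delta`, `response.done`) are my assumptions based on the beta documentation and may change, so check the official guide before relying on them. `ResponseCollector` and the helper functions are hypothetical names made up for this example.

```python
import base64
import json

# Assumed beta endpoint and model name; verify against the current docs.
REALTIME_URL = (
    "wss://api.openai.com/v1/realtime"
    "?model=gpt-4o-realtime-preview-2024-10-01"
)


def build_headers(api_key: str) -> dict:
    """Headers assumed to be required when opening the WebSocket."""
    return {
        "Authorization": f"Bearer {api_key}",
        "OpenAI-Beta": "realtime=v1",
    }


def build_response_request(instructions: str) -> str:
    """Serialize a client event asking for a text and audio response."""
    event = {
        "type": "response.create",
        "response": {
            "modalities": ["text", "audio"],
            "instructions": instructions,
        },
    }
    return json.dumps(event)


class ResponseCollector:
    """Hypothetical helper that accumulates streamed server events."""

    def __init__(self) -> None:
        self.text_parts = []
        self.audio = bytearray()
        self.done = False

    def handle(self, raw_message: str) -> None:
        event = json.loads(raw_message)
        kind = event.get("type")
        if kind in ("response.text.delta", "response.audio_transcript.delta"):
            # Text (or the transcript of the audio) arrives in small chunks.
            self.text_parts.append(event["delta"])
        elif kind == "response.audio.delta":
            # Audio chunks are assumed to be base64-encoded bytes.
            self.audio.extend(base64.b64decode(event["delta"]))
        elif kind == "response.done":
            self.done = True
        # Any other event type is ignored in this sketch.

    @property
    def text(self) -> str:
        return "".join(self.text_parts)
```

In a real client you would open `REALTIME_URL` with a WebSocket library of your choice, pass the headers from `build_headers`, send the string from `build_response_request`, and feed every incoming message to `ResponseCollector.handle` until `done` becomes true.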

Third-party integrations with the Realtime API are available from the following three vendors:

Hope you build some interesting applications using it!

