OpenAI: MultiModal Chat AI action
- Updated: 2024/07/10
OpenAI: MultiModal Chat AI action
The OpenAI: MultiModal Chat AI action allows you
to integrate OpenAI
gpt-4o
and OpenAI's vision capabilities into your workflows. This means
your automations can now process and answer questions about images, going beyond just
text-based interactions.
Prerequisites
- You must have the Bot creator role to use the OpenAI MultiModal Chat AI action in a bot.
- Ensure that you have the necessary credentials to send a request and have included OpenAI: Authenticate action before calling any OpenAI actions.
This example shows how to send multiple images using the OpenAI MultiModal Chat AI actions and ask questions about what is present in the images.
Procedure
The response of the above automation is as follows: