Dec. 13 (Portaltic/EP) –
OpenAI has announced a new feature for ChatGPT’s Advanced Voice Mode that integrates real-time video processing, allowing the chatbot to give more specific responses to user requests based on their context and on what it can identify through the device’s cameras.
The company announced a week ago that it would hold a 12-day series of announcements, during which it has already unveiled the o1 artificial intelligence (AI) model, a new ChatGPT Pro tier and the general availability of the Sora video tool.
In these sessions, it has also referred to Advanced Voice Mode, a feature that it announced alongside its GPT-4o model and that lets users choose among several voices to customize their interaction with the ‘chatbot’.
Although this feature was due to be tested with a group of users in July, OpenAI announced that it was delaying the launch to continue testing its reliability. It finally began rolling out in September to Plus and Team subscribers, although users in the European Union, Switzerland, Iceland, Liechtenstein and Norway were excluded.
The company has now announced the addition of video input to Advanced Voice Mode, which will allow the multimodal model to process images in real time, as well as to access the applications in use on the device through the ‘Share Screen’ option.
As a result of this integration, “Advanced Voice Mode conversations will have a much more natural rhythm” and users will be able to explore aspects such as the rhythm or tone of voice in more than 50 languages, as company representatives explained in a video.
Thanks to this feature, which works with either the front or rear camera, ChatGPT will be able to tell a person which steps to follow to prepare a coffee using the items it identifies in front of the lens.
Likewise, with ‘Share Screen’, the user can ask the ‘chatbot’ for help in carrying out actions. For example, when replying to a message in the smartphone’s Messages application, it will give the relevant instructions for responding in the chosen tone.
OpenAI has confirmed that it will bring this feature to Europe “as soon as it can” and that it will offer early access to subscribers of the Enterprise and Edu plans before 2025. Likewise, it has announced that it has customized ChatGPT’s Advanced Voice Mode with a Santa Claus mode for the Christmas holidays.
Santa Claus mode can be activated throughout the month of December by clicking on the snowflake icon, which appears next to the message bar, or through Voice Settings. This feature works in the mobile applications for iOS and Android and in the web version of ChatGPT.