ChatGPT's new vision feature interpreting images


By MYBRANDBOOK


ChatGPT's new vision feature interpreting images

The vision feature of OpenAI's ChatGPT now enables users to communicate with the chatbot through the use of photographs. This capability has been used by users for a variety of tasks, including creating code from screenshots and understanding diagrams. The new features will offer a more user-friendly interface, bridging the gap between verbal and visual comprehension. 

 

OpenAI is using the multimodal abilities of GPT-3.5 and GPT-4 in order to power the Image understanding of ChatGPT. ChatGPT vision feature is currently only available to the company's Plus and Enterprise users and OpenAI has promised to make it available for developers in the coming weeks. 

 

According to OpenAI, the new voice and image capabilities in ChatGPT offer “a new, more intuitive type of interface by allowing you to have a voice conversation or show ChatGPT what you're talking about”. Several users shared on X (formerly Twitter) how they leveraged ChatGPT's vision feature. 

 

A user leveraged ChatGPT's vision capabilities to help understand the diagram of a human cell, pointing out a potential use case for the chatbot in the education sector. Another user asked ChatGPT to help understand the meaning of an image and the chatbot complied by giving a point-by-point explanation on the topic. 

 E-Magazine 
 VIDEOS  Placeholder image

Copyright www.mybrandbook.co.in @1999-2024 - All rights reserved.
Reproduction in whole or in part in any form or medium without express written permission of Kalinga Digital Media Pvt. Ltd. is prohibited.
Other Initiatives : www.varindia.com | www.spoindia.org