Grok can now process and answer image queries with a new feature
Elon Musk, the founder of artificial intelligence (AI) company xAI, announced a new feature for Grok on Monday. The internal AI chatbot will now have the ability to understand images, allowing it to process and analyze the content of an image. Users can now upload an image and ask the AI questions based on it. Notably, xAI released the Grok-2 AI model in August. The company announced at the time that the AI model would soon support different modalities.
Grok AI gains the ability to understand images
In one after on X (formerly known as Twitter), Grok’s official handle announced the new image understanding capability for the AI chatbot. Image understanding, also called computer vision, allows an AI system to see and process visual data in an image or video. Currently this capability is only available for static images.
Musk too posted about the new feature, highlighting that the AI chatbot can perform a deeper analysis of the image and even explain the meaning of a visual joke. The billionaire shared an example and asked Grok to explain a joke in an image. The AI was able to explain the premise of the joke, the twist and the visual joke within it.
However, computer vision is not a new capability for AI systems, and almost every major AI model offers this feature, including Gemini, ChatGPT, Copilot, Claude, and more. An X user highlighted this and expressed concern that Grok is still missing many basic features.
In one remark In Musk’s post, the user said the AI chatbot still lacks file upload and image generation capabilities. The billionaire entrepreneur responded, “Not for long. We will be ready in a few months, what everyone else would have done in years.” These capabilities may be added to Grok in the near future.
In August, xAI released the Grok-2 and Grok-2 Mini AI models, as an upgrade to the pilot version of the large language model (LLM). Both models are available in the Grok chatbot for X Premium and X Premium+ users. The company claimed it outperformed both the Claude 2.5 Sonnet and GPT-4 Turbo AI models.