Google Photos’ new AI feature lets users edit pictures with their voice

Conversational AI is now turning heads, as it brings ease and comfort to common users and pros. Even those who are annoyed by typing and chatting can now enjoy the features fully, simply by using a voice note in plain language. According to a Google spokesperson via TechJuice, “Instead of navigating complex menus, you can now describe the edits you want in plain language.” 

Now they don’t need those manual tools and sliders. The feature, powered by Gemini AI, is launching for U.S. Android users. How will it actually help the user? Google’s blog stated it as follows: ‘Just tap ‘Help me edit’ in the editor, describe what you want changed, and voila, Photos takes care of it for you, with a little help from advanced Gemini capabilities.’

Feature Introduction

Feature Introduction

In Google Photo Editor, users typically type phrases like “help me edit” or “edit my photo,” but now, the edit can also be requested through voice searches. Google Blog announcement was “We’re expanding the ability to edit your images by simply asking in Google Photos.” 

The platform offers instant edits with minimal effort, allowing for simple text or voice notes, such as “brighten the image”, “remove its background”, “enhance the resolution”, “set dimensions to….”, and more. 

As soon as the user types, the Gemini AI interprets the request and applies the changes instantly. It offers both types of editing, ranging from basic fixes (e.g., changing background color, incorporating one more image, and text correction) to advanced levels (e.g., adding sunglasses, changing effects to a day or night effect).

Technology Behind It

The software is built on the Google Gemini AI model. It is designed to understand natural language and make necessary changes according to the instructions. 

Its surface image processing is quite simple, although the image creation process behind it involves multiple steps. Users can complete multiple editing tasks in a single prompt, like he can say “brighten her face, make the hair curly, add some makeup, and show her sitting in front of a laptop”.

As stated by iGeeksBlog coverage of Google’s launch, “Gemini AI lets you describe multiple edits at once, from lighting tweaks to background swaps.” 

It uses CPA credentials for transparency. C2PA helps software tag AI-edited images with metadata, boosting transparency in the era of deepfake viral content. According to the TechCrunch report on C2PA transparency, “Edited photos include metadata showing when and how Gemini AI made changes.” 

User Experience

  • The feature is limited to U.S 18-plus-aged citizens right now. 
  • Right now, it supports only the English language. 
  • The app doesn’t require a complete restart to refine the images. They can simply send follow-up prompts and get instant results within seconds.
  • The most important point is that the original photos remain accessible throughout the editing process. The user can refine them if they feel the results are not satisfactory.

It Made Photo Enhancement Accessible To All Skill Levels

The feature has already been launched. Initially, it was only compatible with the Pixel 10. However, now every eligible Android device can be connected to it. To date, there is no confirmed timeline for the launch in other countries worldwide. However, according to Google Vision, it appears that this will happen soon. 

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *