AI Image Generation Showdown: Copilot, ChatGPT, and Gemini Face Off!

The world of AI-powered image generation is heating up! Microsoft Copilot has recently upped its game, now boasting enhanced image generation and even some editing features, reportedly powered by the same GPT-4o model as ChatGPT. But how does it stack up against established players like ChatGPT and the ever-evolving Gemini? I put them to the test, and the results were, let's say, enlightening – especially for Gemini.



Copilot Steps Up: From Photo to Anime

My first experiment was with Copilot. I uploaded a photo of myself (just about a year old, so fairly recent!) and prompted it: "turn this image into Japanese anime style." The result? Pretty impressive! Copilot kept me recognizable while turning the photo into a convincing anime look.

Interestingly, when I tried the same prompt with Gemini initially, it told me it was still learning or that the request was against its guidelines. Gemini often seems a bit more restricted than its counterparts. However, trying again later yielded... well, a result that left me speechless, and not in a good way.

ChatGPT, on the other hand, handled the anime transformation remarkably well. It picked up on details like my beard and mustache, and even when I asked it to remove my hand, it maintained the character's integrity.

I even tried the reverse with Copilot – turning an anime picture into a realistic one – and it did an outstanding job, which is quite impressive for a free tool!

Editing Test: Adding Objects to Images

Next, I wanted to see how these AI tools handled image editing, specifically adding an object. I uploaded the same photo and asked each platform to "put a basketball in my hand."

Copilot's Attempt: Image generation, especially editing, can be slow. After a few minutes, Copilot produced an image. It added the basketball, but it also seemed to regenerate my face, making me look like an older version of myself. It wasn't quite me anymore. On the plus side, Copilot on the web (as opposed to the app) did provide landscape images as requested.

Gemini's Surprise: Gemini, which struggled with style transformation, was surprisingly adept here. It's generally faster than other image generators, and this is where it shone. Gemini successfully added the basketball to my hand without altering my face or the main subject. The original me was still there, just holding a basketball. This seems to be Gemini's strong suit: adding or removing objects without disturbing the rest of the image. I've tested this multiple times, and Gemini consistently handles it better than the others.

The Verdict: Which AI Reigns Supreme for Images?

Based on these tests, here's a quick rundown:

  • For Transforming Image Styles (e.g., photo to anime, realistic to cartoon): ChatGPT seems to be the most proficient, delivering high-quality and accurate transformations. Microsoft Copilot is a strong runner-up and a great free alternative, especially now that it reportedly runs on GPT-4o.

  • For Editing Images (adding/removing objects, replacing backgrounds): Gemini is, surprisingly, the winner here. It excels at seamlessly adding or removing elements without altering the main subject of the image.

  • Speed: Gemini is generally the fastest for generating images, especially for edits. ChatGPT can be slow, particularly the free version.

I have also made a video on this topic, so watch it below to see all three in action.

Check out my other posts too. I share tutorials and tech tips, so maybe you will find something useful 😉.