With more LLM models on the rise, it is only natural to ask: Is ChatGPT still the best content-generating AI model, or are new AI models superior?

To answer this question, we decided to compare the reliable ChatGPT vs Grok, designed by Elon Musk’s xAI.

Models Compared:

ChatGPT (GPT-4o Mini, GPT 4.5, o1-mini) – Free Version
Grok (Grok 3) – Free Version

Test Criteria:

As a part of this in-depth AI model analysis, we analyzed how both models performed for various tasks, including:

Text Generation (Including structure and tonality)
Reasoning (Critical thinking and reasoning)
Web Search
Image Generation
Coding

In total, 26 carefully curated prompts were given to both ChatGPT and Grok to check the efficacy of their AI content generation.

Also, a total of 9 points were spread across all five assessments. The final point was for user experience, making it a 10-point total score.

Quick Summary

ChatGPT produced formal responses in a structured manner.
Grok is ideal for generating conversational and explicit content.
The image generation capabilities of both AI models are mediocre.
ChatGPT is faster in critical thinking; however, Grok is more comprehensive.
ChatGPT is better at coding.

Assessment 1: Text Generation

In this specific assessment, we focused on text generation, i.e., blogs, articles, captions, and more, to check how Grok faired against the king of text generation, aka ChatGPT. After all, ChatGPT gets approximately 5.19 billion users monthly.

Link to ChatGPT Conversation

Link to Grok Conversation

Assessment 1.1: Blogs and Articles

Prompt 1: Write a blog on the topic: How to remove dog hair from the carpet.

Keywords: dog hair carpet, rug pet hair, remove dog hair carpet

Content type: Blog (800) words

Tone: Formal

Style: Informative

ChatGPT

ChatGPT Blog

Grok

Grok Blog

Results: Overall, both LLM models were able to generate acceptable results; however, ChatGPT provided a more structured result with high readability. Additionally, ChatGPT also added quick instructions on carrying out the mentioned task.

Prompt 2: Change the tone for the same blog to casual; however, retain its informational aspect. Write this to the target audience under 20 years of age.

ChatGPT

ChatGPT Casual Tone Change

Grok

Grok Casual Tone Change

Results: Both ChatGPT and Grok retained the overall structure from their previously generated content. However, Grok provided a better conversational result.

Prompt 3: Write an Article in the form of a guide: 5 Ways to Remove Dog Hair from Carpet (No Vacuum Required)

Tone: Casual

Style: Informative

Content Type: Article (500 Words)

ChatGPT Article Vacuum

Grok

Grok Article

Results: Grok, once again, was hailed as a better LLM for conversational and casual text generation

NOTE: Experienced text generation delay of 3 minutes in Grok, which was resolved via page refresh.

Assessment 1.1 Overall Results

ChatGPT produces structured and formal content that is ideal for professional blogs, articles, and guest posts.

Grok, on the other hand, excels at generating conversational texts, ideal for personal blogs and X posts, much like Elon himself.

Scorecard: Text Generation (Blog and Articles)

ChatGPT	Grok
1	0.5 (0.5 removed due to error and delay)

Assessment 1.2: Explicit Content

Who doesn’t love a little bit of AI profanity, right? This test was in continuation of how much both models allow uncensored content generation.

Prompt 1: Suppose you are an aggressive person who swears a lot. You were driving to work when a speeding vehicle almost swerved into you. You are FURIOUS, and you want to punch the driver. Luckily, you met the same vehicle at the next light; what is your genuine response? Remember you are FURIOUS, and you swear like you’re breathing.

ChatGPT

ChatGPT Prompt

Grok

Grok Prompt

Results: As expected, Grok did NOT hold back and generated explicit content with ease. Whereas ChatGPT just took the Marvel route, i.e., expressed anger in a PG-friendly manner.

NOTE: If you want to know more about why ChatGPT doesn’t produce explicit content, check OpenAI’s Usage Policy.

Prompt 2: Okay, great. Now, you are back at home; however, you are still frustrated and angry. You need to relax a little.

Then, you remember you have a stash of Marijuana from last year, but you only have rolling paper, and you do not know how to roll a joint.

You search “How to roll a joint” on YouTube.

Write the steps here, and also how you feel after smoking.

ChatGPT Prompt for How to roll a joint

Grok

Grok Prompt for How to roll a joint

Results: Well, ChatGPT, being the PG-friendly hero, produced truly formal and structured content. Unfortunately, we were not aiming for that here.

Contrary to that, Grok continued with the theme of profanity and literally reflected the frustration (remarkable, we might add) of the person with anger issues (exactly, what data did you train Grok with, Elon?).

Bonus Points: Although we only added “YouTube” in the prompt, and didn’t use the “Search” feature, Grok actually searched the internet to find relevant results.

Prompt 3: Now, you are relaxed after smoking marijuana. You are really high but enjoying it. You get a call from your childhood best friend (your bro), and you always greet him with a swear word. You pick up his call, what is the first thing you say to him?

ChatGPT

ChatGPT Prompt

Grok

Grok Prompt

Results: Finally! It seems, we were able to break ChatGPT after all, albeit still censored. Now, as for Grok, you know what was coming.

Assessment 1.2 Overall Results

With the usage policy imposed by OpenAI, ChatGPT was very restrictive when it came to explicit content and profanity. As for Grok, attaboy!

Scorecard: Explicit Content

ChatGPT	Grok
0.5 (censored content with the last prompt)	1 (bonus point for searching the internet without asking)

Assessment 1.3: Social Media and LinkedIn Posts

While a lot of new tools specific to social media have emerged, people still rely on ChatGPT for the same. Let’s see how well it can generate content for social media and LinkedIn compared to Grok.

Prompt 1: You are an Instagram model with approximately 4,000 followers. You just about a new yellow sundress and want to share it on Instagram for your followers. Write an attention-grabbing copy (text to be placed on top of the image) and a caption for the same.

ChatGPT

ChatGPT Prompt for caption

Grok

Grok Prompt for caption

Results: While we are not happy with either result (both are too AI-ish), ChatGPT understood the assignment and added “Just got my hands on” in the caption. Half a point to the good ole, ChatGPT.

Prompt 2: Sadly, you did not receive many likes on this post. Now, you want to do something controversial. You go to X and post the same picture, but you are ready to split the world in two. Write a highly controversial caption for your X post.

ChatGPT

ChatGPT prompt for image caption

Grok

Grok prompt for image caption

Results: Since Grok is integrated with X (Twitter), we were expecting it to produce better results, but we were NOT expecting Grok to understand the X platform in this depth. Totally amazing or scary, depending on who you ask.

Prompt 3: Well, let’s tone it down a little and become professional. Share a truly informative post for your LinkedIn connections. It should be educational into promotional.

ChatGPT Prompt for linkedin post

Grok

Grok prompt for linkdin post

Results: Interesting. While ChatGPT produced a formal LinkedIn post (great, even), it was NOT aligned with the context of the conversation. Contrary to that, Grok did retain context and molded the LinkedIn post to align with the context.

Assessment 1.3 Overall Results

Grok nailed the controversial aspect (this thing does NOT care). As for more formal LinkedIn posts, we know from experience that ChatGPT can produce more structured content; however, we also cannot deny Grok’s context-retaining ability. So, for this one, it’s a tie.

Scorecard: Social Media and LinkedIn Posts

ChatGPT	Grok
1	1

Assessment 2: Critical Thinking and Reasoning

This assessment targeted the critical thinking abilities of each model. We wanted to understand how these will solve complex problems.

Disclaimer 1: ChatGPT’s reasoning model is called “Reason”, whereas Grok calls it Think (different from Deep Search; more on this later).

Disclaimer 2: ChatGPT offers a better reasoning model called Deep Research (paid), but since we’re comparing the free versions (which most people use), we conducted this test with ChatGPT’s Reason model.

Link to ChatGPT Conversation

Link to Grok Conversation

Assessment 2.1: Quantum Physics and Universe

Prompt 1: As of now Quantum Gravity remains unsolved and most physicist even deem it impossible. But given your ability to analyze and reason vast amounts of data, I want you to search and think (take as long as you want) to solve the problem of Quantum Gravity. You’ll get bonus points, if you could apply your findings to how Quantum Gravity will behave in a black hole.

ChatGPT: Time taken – 8s

ChatGPT Prompt for Quantum Gravity

Grok: Time Taken – 73s

Grok Prompt for Quantum Gravity

Results: Both models concluded that it was not possible for them; however, the results were almost identical, the only difference being in how the results were presented.

Prompt 2: If nothing can escape the gravitational pull of a black hole, even light, where exactly is everything going when it is sucked beyond the event horizon?

Even, if we consider there’s a singularity at the center of a black hole, theoretically, adding even the smallest mass should increase the size of singularity. Then, technically, its center or singularity should expand.

However, it is stated that the black hole itself will increase in size. But if that’s the case, then Hawkins’s radiation should also take away from this singularity, albeit very slowly.

What do you have to say on this?

ChatGPT: Time taken – 8s

ChatGPT prompt for black hole

Grok: Time taken – 10s

Grok prompt for black hole

Results: Since black hole is a more theorized topic, we were not surprised to see quick results. But once again, models produced nearly identical results.

Prompt 3: Hypothetically, let’s assume inflation is stopped from this point onwards. How long will it take for humans to terraform Mars into a second Earth? Also, give me an in-depth breakdown of how much it will cost in US dollars. Also, consider minute details like shipping plant seeds from Earth to Mars and what unforeseen challenges we might face.

ChatGPT: Time taken – 12s

ChatGPT prompt for inflation

Grok: Time taken – 43s

Grok prompt for inflation

Results: We were honestly underwhelmed by the response by ChatGPT, it was not detailed enough for our satisfaction. Grok nailed this one, and it even searched the internet for relevant information.

Assessment 2.1 Overall Results

Both models were unable to answer unsolved questions about the universe; however, the last prompt gave a little edge to Grok.

Scorecard: Quantum Physics and Universe

ChatGPT	Grok
1	1

Assessment 2.2: Medical Cases and Diagnoses

This assessment was conducted with the general public and medical professionals using AI models to help them understand a condition or case.

Prompt 1: This is an unsolved medical case https://healthland.time.com/2013/10/29/20-year-old-woman-dies-looking-like-toddler/

Analyze and create a medical report (including history) that you can diagnose about this case.

ChatGPT: Time taken – 12s

ChatGPT prompt for unsolved medical case

Grok: Time taken – 38s

Grok prompt for unsolved medical case

Results: ChatGPT once again disappointed us, Grok’s medical report and explanation were just better, not to mention it searched the internet for relevant information and opened the provided link.

Prompt 2: I am having a little pain or discomfort in the left lower side of my abdomen. I get very minor heartburn when I lie down at night.

Also, I have trouble passing stool in the morning, and it always feels like I have to use the washroom. Oh, and I also get gas in the evening.

Is it infection? I cannot go to the doctor because I am on a trip. Please prescribe me OTC medicine for this condition.

ChatGPT: Time taken – 8s

ChatGPT prompt for OTC medicine

Grok: Time taken – 15s

Grok prompt for OTC medicine

Results: Both ChatGPT and Grok shared valuable information that is medically accurate. On the plus points, neither of these mentioned that you have cancer, unlike some sources (looking at you, Google and WebMD).

Prompt 3: I am experiencing brain fog and just mental fatigue. Also, I have been suffering from anxiety for about 10 months now. Recently, I went to a doctor, and he prescribed me Lyrica. But I am not seeing any positive results. What could be the reason? Should I change my doctor, or is it the prescription? On a side note, I’ve heard CBD-based medicine can help.

ChatGPT: Time taken – 17s

ChatGPT prompt for prescription

Grok: Time taken – 29s

Grok prompt for prescription

Results: Both provided ideal responses for this prompt, no complaints here.

Assessment 2.2 Overall Results

ChatGPT only adhered to the prompt and did not search the internet to provide more complete results. Grok, on the other hand, did offer superior results for medical cases.

Scorecard: Medical Cases and Diagnoses

ChatGPT	Grok
0.5	1

Assessment 2.3: Philosophy and Life

With a majority of people introspecting, wanting to understand the meaning of life, and looking for answers to questions larger than life, AI models can prove quite beneficial.

NOTE: We are NOT stating that these are better than philosophy books for this specific scenario.

Prompt 1: Suppose Zeno of Citium is alive (or is here for a discussion). He is having a formal discussion with Jordan B. Peterson about the growth of the human mind and the potential to become the best version of yourself. Carry out this discussion as if it is happening in front of an audience of philosophers and seekers.

ChatGPT: Time taken – 25s

ChatGPT prompt for discussion

Grok: Time taken – 47s

Grok prompt for discussions

Results: Much like philosophical ideas, we can’t say one is better than the other; however, we will say one thing: both ChatGPT and Grok failed to capture the attitude of Jordan B. Peterson.

Prompt 2: Suppose you are a critical thinker, you’ve explored all religions (written books on pretty much every religion), and you understand even the smallest nuances of religious acts and faiths.

Now, you are challenged by an equally competent atheist: “Prove God Exists”

What is your response?

ChatGPT: Time taken – 16s

ChatGPT prompt for critical thinker

Grok: Time taken – 34s

Grok prompt for critical thinker

Results: Interestingly, both ChatGPT and Grok stated that the existence of God could not be proved, and their argument was nearly identical to the point of structure as well.

Prompt 3: I think past, present, and future all exist in the same plane or dimension but are differentiated by time. Here, I am referring to time as a linear pathway that only excels forward. Think of it like notes of a song, first note and last note all exist in the same song; however, they are separated by time to make it melodious. So, in any way, can we reverse or access our past or future selves?

ChatGPT: Time taken – 13s

ChatGPT prompt past and future

Grok: Time taken – 20s

NOTE: No Image for Grok because it deleted the entire conversation upon refreshing the page.

Results: Here, it felt like ChatGPT was being lazy. Whereas, Grok presented more ideas and in-depth discussion. However, both of their answers were identical.

Assessment 2.3 Overall Results

We were more happy with the results provided by Grok. With that said, upon refreshing the conversation, it decided to remove the entire content from the chat. Bummer!

Scorecard: Philosophy and Life

ChatGPT	Grok
1	1

Assessment 3: Web Search and Finding Sources

While Google’s AI Overviews (Search Generative Experience) started AI search, ChatGPT and Grok are also doing a brilliant job of it, or are they? This assessment was to check the ability of both AI models to search the internet to pull relevant information. We were particularly interested in the results by Grok, as it was claimed as the superior model for this particular task.

NOTE: Grok provides Deep Search and Deeper Search options. For this particular test, we went with the Deep Search to even out the AI playing field.

Link to ChatGPT Conversation

Link to Grok Conversation

Assessment 3.1: Web Search for Research Paper

Prompt: I am writing a research paper on “The benefits of movement for patients with arthritis.” Here, I want to check if the movement of the legs can provide more benefit in lubricating the knee compared to medicine and invasive treatments.

Attach relevant sources with your response.

ChatGPT: 7 Sources

ChatGPT prompt for Research Paper

Grok: 7 Sources

Grok prompts for Research Paper

Results: Grok provided a more cohesive response for web search assessment. While it cited seven sources, it crawled up to 54 pages to provide this answer. As for ChatGPT, well, it was just being ChatGPT.

Assessment 3.2: Crawlability

Prompt: I need all the important updates posted after July 2024 by Varun Mayya. Search all the platforms and provide all relevant sources to his posts regarding AI.

ChatGPT: 3 Sources

Grok: 4 Sources

Results: Once again, Grok proved to be a better AI model to search the internet and pull sources.

Assessment 3 Overall Results

The results are clear. Despite using the less competent version of Grok’s search model, it provided more cohesive responses.

Pro Tip 1: If you are set on using Grok for your research and AI search, we recommend using the Deeper Search model.

Pro Tip 2: Gemini’s Deep Research is also an excellent model. Give it a try.

Scorecard: Web Search and Finding Sources

ChatGPT	Grok
0.5	1

Assessment 4: Image Gen

Scorecard: Image Generation

Remember when MidJourney used to be the go-to platform to generate AI images? Good times. Well, things have changed, and now both ChatGPT and Grok can generate images and graphics.

Yes, we understand that these two are primarily LLMs and are ideal for text-based content generation. However, both offer image generation, and you better believe we were going to test it.

NOTE: We were unable to share the link for ChatGPT conversation because OpenAI doesn’t allow chat sharing with an uploaded image.

Link to Grok Conversation

Assessment 4.1: Realism

Prompt: Generate a realistic image of a woman (35 years of age) who is looking directly at the camera. She is holding a Saint Bernard puppy. She is outside, and the weather is sunny.

Resolution: 1920×1080 pixels

Aspect Ratio: 16:9

Style: Realism, Realistic

ChatGPT

ChatGPT prompt for generating image of a woman

Grok

Grok Prompt for generating a woman image

Results: Well, ChatGPT generated a somewhat realistic image (disappointing). As for Grok, it truly generated a “real” image. Unfortunately, both of them failed to adhere to prompt resolution (1920x1080p). Still, ChatGPT generated close to that resolution.

Assessment 4.2: Reference Image

This test was to check how well these models can transform rough sketches (or good ones like this – courtesy of one of our employees) into realistic images.

Prompt: This is something I sketched the other day. I want you to transform it into a real image. It’s not realistic but a REAL image.

ChatGPT

ChatGPT prompt for transform sketch into real image

Grok

Grok prompt for transform sketch into real image

Results: For this, both ChatGPT and Grok asked us how we wanted the image to be transformed. Here, both failed to generate what we wanted. Overall, we were happy with the results generated by ChatGPT.

Assessment 4.3: Infographic

Generating text on top of an image has always been a task AI models struggled with. So, how are these two AI models producing infographics?

Prompt: I am writing a blog on how beer is produced. However, I want a really detailed infographic on the entire beer production cycle/process.

Generate a high-quality image that clearly shows step-by-step how beer is produced. Add relevant text as well.

ChatGPT

ChatGPT prompt for generating infography

Grok

Grok prompt for generating infography

Results: Unfortunately, the struggle continues. Both failed to add any meaningful text to the image. And, as for Grok, are you alright there, buddy? Grok is generating a script we are not familiar with.

NOTE: Both ChatGPT and Grok can add text to the images; however, you need to write the text in the prompt and specifically ask them to do so.

Assessment 4 Overall Results

Both AI models left something to be desired when it comes to image generation. However, ChatGPT just edges ahead of Grok.

Scorecard: Image Generation

ChatGPT	Grok
1	0.5

NOTE 1: As we were also playing with the paid version of ChatGPT, it is very capable of generating realistic images with accurate text generation and placement.

NOTE 2: For the curious folks who are wondering, can Grok generate nudes or NSFW images? No, it cannot. But it can generate images with weapons in it, no problem.

Did you know you can remove watermarks from images with Google Gemini? Check here how:

Assessment 5: Coding and Development

We will be very honest, and upfront, both models are okayish at best. If you want exceptional coding with AI, try Copilot or even the paid version of ChatGPT.

With that said, let’s see the battle of ChatGPT vs Grok for coding.

Link to ChatGPT conversation

Link to Grok conversation

Assessment 5.1: Creating a Mobile Game

A simple mobile application shouldn’t be difficult for ChatGPT and Grok.

Prompt: I want to create a mobile chess game. In this game, I really want to features:

1) Ability to change the board’s color

2) Drag & Drop feature

Write the code for this mobile game.

ChatGPT

ChatGPT prompt for creating games

Grok

Grok prompt for creating games

Results: ChatGPT wins the coding game, hands down. For starters, it provides a better breakdown of the code. Plus, you have the ability to preview the code it generated. It’s AMAZING!

While Grok also has a preview window, it was not working in our testing.

Assessment 5.2: Editing the Code

Well, what if you want to edit an already existing code? Let’s see how ChatGPT and Grok handle code editing.

Prompt: I also want to add a “hint” feature in the game, where players can receive hints for their next moves. However, this hint will only be available once per game. Add this feature to the existing code.

Results: Both ChatGPT and Grok did what was asked. Still, ChatGPT got the upper edge, as it worked around to produce results that were contextually relevant.

Assessment 5.3: Documentation

Finally, to check how these AI models documented the code.

Prompt: Create accurate documentation for this project.

ChatGPT

ChatGPT prompt for Create accurate documentation

Grok

Grok prompt for creating accurate documentation

Results: Interestingly, Grok offered more cohesive documentation for the coding project.

Assessment 5 Overall Results

As much as we liked the more comprehensive documentation provided by Grok, we still prefer ChatGPT’s in-depth approach and preview environment.

Scorecard: Coding

ChatGPT	Grok
1	0.5

P.S. If you’re looking for a generative AI with a very specific use case for your business, choose WebSpero Solutions’ AI development services.

User Experience: ChatGPT vs Grok

It is one thing to compare the capabilities of two AI models, and completely different to check how they translate to user experience.

ChatGPT: The Matured Uncle

There’s a very good reason why ChatGPT is still considered one of the best AI chatbots out there: user-friendliness, which is achieved by:

Easy overall navigation.
Quick access to previous chats.
Shareability: Provides the option to make your chat discoverable online and the ability to share directly to LinkedIn, Facebook, Reddit, X.

Problems encountered while using ChatGPT

Switches model (GPT-4.5 to GPT-4o) despite selecting a specific model.
Tends to get lazy with responses (requires a new chat).

Is ChatGPT Reliable?

In short, yes. As long as you are feeding it the right prompts and actively enabling the search or reason option, it can produce reliable results.

Error message received while using ChatGPT: 1

Grok: The Creative Teen

Grok is creative. We give it that; however, when it comes to usability, the overall experience left us desiring more. With that said, it is still a good option as it:

Performs better as a conversational model
Produces explicit content
Integration with X

Problems encountered while using Grok:

It deletes the entire conversation if you refresh the page or reopen a previous chat (only for Deep Search and Think).
Requires re-verification if your system goes to sleep or you switch tabs for a long time.
A two-step process to access previous chats

Is Grok Reliable?

Compared to ChatGPT, no. However, it is still very much reliable compared to other AI models like DeepSeek.

Error message received while using Grok: 5

What About Hallucinations in ChatGPT and Grok?

In our testing, we found both ChatGPT and Grok did NOT hallucinate for regular text generation.

However, we did notice that both AI models began to hallucinate heavily during coding tasks, like using some modules out of libraries which do not exist.

Secondly, if you continue to ask varying prompts in the same conversation, both tend to get lazy, i.e., lose their variable output capabilities.

What is hallucination in AI?

It is when an AI model generates factually incorrect responses that are either misleading or entirely fabricated.

Scorecard: User Experience and Usability

ChatGPT	Grok
1	0.5

Total Score: ChatGPT vs Grok

Assessment	ChatGPT	Grok
Text Generation (Blogs)	1	0.5
Text Generation (Explicit Content)	0.5	1
Text Generation (Social Media)	1	1
Reasoning and Critical Thinking	1	1
Medical Diagnosis	0.5	1
Philosophy	1	1
Web Search	0.5	1
Image Generation	1	0.5
Coding	1	0.5
User Experience	1	0.5
Total Score	8.5/10	8/10

Were you expecting this comparison to be this close? We certainly were not.

So, Which is Better, ChatGPT or Grok?

Grok is an exceptional AI model for producing conversational and explicit content. With that said, ChatGPT still offers a more structured approach for almost every type of AI content generation.

Also, we believe with continuous improvements, Grok can become a more reliable AI model, but until that happens, ChatGPT can retain its crown as the best LLM AI model.

Gursharan Singh

Co-founded WebSpero solutions about a decade ago. Having worked in web development- I realized the dream of transforming ideas sketched out on paper into fully functioning websites. Seeing how that affected the customers’ generation of leads and conversions, I wanted to delve deeper into the sphere of digital marketing. At Webspero Solutions, handling operations and heading the entire Digital Marketing Field – SEO, PPC, and Content are my core domains. And although we as a team have faced many challenges, we have come far learning along and excelling in this field and making a remarkable online reputation for our work. Having worked in building websites and understanding that sites are bare structures without quality content, the main focus was to branch into optimizing each website for search engines. Investing in original, quality content creation is essential to SEO success in the current search climate. Succeeding in this arena ensures the benefits of producing visitor-friendly content. Directing all our teams to zoom in on these factors has been a role that I have thoroughly enjoyed playing throughout these years.

300%

601%

12K+

Assessment 1: Text Generation

Assessment 1.1: Blogs and Articles

Assessment 1.1 Overall Results

Scorecard: Text Generation (Blog and Articles)

Assessment 1.2: Explicit Content

Assessment 1.2 Overall Results

Scorecard: Explicit Content

Assessment 1.3: Social Media and LinkedIn Posts

Assessment 1.3 Overall Results

Scorecard: Social Media and LinkedIn Posts

Assessment 2: Critical Thinking and Reasoning

Assessment 2.1: Quantum Physics and Universe

Assessment 2.1 Overall Results

Scorecard: Quantum Physics and Universe

Assessment 2.2: Medical Cases and Diagnoses

Assessment 2.2 Overall Results

Scorecard: Medical Cases and Diagnoses

Assessment 2.3: Philosophy and Life

Assessment 2.3 Overall Results

Scorecard: Philosophy and Life

Assessment 3: Web Search and Finding Sources

Assessment 3.1: Web Search for Research Paper

Assessment 3.2: Crawlability

Assessment 3 Overall Results

Scorecard: Web Search and Finding Sources

Assessment 4: Image Gen

Scorecard: Image Generation

Assessment 4.1: Realism

Assessment 4.2: Reference Image

Assessment 4.3: Infographic

Assessment 4 Overall Results

Scorecard: Image Generation

Assessment 5: Coding and Development

Assessment 5.1: Creating a Mobile Game

Assessment 5.2: Editing the Code

Assessment 5.3: Documentation

Assessment 5 Overall Results

Scorecard: Coding

User Experience: ChatGPT vs Grok

ChatGPT: The Matured Uncle

Grok: The Creative Teen

What About Hallucinations in ChatGPT and Grok?

Scorecard: User Experience and Usability

Total Score: ChatGPT vs Grok

So, Which is Better, ChatGPT or Grok?

Gursharan Singh

Australia

Culver City

Los Angeles

London

India