

Is GPT-5 Really Worse Than GPT-4o?

The recent launch of OpenAI’s GPT-5 model has prompted widespread user complaints, leading OpenAI to reinstate its predecessor, GPT-4o, as an option for those who prefer the older model.

Introduction to GPT-5 and User Reactions

OpenAI’s latest language model, GPT-5, was released amid high expectations from both users and industry experts. However, the rollout has not been smooth. Many users have expressed dissatisfaction with the new model, citing a more sterile tone, diminished creativity, and more inaccurate or misleading output (often called confabulations). The backlash was significant enough that OpenAI reinstated the GPT-4o model, allowing users to choose between the two versions.

Comparative Testing of GPT-5 and GPT-4o

In an effort to objectively evaluate the differences between GPT-5 and GPT-4o, Ars Technica conducted a series of tests using both models. The testing involved a set of eight prompts designed to reflect contemporary user needs and expectations from language models. While these prompts do not constitute a comprehensive evaluation, they provide insight into the stylistic and substantive differences between the two models.

Methodology of the Tests

The test prompts included a variety of complex requests that modern users are likely to make. Some of the prompts were adapted from earlier tests to allow comparison with other language models, such as Google Gemini and DeepSeek. The prompts were chosen to challenge both models in areas where users typically seek assistance.
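
For readers who want to try a similar side-by-side comparison, the setup described above can be sketched roughly as follows. This is a minimal illustration, not Ars Technica's actual tooling: the names `run_side_by_side`, `Comparison`, `fake_model_a`, and `fake_model_b` are hypothetical, and the two stand-in functions would need to be replaced with real calls to whichever models are being compared.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# A "model" here is any function that maps a prompt string to a response string.
# This is an assumption made for the sketch, not a real vendor API.
ModelFn = Callable[[str], str]


@dataclass
class Comparison:
    prompt: str
    responses: Dict[str, str]  # model name -> response text


def run_side_by_side(prompts: List[str], models: Dict[str, ModelFn]) -> List[Comparison]:
    """Send every prompt to every model and keep the answers paired for review."""
    results = []
    for prompt in prompts:
        responses = {name: ask(prompt) for name, ask in models.items()}
        results.append(Comparison(prompt=prompt, responses=responses))
    return results


# Hypothetical stand-ins that only echo the prompt; swap in real client calls.
def fake_model_a(prompt: str) -> str:
    return f"[model A] {prompt}"


def fake_model_b(prompt: str) -> str:
    return f"[model B] {prompt}"


if __name__ == "__main__":
    demo = run_side_by_side(
        ["Write a short story about a lighthouse."],
        {"model-a": fake_model_a, "model-b": fake_model_b},
    )
    for item in demo:
        print(item.prompt)
        for name, text in item.responses.items():
            print(f"  {name}: {text}")
```

Keeping each prompt paired with both responses makes it easier for a human reviewer to judge tone, creativity, and accuracy on the same input, which is the kind of qualitative comparison the article describes.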

Key Findings from the Tests

The results of the testing revealed several noteworthy distinctions between GPT-5 and GPT-4o. Below are some of the major areas of difference observed:

Tone and Style: Users reported that GPT-5 often exhibited a more sterile and formal tone compared to GPT-4o, which was perceived as more engaging and conversational.
Creativity: Feedback indicated that GPT-5’s responses lacked the creative flair that users had come to expect from earlier versions, particularly GPT-4o.
Accuracy: GPT-5 was noted to have a higher incidence of confabulations, where the model generated plausible but false information, raising concerns about its reliability.
Response Relevance: While both models aimed to provide relevant answers, users felt that GPT-4o was more adept at understanding nuanced queries and delivering contextually appropriate responses.

Specific Prompt Comparisons

To illustrate the differences further, Ars Technica provided examples of specific prompts and the respective responses from both models. Below are some summarized comparisons:

Prompt 1: Creative Writing

When tasked with generating a short story based on a given theme, GPT-4o produced a narrative that was rich in detail and character development. In contrast, GPT-5’s story was described as more formulaic, lacking the depth and emotional resonance that users found appealing in the earlier model.

Prompt 2: Technical Explanation

For a prompt requiring a technical explanation of a complex concept, both models performed adequately. However, GPT-4o’s response was noted for its clarity and accessibility, while GPT-5’s explanation was criticized for being overly complex and less user-friendly.

Prompt 3: Conversational Engagement

In a simulated conversation, GPT-4o demonstrated a more engaging and interactive style, responding to follow-up questions with a sense of continuity. GPT-5, on the other hand, was perceived as more rigid, with responses that felt disjointed and less connected to the flow of conversation.

Community Response and OpenAI’s Actions

The user feedback regarding GPT-5 has prompted OpenAI to take immediate action. The decision to reintroduce GPT-4o as an option for users indicates a recognition of the community’s concerns. This move not only provides users with a choice but also reflects a commitment from OpenAI to prioritize user experience and satisfaction.

Implications for Future Developments

The mixed reception of GPT-5 raises important questions about the direction of AI language model development. As OpenAI navigates these challenges, it must consider user feedback as a critical component of its ongoing improvement efforts. The contrasting performance of GPT-5 and GPT-4o may lead to a reevaluation of what users expect from AI-driven communication tools.

Conclusion

The rollout of OpenAI’s GPT-5 has sparked significant discussion within the tech community, highlighting the complexities involved in advancing AI language models. While GPT-5 introduces new features and capabilities, the user backlash underscores the importance of maintaining a balance between innovation and user satisfaction. As OpenAI continues to refine its models, the feedback from users will likely play a pivotal role in shaping future iterations.

