Skip to main content

Digital Trends may earn a commission when you buy through links on our site. Why trust us?

GPT-4 vs. GPT-3.5: how much difference is there?

Have you heard of GPT-4? It’s the latest language model for OpenAI’s natural language chatbot, and it’s said to be much better than GPT 3.5, which is available on ChatGPT. But how much better is it and what makes it different?

That’s what we’re here to find out, with a direct head-to-head comparison of these two exciting, but distinctly different language models. Ultimately, we can decide whether it’s worth paying the extra for a ChatGPT Plus subscription, or if it’s better to just use GPT-4 for free.

What are GPT 3.5 and GPT-4?

ChatGPT New chat screen.
screenshot

Both GPT-3.5 and GPT-4 are natural language models used by OpenAI’s ChatGPT and other artificial intelligence chatbots to craft humanlike interactions. They can both respond to prompts like questions or requests, and can provide responses very similar to that of a real person. They’re both capable of passing exams that would stump most humans, including complicated legal Bar exams, and they can write in the style of any writer with publicly available work.

But GPT-4 is the newer of the two models, so it comes with a number of upgrades and improvements that OpenAI believes are worth locking it behind a paywall — at least for now.

How can you use GPT 3.5 and GPT-4?

GPT-3.5 is fully available as part of ChatGPT, on the OpenAI website. You’ll need an account to log in, but it’s entirely free, and you’ll have the ability to chat with ChatGPT as much as you like, assuming the servers aren’t too busy. You can also find GPT 3.5 being used by a range of other chatbots that are widely available across different sites and services.

GPT-4, on the other hand, is a little harder to come by. You can use it through the OpenAI website as part of its ChatGPT Plus subscription. It’s $20 a month, but you’ll get priority access to ChatGPT as well, so it’s never too busy to have a chat. There are some ways to use GPT-4 for free, as well including using Bing Chat, but those sources tend to have a limited number of questions, or don’t always use GPT-4 due to limited availability.

What’s the difference between GPT 3.5 and GPT-4?

GPT 3.5 was trained on data that ultimately gave it the ability to consider 175 billion parameters depending on the prompt it receives. That gave it some impressive linguistic abilities, and let it respond to queries in a very humanlike fashion. However, GPT-4 is based on a lot more training data, and is ultimately able to consider over 1 trillion parameters when making its responses. GPT-4 was also trained through human and AI feedback for a further six months beyond that of GPT-3.5, so it has had many more corrections and suggestions on how it can improve.

GPT 4 is also trained on newer data. While GPT 3.5 was limited to information prior to June 2021, GPT-4 was trained on data up to September 2021, with some select information from beyond that date, which makes it a little more current in its responses.

All of that gives GPT-4 much greater ability to craft nuanced responses that are both more accurate and less prone to what OpenAI describes as “hallucinations.” That means it shouldn’t make up information as often, and will state that it doesn’t know the answer to something more readily.

Trying to use GPT-4 for illegal business ideas.
It was worth a shot. Image used with permission by copyright holder

GPT-4 also incorporates many new safeguards that OpenAI put in place to make it less prone to delivering responses that could be considered harmful or illegal. OpenAI claims that GPT-4 is “82% less likely to respond to requests for disallowed content.” There are still ways you can jailbreak ChatGPT, but it’s much better at dodging them.

OpenAI also took great steps to improve informational synthesis with GPT-4. That makes it more capable of understanding prompts with multiple factors to consider. You can ask it to approach a topic from multiple angles, or to consider multiple sources of information in crafting its response. This can also be seen in GPT-4’s creative efforts, where asking it to generate an original story will see it craft something much more believable and coherent. GPT-3.5 has a penchant for losing threads halfway through, or making nonsensical suggestions for characters that would be physically or canonically impossible.

The improved context window of GPT-4 is another major standout feature. It can now retain more information from your chats, letting it further improve responses based on your conversation. That works out to around 25,000 words of context for GPT-4, whereas GPT-3.5 is limited to a mere 3,000 words.

That additional understanding and larger context window does mean that GPT-4 is not as fast in its responses, however. GPT-3.5 will typically respond in its entirety within a few seconds, whereas GPT-4 will take a minute or more to write out larger responses.

Advanced programming

One of the coolest features of GPT-3.5 is its ability to write code. However, it wasn’t great at iterating upon it, leaving programmers trying to use ChatGPT and other AI tools to save time often spending more time bug fixing than if they’d just written the code themselves. GPT-4, on the other hand, is vastly superior in its initial understanding of the kind of code you want, and in its ability to improve it.

GPT-4 can take prompts like “improve performance,” or “this code gives me error X, can you fix it?” GPT-3.5 wouldn’t have fully understood those prompts, but GPT-4 can, and will act upon them effectively, allowing it to improve its own responses in future attempts. The ability to give it initial tasks beyond the original goal is an impressive advancement of GPT-4.

Understanding images

GPT-3.5 is primarily a text tool, whereas GPT-4 is able to understand images. If you provide it with a photo, it can describe what’s in it, understand the context of what’s there, and make suggestions based on it. This has led to some people using GPT-4 to craft recipe ideas based on pictures of their fridge. In other cases, GPT-4 has been used to code a website based on a quick sketch.

Some people have even started to combine GPT-4 with other AIs, like Midjourney, to generate entirely new AI art based on the prompts GPT-4 itself came up with.

Editors' Recommendations

Jon Martindale
Jon Martindale is the Evergreen Coordinator for Computing, overseeing a team of writers addressing all the latest how to…
Here’s why people are claiming GPT-4 just got way better
A person sits in front of a laptop. On the laptop screen is the home page for OpenAI's ChatGPT artificial intelligence chatbot.

It appears that OpenAI is busy playing cleanup with its GPT language models after accusations that GPT-4 has been getting "lazy," "dumb," and has been experiencing errors outside of the norm for the ChatGPT chatbot circulated social media in late November.

Some are even speculating that GPT-4.5 has secretly been rolled out to some users, based on some responses from ChatGPT itself. Regardless of whether or not that's true, there's definitely been some positive internal changes over the past behind GPT-4.
More GPUs, better performance?
Posts started rolling in as early as last Thursday that noticed the improvement in GPT-4's performance. Wharton Professor Ethan Mollick, who previously commented on the sharp downturn in GPT-4 performance in November, has also noted a revitalization in the model, without seeing any proof of a switch to GPT-4.5 for himself. Consistently using a code interpreter to fix his code, he described the change as "night and day, for both speed and answer quality" after experiencing ChatGPT-4 being "unreliable and a little dull for weeks."

Read more
What is Grok? Elon Musk’s controversial ChatGPT competitor explained
A digital image of Elon Musk in front of a stylized background with the Twitter logo repeating.

Grok! It might not roll off the tongue like ChatGPT or Windows Copilot, but it's a large language model chatbot all the same. Developed by xAI, an offshoot of the programmers who stuck around after Elon Musk purchased X (formerly known as Twitter), Grok is designed to compete directly with OpenAI's GPT-4 models, Google's Bard, and a range of other public-facing chatbots.

Launched in November 2023, Grok is designed to be a chatbot with less of a filter than other AIs. It's said to have a "bit of wit, and has a rebellious streak."
It's only for X Premium users

Read more
Google might finally have an answer to Chat GPT-4
ChatGPT versus Google on smartphones.

Google has announced the launch of its most extensive artificial intelligence model, Gemini, and it features three versions: Gemini Ultra, the largest and most capable; Gemini Pro, which is versatile across various tasks; and Gemini Nano, designed for specific tasks and mobile devices. The plan is to license Gemini to customers through Google Cloud for use in their applications, in a challenge to OpenAI's ChatGPT.

Gemini Ultra excels in massive multitask language understanding, outperforming human experts across subjects like math, physics, history, law, medicine, and ethics. It's expected to power Google products like Bard chatbot and Search Generative Experience. Google aims to monetize AI and plans to offer Gemini Pro through its cloud services.

Read more