Mark Zuckerberg's Meta Says Llama 3 Beats Google's Gemini, Mistral And Jeff Bezos-backed Anthropic's Claude 3, But OpenAI's GPT-4 Is Notably Missing From Its Comparison

Benzinga ·  Apr 19 17:14

Mark Zuckerberg's Meta Platforms Inc. (NASDAQ:META) revealed that its new large language model, Llama 3, has surpassed other AI models in benchmark tests. However, OpenAI's latest flagship model, GPT-4, is notably missing from its comparison.

What Happened: The Llama 3 model, which Meta unveiled on Thursday, has outperformed several other AI models in benchmark tests, including those from Alphabet Inc.'s (NASDAQ:GOOGL) (NASDAQ:GOOG) Google, Jeff Bezos-backed Anthropic, and Mistral AI.

The new model is expected to become available soon on cloud platforms such as Amazon.com Inc.'s (NASDAQ:AMZN) Amazon Web Services (AWS) and in model libraries such as Hugging Face.

The Llama 3 model, which comes in two sizes with 8B and 70B parameters, has shown significant improvements in text-based responses. It has demonstrated greater diversity in its responses to prompts, fewer false refusals, and improved reasoning abilities.

The model also shows a better understanding of instructions and improved code-writing capabilities.

Meta's blog post stated that Llama 3 outperformed similarly sized models like Google's Gemma and Gemini, Anthropic's Claude 3, and Mistral 7B in specific benchmarking tests.

Meta's Llama 3 AI model benchmark comparison | Image credit: Meta

In the MMLU benchmark, which measures general knowledge, the 8B version of Llama 3 outperformed both Gemma 7B and Mistral 7B, while the 70B version narrowly edged out Gemini Pro 1.5.

It is worth noting that Meta did not mention OpenAI's GPT-4 in its post. The company also acknowledged that benchmarking AI models, while helpful, is imperfect, because the datasets used for benchmarking are often part of a model's training data.

Why It Matters: In March, Anthropic introduced its latest generative AI model, Claude 3, claiming it outperforms rivals like OpenAI's GPT-4.

In a similar vein, Elon Musk's xAI announced a major update to Grok with improvements across the board in various metrics.

OpenAI, on the other hand, is expected to release a "significantly better" GPT-5 model later this year.

Disclaimer: This content was partially produced with the help of Benzinga Neuro and was reviewed and published by Benzinga editors.

