Meta releases Llama 3, its most powerful open-source large model; a multimodal version will follow

cls.cn · Apr 19 02:12

① Llama 3 comes in two versions, 8B and 70B; a larger version of Llama 3 will have more than 400 billion parameters. ② More advanced reasoning capabilities, such as the ability to make longer multi-step plans, will arrive in later versions.

Cailian Press, April 19 (Editor: Niu Zhanlin) On Thursday local time, US tech giant Meta launched Llama 3, its most powerful open-source artificial intelligence (AI) model, in a bid to catch up with industry leader OpenAI. During US trading, Meta shares rose more than 2%; they are up nearly 43% so far this year.

Meta CEO Mark Zuckerberg said that Llama 3 comes in two versions, 8B and 70B, and that a larger version of Llama 3 will have more than 400 billion parameters. Owing to its pre-training and instruction fine-tuning, Llama 3 is a major improvement over Llama 2.

Llama 3 delivers state-of-the-art performance on a range of industry benchmarks and offers new capabilities, including improved reasoning. Meta considers Llama 3 the best open-source large model on the market. Open source means that the code and data for these models are open to the public and can be viewed, modified, and used by anyone.

Developers had complained that the earlier Llama 2 model failed to grasp basic context and often got confused when handling queries. Google's Gemini AI image generation tool ran into a similar problem: it produced inaccurate depictions when generating images of historical figures, which drew widespread criticism.

Meta has now used higher-quality data to train Llama 3. This data helps the model better recognize nuances in language, which improves its ability to understand context.

Meta said it fed Llama 3 seven times as much data as Llama 2, which may help improve the model's performance and accuracy. It also used AI-generated “synthetic” data to strengthen the model in specific areas such as coding and reasoning.

According to Meta, Llama 3 will be integrated into its virtual assistant, Meta AI, which the company describes as the most advanced free-to-use AI application of its kind. The Meta AI assistant is already available in apps such as Facebook, Instagram, WhatsApp, and Messenger, and will be updated in due course.

Meta's chief product officer, Chris Cox, said in an interview that the social media giant has given Llama 3 new computer coding capabilities. This time, in addition to text, the model can also take images as input, but for now it can output only text. As a result, Llama 3 is not yet a multimodal large model.

But he added that more advanced reasoning capabilities, such as the ability to make longer multi-step plans, will arrive in later versions. Meta also plans to release multimodal versions in the coming months, which means those models will be able to generate both text and images.

Cox said the ultimate goal is to free users from tedious work and make life easier and happier, whether they are interacting with businesses, writing, or planning a trip.

In addition, Llama 3 will soon be available on Amazon AWS, Databricks, Google Cloud, Hugging Face, Kaggle, IBM's cloud platform WatsonX, Microsoft Azure, Nvidia NIM, and Snowflake, with support from hardware platforms provided by AMD, AWS, Dell, Intel, and Nvidia.
