Challenging GPT: Meta launches Llama 3, its most powerful open-source model, with the “smartest” free AI assistant across its social media apps

Hard AI ·  Apr 19 08:42

Source: Hard AI

The largest version of Llama 3 will exceed 400 billion parameters, and the model was trained on more than 15 trillion tokens. In human evaluations, its win rate against GPT-3.5 exceeds 60%. Amazon, Microsoft, and Google Cloud will offer Llama 3, and Nvidia, Intel, and AMD hardware platforms will support it. According to Nvidia, Meta trained Llama 3 on a computer cluster with more than 24,000 H100 chips. The Meta AI assistant will launch in English in 13 countries beyond the US, usable on both phones and computers, with web search that requires no app switching. Its text-to-image feature can update images in real time as prompts are typed and can generate animated GIFs.

OpenAI's GPT has gained a strong rival: Meta Platforms (META.US) is mounting its latest round of challenges.

On Thursday, April 18, Eastern Time, Meta announced the launch of its third-generation large language model (LLM), Llama 3, calling it “the most capable open-source LLM so far,” and upgraded its artificial intelligence (AI) assistant Meta AI on the basis of Llama 3, calling it “the smartest AI assistant you can use for free.”

Meta announced that Llama 3 will be available on cloud platforms including Amazon, Microsoft, and Google Cloud, and will be supported on hardware from vendors including Nvidia and Dell. Nvidia revealed that Meta trained Llama 3 on a computer cluster with more than 24,000 H100 chips. Accelerated by Nvidia products and services, Llama 3 can be used in cloud, edge-computing, robotics, and PC applications.

The largest version of Llama 3 exceeds 400 billion parameters, and training used more than 15 trillion tokens

Llama 2, released by Meta last July, came in three versions; the largest, 70B, has 70 billion parameters. On Thursday, Meta said Llama 3 comes in two versions, 8B and 70B. Meta CEO Mark Zuckerberg said a larger version of Llama 3, currently still in training, will have more than 400 billion parameters; Meta has not said whether that 400-billion-parameter model will be open-sourced.

Compared with its predecessor, Llama 3 is a qualitative leap forward. Llama 2 was trained on 2 trillion tokens, while Llama 3 was trained on more than 15 trillion tokens, over seven times as many.
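As a quick back-of-the-envelope check, the jump in training-data scale can be put in numbers (using the rounded token counts cited above, not exact official figures):

```python
# Rough comparison of Llama 2 vs. Llama 3 training-data scale,
# based on the rounded token counts reported in this article.
llama2_tokens = 2e12    # ~2 trillion tokens (Llama 2)
llama3_tokens = 15e12   # ~15 trillion tokens (Llama 3)

ratio = llama3_tokens / llama2_tokens
print(f"Llama 3 saw about {ratio:.1f}x more training tokens than Llama 2")
# prints: Llama 3 saw about 7.5x more training tokens than Llama 2
```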

Meta said that thanks to improvements in pre-training and post-training, its pre-trained and instruction-tuned models are currently the best at the 8B and 70B parameter scales. After the post-training process was improved, the models' false refusal rate dropped sharply, alignment improved, and the diversity of model responses increased. Llama 3 is also a big improvement over Llama 2 in capabilities such as reasoning, code generation, and instruction following, making Llama 3 easier to steer.

As the figure below shows, the 8B and 70B instruction-tuned versions of Llama 3 score higher than Mistral, Google's Gemma and Gemini, and Anthropic's Claude 3 on benchmarks including the massive multitask language understanding dataset (MMLU), graduate-level expert reasoning (GPQA), the math evaluation set (GSM8K), and the code-generation test (HumanEval).

The 8B and 70B versions of the pre-trained Llama 3 likewise outperform Mistral, Gemma, Gemini, and Mixtral across a range of performance tests.

According to Meta, it developed a new high-quality human-evaluation set of 1,800 prompts covering 12 key use cases: asking for advice, brainstorming, classification, closed-book Q&A, open Q&A, coding, creative writing, extraction, role-playing a character, reasoning, rewriting, and summarization. As the figure below shows, on this human-evaluation set the instruction-tuned 70B version of Llama 3 beat Claude Sonnet, Mistral Medium, GPT-3.5, and Llama 2, with win rates of 52.9%, 59.3%, 63.2%, and 63.7%, respectively.
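Meta has not published exactly how it aggregates these pairwise human preferences, but a common way to compute a win rate from such judgments, splitting ties evenly between the two models, can be sketched as follows (the `win_rate` helper and the sample counts are illustrative assumptions, not Meta's actual data):

```python
from collections import Counter

def win_rate(judgments):
    """Compute model A's win rate from pairwise human judgments.

    `judgments` is a list of strings: "A" (A preferred), "B" (B preferred),
    or "tie". Ties are split evenly between the two models here; this is a
    common convention, not Meta's documented procedure.
    """
    counts = Counter(judgments)
    return (counts["A"] + 0.5 * counts["tie"]) / len(judgments)

# Hypothetical outcomes over a 1,800-prompt evaluation set
sample = ["A"] * 1000 + ["B"] * 500 + ["tie"] * 300
print(f"win rate: {win_rate(sample):.1%}")  # prints: win rate: 63.9%
```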

With future multilingual use cases in mind, over 5% of the Llama 3 pre-training dataset consists of high-quality non-English data covering more than 30 languages, though Meta does not expect performance in these languages to match that of English.

Meta said that in the coming months it will roll out new Llama 3 capabilities, including longer context windows, stronger performance, and additional model sizes, and will also publish a Llama 3 research paper.

Amazon and other cloud platforms will offer Llama 3, which was trained on more than 24,000 Nvidia H100 chips

According to Meta, the Llama 3 models will soon be available on Amazon's AWS, Databricks, Google Cloud, Hugging Face, Kaggle, IBM's WatsonX, Microsoft Azure, Nvidia's NIM, and Snowflake, and will be supported on hardware platforms from AMD, AWS, Dell, Intel, and Nvidia.

Nvidia revealed the same day that Meta engineers trained Llama 3 on a computer cluster containing 24,576 Nvidia H100 Tensor Core GPUs connected over an Nvidia Quantum-2 InfiniBand network. With Nvidia's support, Meta tuned its network, software, and model architectures for the LLM. To push the state of the art in generative AI further, Meta recently announced plans to scale its infrastructure to 350,000 H100 GPUs.

According to Nvidia, Llama 3 accelerated by Nvidia chips is available now and can be used in the cloud, data centers, edge computing, and personal computers (PCs). Developers can try Llama 3 at ai.nvidia.com, and enterprise users can fine-tune Llama 3 on their own data with NeMo, Nvidia's end-to-end cloud-native framework.

Llama 3 can also run on Nvidia's Jetson Orin modules for robots and edge-computing devices, enabling interactive agents like those in the Nvidia Jetson AI Lab. In addition, Nvidia RTX and GeForce RTX GPUs in workstations and PCs speed up Llama 3 inference.

The English version of Meta AI launches in 13 countries beyond the US, on phones and computers, with a text-to-image feature that updates images in real time and generates GIFs

According to Meta, users can use Meta AI to get things done, learn, create, and connect with what matters to them on its social apps Facebook, Instagram, WhatsApp, and Messenger.

Meta said it will launch an English-language version of Meta AI in 13 countries other than the US, including Canada, Australia, New Zealand, Singapore, South Africa, Nigeria, Pakistan, Ghana, Jamaica, Malawi, Uganda, Zambia and Zimbabwe.

What can Meta AI do? Meta gave some examples: planning a night out with friends, recommending a restaurant with sunset views and vegetarian options, finding weekend concerts, suggesting picnic spots, and helping with schoolwork, such as explaining how genetic traits are inherited.

Meta also mentioned a new AI image-generation feature, which lets users generate images from text in WhatsApp and on the Meta AI website. With it, Meta AI can “imagine” and generate images matching the aesthetic a user describes, for example to provide inspiration for real-world shopping.

Zuckerberg said the image-generation feature will update images in real time as users type more detailed prompts, and can create custom animated GIFs.

Meta said that as users type a prompt, an image appears and changes with every few letters entered.

According to Meta, if users find an image they like, Meta AI can animate it or convert it into a GIF to share with friends.

Beyond phone users, Meta also launched the meta.ai website for computer users, so that Meta AI can help while they work, for example solving math problems or making work emails sound more professional. Users can also log in to the site to save their conversations with Meta AI for future reference.

Meta AI can also perform real-time web searches within Facebook, Instagram, WhatsApp, and Messenger, so users can get up-to-date information without switching out of these apps. Suppose a user is planning a ski trip in a Messenger group chat: using the search in Messenger, they can ask Meta AI to find flights from New York to Colorado and identify weekends with fewer travelers, all without leaving the Messenger app.

Users can also reach Meta AI while scrolling their Facebook feed. If a post catches their interest, they can ask Meta AI for more information directly from the post. For example, seeing a photo of the Northern Lights in Iceland, they can ask Meta AI what time of year is best for viewing them.
