

Ricoh has developed a high-performance Japanese LLM (70 billion parameters) equivalent to GPT-4 through model merging.

RICOH COMPANY ·  Sep 29 23:00

Ricoh Co., Ltd. (President and CEO: Akira Oyama) has developed a high-performance Japanese large language model (LLM*4). Using its own merging expertise, Ricoh took "Llama-3-Swallow-70B"*1, a base model that improves the Japanese performance of "Meta-Llama-3-70B" provided by Meta Platforms, Inc., and merged into it a Chat Vector*2 extracted from Meta's Instruct model together with a Chat Vector*3 created by Ricoh. As a result, Ricoh has added a high-performance model on par with OpenAI's GPT-4 to the lineup of LLMs it develops and provides.
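
As an illustration, Chat Vector merging comes down to simple arithmetic on model weights: subtracting the base model's weights from an instruction-tuned model isolates an instruction-following "direction", which can then be added to a different base. The sketch below, using PyTorch and Hugging Face Transformers, is a minimal illustration of that idea; the model identifiers follow the names in this release, but the merge coefficient and the element-wise recipe are assumptions, not Ricoh's actual method.

```python
import torch
from transformers import AutoModelForCausalLM

# Minimal chat-vector merge sketch. Model IDs follow the names in the
# release; the 1.0 merge coefficient is an assumption, not Ricoh's recipe.
# In practice, 70B checkpoints are processed shard-by-shard to save memory.
base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-70B", torch_dtype=torch.float16)
instruct = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-70B-Instruct", torch_dtype=torch.float16)
target = AutoModelForCausalLM.from_pretrained(
    "tokyotech-llm/Llama-3-Swallow-70B-v0.1", torch_dtype=torch.float16)

base_sd, inst_sd = base.state_dict(), instruct.state_dict()
merged = {}
for name, w in target.state_dict().items():
    if name in base_sd and base_sd[name].shape == w.shape:
        chat_vector = inst_sd[name] - base_sd[name]  # instruction-following delta
        merged[name] = w + 1.0 * chat_vector
    else:
        merged[name] = w  # e.g. tensors resized for an extended tokenizer

target.load_state_dict(merged)
target.save_pretrained("swallow-70b-chat-vector-merged")
```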

The increasing spread of generative AI has led to growing demand for high-performance LLMs that companies can use in their operations. However, additional training of an LLM is costly and time-consuming. In response, an efficient development method known as "model merging"*5, which combines multiple models to create a higher-performing model, is attracting attention.
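
In its simplest form, model merging is parameter-space interpolation between checkpoints that share an architecture; no gradient steps are taken, which is why it avoids the cost of additional training. A generic sketch follows, with placeholder model names and an assumed 50/50 weighting:

```python
import torch
from transformers import AutoModelForCausalLM

# Generic linear merge of two same-architecture checkpoints.
# "org/model-a", "org/model-b" and the 0.5/0.5 weights are placeholders.
a = AutoModelForCausalLM.from_pretrained("org/model-a", torch_dtype=torch.float16)
b = AutoModelForCausalLM.from_pretrained("org/model-b", torch_dtype=torch.float16)

sd_b = b.state_dict()
merged = {k: 0.5 * v + 0.5 * sd_b[k] for k, v in a.state_dict().items()}

a.load_state_dict(merged)  # reuse model "a" as the container
a.save_pretrained("linear-merged")
# Merging costs only memory and disk I/O, far less than fine-tuning:
# no forward or backward passes are run at any point.
```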

Drawing on this model-merging expertise and its accumulated LLM development know-how, Ricoh developed the new LLM. The approach helps streamline the development of private LLMs unique to individual companies and of high-performance LLMs for specific operations.

Ricoh will continue to research and develop diverse, efficient methods and technologies, not only to build its own LLMs but also to provide optimal LLMs tailored to customers' applications and environments at low cost and with short lead times.

Evaluation Results*6 (ELYZA-tasks-100)

"ELYZA-tasks-100", a representative Japanese benchmark including complex instructions and tasks, Ricoh's LLM developed using the model merge method in this instance showed a high level of score equivalent to GPT-4. Furthermore, while other LLMs compared showed cases where answers were given in English depending on the task, Ricoh's LLM consistently provided responses in Japanese for all tasks showing high stability.

Comparison with other models on the benchmark (ELYZA-tasks-100); Ricoh's model is shown at the bottom.

Background of Ricoh's LLM development

Against the backdrop of a shrinking labor force and an aging population, using AI to raise productivity and enable high-value ways of working has become a key challenge for corporate growth. Many companies are therefore focusing on putting AI to practical use in their operations. To apply AI to actual operations, however, an LLM must be trained on large amounts of text data that include company-specific terminology and phrasing, creating the company's own AI model (private LLM).

Leveraging LLM development and training technology that ranks among the best in Japan, Ricoh can propose a range of AI solutions, such as providing private LLMs for enterprises and supporting the use of internal documents through retrieval-augmented generation (RAG).
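
Since the release mentions supporting the use of internal documents through RAG, the following bare-bones retrieval step shows the underlying idea: embed the documents, rank them by similarity to the query, and prepend the best matches to the prompt. The embedding model, corpus, and prompt here are generic placeholders, not details of Ricoh's offering.

```python
import numpy as np
from sentence_transformers import SentenceTransformer

# Toy in-memory corpus standing in for internal company documents.
docs = [
    "Expense reports must be filed within 30 days of purchase.",
    "The VPN requires multi-factor authentication from off-site networks.",
]

embedder = SentenceTransformer("intfloat/multilingual-e5-large")  # assumed choice
doc_vecs = embedder.encode(docs, normalize_embeddings=True)

def retrieve(query: str, k: int = 1) -> list[str]:
    """Rank documents by cosine similarity (dot product of unit vectors)."""
    q = embedder.encode([query], normalize_embeddings=True)[0]
    order = np.argsort(doc_vecs @ q)[::-1][:k]
    return [docs[i] for i in order]

query = "How soon do I need to submit an expense report?"
context = "\n".join(retrieve(query))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
# `prompt` would then be sent to the private LLM for grounded generation.
print(prompt)
```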

*1 Llama-3-Swallow-70B: A Japanese LLM developed by a research team including Professors Naoaki Okazaki and Rio Yokota of the School of Computing, Tokyo Institute of Technology, together with the National Institute of Advanced Industrial Science and Technology (AIST).
*2 Chat Vector: A vector obtained by subtracting a base model's weights from those of a model with instruction-following capabilities, retaining only the instruction-following capability.
*3 Ricoh's Chat Vector: Extracted from an Instruct model fine-tuned on approximately 16,000 instruction-tuning examples, including data developed in-house by Ricoh, starting from Meta's base model "Meta-Llama-3-70B".
*4 Large language model (LLM): A technology that understands the ambiguity and variation in natural language as spoken or written by humans, taking context into account even between distant words in a sentence. It enables context-aware processing, can execute tasks such as answering questions about natural-language text and summarizing documents with human-like accuracy, and is easy to train.
*5 Model merging: A method for creating a higher-performing model by combining multiple pretrained LLMs. It has attracted attention in recent years because it makes model development accessible without large-scale computational resources such as GPUs.
*6 Evaluation results as of September 24, 2024: The generated answers were scored using "GPT-4" (gpt-4-0613) and "GPT-4o" (gpt-4o-2024-05-13), with no deductions for responses in English. "Percentage of tasks answered in English" is the proportion of the 100 tasks that were answered in English.

Related News

  • Developed a 70-billion-parameter large language model (LLM) supporting three languages (Japanese, English, and Chinese), strengthening support for customers' private LLM construction
  • Developed an instruction-tuned Japanese LLM with 13 billion parameters
  • Developed a high-precision Japanese large language model (LLM) with 13 billion parameters

This news release is also available as a PDF file.

Ricoh has developed a high-performance Japanese LLM (70 billion parameters) equivalent to GPT-4 through model merging (224 KB, 2 pages).

* The company name and product name are trademarks or registered trademarks of each company.


