Tencent publishes patent for training large language models, said to improve model accuracy

Breakings · Feb 8 11:47

According to the Tianyancha App, a patent filed by Tencent Technology (Shenzhen) Co., Ltd. for "training methods, devices, computer equipment, and storage media for large language models" was published on February 7. According to the abstract, the method introduces a first summary text and a second summary text during training of the large language model, which gives the model more information to learn from. The two summaries carry different amounts of information, and the first summary contains both correct and incorrect statements. By comparing the two summaries of the same base text and distinguishing the correct statements from the incorrect ones in the first summary, the method avoids problems such as overfitting and inaccurate generation that arise when training relies on a single kind of summary text. The abstract states that this improves both the generalization performance and the accuracy of the model.
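The abstract does not describe a data format, labels, or a loss function. Purely as an illustration of the idea, the sketch below shows one way a single training example with two summaries could be represented and turned into two supervision signals. Every name here (`Statement`, `TrainingExample`, `build_targets`) is hypothetical, and using word count as a stand-in for "amount of information" is an assumption made for the example, not something taken from the patent.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Statement:
    text: str
    is_correct: bool  # hypothetical label: does the statement agree with the base text?


@dataclass
class TrainingExample:
    base_text: str
    first_summary: List[Statement]  # mixes correct and incorrect statements
    second_summary: str             # carries a different amount of information


def build_targets(example: TrainingExample) -> dict:
    """Derive two illustrative supervision signals from one example."""
    first_text = " ".join(s.text for s in example.first_summary)
    return {
        # Signal 1: mark each statement of the first summary as correct (1) or incorrect (0).
        "statement_labels": [int(s.is_correct) for s in example.first_summary],
        # Signal 2: compare the two summaries; word count is an assumed, crude
        # proxy for how much information each one carries.
        "first_is_longer": len(first_text.split()) > len(example.second_summary.split()),
    }


example = TrainingExample(
    base_text="The company reported revenue of 10 million in 2023.",
    first_summary=[
        Statement("Revenue was 10 million.", True),
        Statement("The report covers 2021.", False),
    ],
    second_summary="Revenue was reported.",
)
targets = build_targets(example)
```

In an actual training setup, signals like these would feed a model's loss alongside the usual generation objective; how the patent combines them is not stated in the abstract.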

The translation is provided by third-party software.

The above content is for informational or educational purposes only and does not constitute any investment advice related to Futu. Although we strive to ensure the truthfulness, accuracy, and originality of all such content, we cannot guarantee it.
