来源:智通财经
IDC发文称,近日OpenAI陆续发布ChatGPT,GPT-4,引发了AI界的全民狂欢,文生图类应用如StableDiffusion、Midjourney以及DALL·E2也开始快速流行。百度则于3月16日召开文心一言发布会,展示了中国厂商的大模型以及生成式AI能力。人工智能市场正式开启了全新的时代——大模型驱动的AI时代。对此,IDC十问GPT与AIGC真实现状与未来发展。
大模型、ChatGPT以及AIGC的关系
IDC定义的AI应用均是指基于机器学习算法的AI决策系统。大模型则是指读取海量数据、参数规模巨大的算法模型。业界一般认为超过千亿级参数即为大模型,其训练过程中可能使用了上千张以上的GPU/CPU芯片。ChatGPT与AIGC均为大模型的应用场景之一。ChatGPT可以类比原有的对话式AI应用、AI赋能的搜索类应用。AIGC则可以分为生成文本、生成图像、生成视频,也可以归为大模型的应用场景之一。
GPT-4为代表的大模型的变革所在
OpenAI自发布GPT1.0模型之后,一直在持续迭代,陆续发布GPT2.0、GPT3.0和GPT 3.5,本次发布GPT4.0是其持续投入AI大模型的必然阶段。相比前几个模型,GPT-4的参数量更大,模型迭代时间更长,也能够给出更准确的结果。IDC认为,新版本的发布是大模型循序渐进发展的必然成果。正如百度集团首席执行官李彦宏所说:“公司每一年都会发布大模型的新版本,是多年努力的自然延续”。
ChatGPT可能带来的产业影响
ChatGPT实质是对话式AI的应用,对话式AI的落地已经非常广泛。根据IDC追踪的人工智能市场规模数据,对话式AI市场规模在2022年达到54.6亿元人民币,其市场渗透率相对已经饱和。ChatGPT引发的浪潮促使主流厂商在其对话式AI应用中引入大模型,将带动对话AI相关市场新一轮增长。此外,在搜索、营销场景中,ChatGPT类型的应用则可能衍生出全新的产品形态。
市面上可用的产品
关于AIGC,除了大众所熟知的StableDiffusion, Midjourney以及DALL·E2之外,也有些商业公司对此提供云端支持。目前亚马逊云科技通过IndustryAI以及SageMaker提供了Stable Diffusion的支持。百度的文心一言已于3月16日开启邀测,提供文学创作、商业文案创作、数理逻辑的推算、自然语言理解以及多模态生成五大功能。此外,还有很多数字人的公司也采用了AIGC相关技术。从技术的角度,当前市面上的产品大多只能做到文生图,文生视频类产品的发布则还需要时间,值得期待。
关于大模型,在开源社区已经发布的大模型之外,目前提供商用的大模型包括微软Azure上整合的GPT大模型、百度智能云以及百度飞桨支持的文心大模型、华为云盘古大模型、阿里云M6大模型。由本土厂商研发的大模型,大多支持本地化部署。
引发的AI行业变革
过去几年部署的AI应用,接下来几年都有可能被基于大模型的AI所替代。升级迭代可能会从优先具备海量数据的场景开始。当大模型支撑的AI应用成为主流,不能利用大模型能力的厂商将失去竞争优势。
未来的工作中,AI助理将替代更多人类的工作。诸如文生图的应用,诸如各领域初级内容的搜索,均可以借助AI生成的内容。
可能的投资规模
目前已经公开的大模型诸如GPT系列、Bert系列所耗费的算力根据公开资料可以查到。而真正落地到产业界,具体的投资规模要视应用场景决定。投资成本与所需的算力,是否部署完整的大模型,以及要推理的数据流量相关。
带动的市场机会
纯AI算力市场:在这一波AI热潮中最先最直接受益的即AI算力提供商,包括芯片厂商、AI服务器厂商,以及支撑大模型训练和推理的AI算力云服务商。
大模型与算力的结合:即AIaaS+AIPaaS。为市场提供大模型与算力结合后高度优化的方案,以帮助用户降低硬件使用门槛、提高开发效率、降低整体投资成本。典型的解决方案如百度的“AI大底座”,商汤的“AI大装置”。
大模型即服务:开放大模型开发平台供外部用户使用。这一市场属于高度创新的市场,但仍存在较高的进入壁垒。
从何处着手跟随本次AI浪潮
大模型厂商都在着手将现有的AI软件升级为大模型支撑的AI应用。可以根据应用场景优先级与合作伙伴联系引入大模型支持的AI。而在MaaS(模型即服务)产品层面,市场上可选的成熟产品并不多,预计今年下半年会有数十家厂商的产品上线。可以率先选择数据隐私要求不高的领域在公有云上测试大模型能力。
新一代AI需要注意的问题
生成式AI生成内容的版权需提前规划。生成式AI读取海量数据后生成的图片等内容有可能会引起版权问题,需要提前从规则上加以控制。
对原有流程的改变:一方面生成式AI生成的内容还需要人类审核才能发布,另一方面可能会要求工作流程上做出改变以适配AIGC的加入。
鉴于其仍处于技术成熟度的早期阶段,在传统行业应用场景不十分清晰,投入产出比目前也难以评估。
跳出今天的AIGC看未来AI应用
借鉴今天的文生图、文生视频类应用,其实大多是基于过去几年已有的小模型通过各种技术路线实现的AI应用。类似的、各行各业的应用场景,都可以基于现有的AI模型,以低代码的形式拼接出人人可上手的AI应用,甚至未来的AI应用,都可能是输入自然语言直接输出结果的形式。
IDC中国研究总监卢言霞表示,新一代AI热度持续走高,然而由于其较低的技术成熟度、较高的部署成本,实际落地还需谨慎。但宏观趋势上,以大模型、生成式AI为代表的快速迭代的技术必然会催生全新的AI时代。
编辑/jayden
Source: Zhitong Finance
According to an IDC article, recently OpenAI released ChatGPT and GPT-4, which sparked a public carnival in the AI world, and graphic applications such as StableDiffusion, Midjourney, and Dall·E2 have also rapidly become popular. Baidu, on the other hand, held a Wenxin one-word press conference on March 16, showcasing the big models and generative AI capabilities of Chinese manufacturers. The artificial intelligence market has officially begun a new era — the era of AI driven by big models. In response to this,IDC asks ten questions about the real status and future development of GPT and AIGC.
The relationship between big models, ChatGPT, and AIGC
All AI applications defined by IDC refer to AI decision-making systems based on machine learning algorithms. A large model refers to an algorithm model that reads massive amounts of data and has huge parameter scales. The industry generally believes that parameters exceeding 100 billion levels are large models, and that thousands or more GPU/CPU chips may have been used during the training process. ChatGPT and AIGC are both big model application scenarios. ChatGPT can be compared to the original conversational AI applications and AI-enabled search applications. AIGC can be divided into generating text, generating images, and generating videos. It can also be classified as one of the application scenarios for big models.
Where are the changes in the big model represented by GPT-4
Since the release of the GPT1.0 model, OpenAI has continued to iterate and released GPT 2.0, GPT 3.0, and GPT 3.5. The release of GPT4.0 is an inevitable stage for it to continue investing in the AI big model. Compared to previous models, GPT-4 has a larger number of parameters, takes longer to iterate on the model, and can also give more accurate results. IDC believes that the release of the new version is an inevitable result of the gradual development of the big model. As Li Yanhong, CEO of Baidu Group, said, “The company releases a new version of the big model every year, which is a natural continuation of years of hard work.”
Possible industrial impact of ChatGPT
ChatGPT is essentially an application of conversational AI, and conversational AI has been widely implemented. According to artificial intelligence market size data tracked by IDC, the conversational AI market size reached 5.46 billion yuan in 2022, and its market penetration rate is relatively saturated. The wave triggered by ChatGPT has prompted mainstream manufacturers to introduce big models into their conversational AI applications, which will drive a new round of growth in conversational AI-related markets. Furthermore, in search and marketing scenarios, ChatGPT type applications may spawn new product forms.
Products available on the market
Regarding AIGC, in addition to the well-known StableDiffusion, Midjourney, and Dall·E2, some commercial companies also provide cloud support for this. Currently, Amazon Web Technology provides Stable Diffusion support through IndustryAI and SageMaker. Baidu's Wenxin Yigen began solicitation on March 16, providing five major functions: literary creation, commercial copywriting, mathematical logic estimation, natural language understanding, and multi-modal generation. In addition, many digital human companies have also adopted AIGC-related technology. From a technical point of view, most of the products currently on the market can only be made by Wensheng, and the release of Wensheng video products will take time, which is worth looking forward to.
Regarding the big model, in addition to the big model already released by the open source community, the big models currently used by the provider include the GPT big model integrated on Microsoft Azure, Baidu Smart Cloud and the Wenxin model supported by Baidu Flying Paddle, the Huawei Cloud Pangu model, and the Alibaba Cloud M6 model. Most of the large models developed by local manufacturers support localized deployment.
Triggered changes in the AI industry
AI applications deployed in the past few years are likely to be replaced by AI based on big models in the next few years. The upgrade iteration may begin with prioritizing scenarios with massive amounts of data. When AI applications supported by big models become mainstream, manufacturers that cannot utilize the capabilities of big models will lose their competitive advantage.
In future jobs, AI assistants will replace more human jobs. Applications such as Wensheng Map, such as searching for elementary content in various fields, can all use AI-generated content.
Possible investment size
The computing power consumed by large models that have been disclosed so far, such as the GPT series and the Bert series, can be found according to public data. However, to actually land in the industry, the specific investment scale must be determined by application scenarios. The cost of investment is related to the computing power required, whether to deploy a complete big model, and the data flow to be inferred.
Driven market opportunities
Pure AI computing power market: The first to directly benefit from this wave of AI boom are AI computing power providers, including chip manufacturers, AI server vendors, and AI computing power cloud service providers that support big model training and inference.
A combination of big models and computing power: AIaaS+AiPaaS. The market is provided with highly optimized solutions combining large models and computing power to help users lower the threshold for hardware use, improve development efficiency, and reduce overall investment costs. Typical solutions include Baidu's “AI Platform” and Shang Tang's “AI Big Device”.
Big model as a service: Open the big model development platform for external users. This market is a highly innovative market, but there are still high entry barriers.
Where to start following this AI wave
Big model manufacturers are starting to upgrade existing AI software to AI applications supported by big models. AI supported by big models can be introduced by connecting with partners according to application scenario priorities. However, at the MaaS (model as a service) product level, there are not many mature products to choose from on the market. It is expected that products from dozens of manufacturers will be launched in the second half of this year. You can take the lead in testing big model capabilities on public clouds in fields where data privacy requirements are not high.
Issues that the next generation of AI needs to pay attention to
Copyright for generative AI-generated content needs to be planned in advance. Content such as images generated by generative AI after reading massive amounts of data may cause copyright issues and need to be controlled by rules in advance.
Changes to the original process: On the one hand, content generated by generative AI also requires human review before it can be published, and on the other hand, changes in the workflow may be required to adapt to the addition of AIGC.
Since it is still in the early stages of technological maturity, application scenarios in traditional industries are not very clear, and input and output are more difficult to evaluate than at present.
Break out of today's AIGC to see future AI applications
Drawing on today's Wensheng Map and Wensheng Video applications, in fact, most of them are AI applications implemented through various technical routes based on small models that have existed in the past few years. Similar application scenarios in various industries can be based on existing AI models to create AI applications that everyone can use in the form of low code. Even future AI applications may be in the form of inputting natural language and directly outputting results.
Lu Yanxia, director of IDC China research, said,The popularity of next-generation AI continues to rise, but due to its low technical maturity and high deployment costs, actual implementation still requires caution. However, in terms of macro trends, rapid iterative technology represented by big models and generative AI will inevitably spawn a new era of AI.
Editor/jayden