Chinese developer launches multimodal model unifying video, image, text

Xinhua | Updated: 2024-10-22 11:03

BEIJING -- The Beijing Academy of Artificial Intelligence (BAAI) on Monday released Emu3, a multimodal world model that unifies the understanding and generation of text, images, and video through next-token prediction.

Emu3 successfully validates that next-token prediction can serve as a powerful paradigm for multimodal models, scaling beyond language models and delivering state-of-the-art performance across multimodal tasks, said Wang Zhongyuan, director of BAAI, in a press release.

"By tokenizing images, text, and videos into a discrete space, we train a single transformer from scratch on a mixture of multimodal sequences," Wang said, adding that Emu3 eliminates the need for diffusion or compositional approaches entirely.
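The idea Wang describes can be illustrated with a minimal sketch. It is not BAAI's implementation: the vocabulary size, the marker ids `BOI` and `EOI`, the `VISION_OFFSET`, and both helper functions are hypothetical, chosen only to show how text tokens and discrete image codes might share one sequence that a single model learns to continue token by token.

```python
# Illustrative only: all ids and sizes below are assumptions, not Emu3's real values.
TEXT_VOCAB_SIZE = 1000              # assumed size of the text vocabulary (ids 0..999)
BOI = TEXT_VOCAB_SIZE               # assumed "begin of image" marker id
EOI = TEXT_VOCAB_SIZE + 1           # assumed "end of image" marker id
VISION_OFFSET = TEXT_VOCAB_SIZE + 2 # assumed first id of the visual codebook


def build_sequence(text_ids, image_codes):
    """Join text token ids and discrete image codes into one flat id sequence."""
    vision_ids = [VISION_OFFSET + code for code in image_codes]
    return list(text_ids) + [BOI] + vision_ids + [EOI]


def next_token_pairs(sequence):
    """Return (context, target) pairs: every token is predicted from its prefix."""
    return [(sequence[:i], sequence[i]) for i in range(1, len(sequence))]


# A caption of three text tokens followed by an image of four visual codes.
seq = build_sequence(text_ids=[5, 17, 42], image_codes=[0, 3, 3, 7])
pairs = next_token_pairs(seq)
```

Because text and image codes live in the same id space, the training objective is identical for every position in the sequence, which is the sense in which a separate diffusion component is not required in this kind of design.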

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, according to BAAI, which has open-sourced the key technologies and models of Emu3 to the international technology community.

Technology practitioners said a new opportunity has emerged to explore multimodality through a unified architecture, without the need to combine complex diffusion models with large language models (LLMs).

"In the future, the multimodal world model will promote scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference," Wang said.
