Redian新闻
>
Chinese Startup Unveils AI Video Software to Rival OpenAI’s Sora

Chinese Startup Unveils AI Video Software to Rival OpenAI’s Sora

社会

Shengshu-AI claims its Vidu software can develop high-quality videos lasting up to 16 seconds, far surpassing previous Chinese text-to-video models.

A Chinese startup has unveiled an artificial intelligence-powered system capable of generating high-definition videos lasting up to 16 seconds, marking a major breakthrough for China’s AI industry as it races to catch up with the United States’ leading firms.

Shengshu-AI, a Beijing-based startup that was founded only last year, presented the new system — which it has named Vidu — at the Zhongguancun Forum in Beijing on Saturday, describing it as China’s “first long-duration, high-consistency, and highly dynamic video generation model.”

Many in China have been quick to dub Vidu China’s answer to Sora, the text-to-video model created by OpenAI that sent shockwaves around the world when it was unveiled in February.

For now, it appears that Vidu is still some way from matching Sora’s capabilities. According to Shengshu-AI, Vidu can generate high-definition videos lasting up to 16 seconds, whereas Sora can generate 60-second clips.

A GIF from a video clip powered by Vidu.

But this would still put Vidu at the very cutting edge of the rapidly evolving AI-generated content field. Most of the leading text-t0-video models, including Pika and Gen-2, only produce clips lasting up to 4 seconds.

Unlike those models, Vidu is not yet publicly available, and Shengshu-AI has yet to confirm when it will be formally launched. But the company performed a live demonstration of the system at the forum and said it was open to working with partners to further fine-tune its technology.

Shengshu-AI is one of many startups to have emerged during the frenzy of AI-related investment in China since the release of OpenAI’s ChatGPT in late 2022.

The firm was founded in March 2023 with Zhu Jun, a leading AI researcher at Beijing’s prestigious Tsinghua University, joining as chief scientist. It has since raised over 100 million yuan ($14 million) from investors, including the Chinese tech giants Ant Group and Baidu.

At the Zhongguancun Forum, Zhu said that Vidu was capable of generating scenes that are consistent with the laws of physics and contain rich details, such as realistic shadow effects and facial expressions.

In another nod to Shengshu-AI’s ambitions to rival OpenAI, the live demonstration of Vidu that followed featured a video almost identical to the one used to launch Sora — a clip of a car driving along a mountain road.

A GIF from a video clip powered by Vidu.

A GIF from a video clip powered by Sora.

The primary technology underpinning Vidu is the Universal Vision Transformer, which combines two AI models: Transformer and Diffusion. It is similar to Sora’s Diversity in Transformation architecture, but Shengshu-AI claims that its research team developed its system before OpenAI, releasing a related paper in September 2022.

“After Sora’s release in February, we found that our technical roadmaps are highly aligned, and we became even more determined to press forward with our own research,” Zhu said at the forum.

The release of Sora earlier this year astonished many in China, as the technical challenges involved in generating AI video far surpass those involved in creating text and still images. The hashtag “Sora” received over 100 million views on the Chinese microblogging platform Weibo within a week of the product’s launch.

Within China’s AI industry, there were fears that the launch of Sora showed that the gap between Silicon Valley and China was widening. But Shengshu-AI has been bullish about its ability to catch up with the U.S.’s market leaders.

As recently as February, Vidu was reportedly only capable of generating 4-second clips, but that has increased fourfold in just a few months. In March, Shengshu-AI’s CEO, Tang Jiayu, told domestic media: “It’s certain that the model can reach Sora’s level this year, though it’s difficult to say whether it will take three months or six months.”

A GIF from video clips powered by Vidu.

With its demonstration of Vidu, Shengshu-AI has proved itself a leader in China’s AI sector, Chen Chen, a partner at consultancy Analysys, told domestic media. Yet Sora remains far ahead in terms of the duration, diversity, and richness of its videos, Chen added.

China’s tech industry continues to invest heavily in AI content generation. Major AI models including ChatGPT, Stable Diffusion, and Midjourney are unavailable in China, leaving a large hole in the market for domestic firms to fill.

In recent months, major tech firms including ByteDance, Kuaishou, Tencent, and SenseTime, as well as a host of smaller players, have reported progress in developing text-to-video AI tools. However, several have stressed that their products remain in their infancy.

According to market researchers iResearch, the value of China’s AI-generated content market is predicted to grow at 87% annually for the remainder of the decade, twice the speed of the global market.

(Header image: Shengshu Technology and Tsinghua University launch Vidu, a text-to-video model, at the 2024 Zhongguancun Forum in Beijing, April 27, 2024. CNS)


Download the new Sixth Tone app at the App Store or Google Play
APK file for Android:
https://image4.sixthtone.com/pkg/sixthtone.apk
(Copy URL and open in browser)

微信扫码关注该文公众号作者

戳这里提交新闻线索和高质量文章给我们。
相关阅读
Piece by Piece, an Ancient Chinese Craft Is Shaping Future Toys闰年是不祥之年?Year After Going Under, China’s Iconic Online Forum Plans ReturnFor Stressed Young Chinese, Chiikawa Toys Are Digital IbuprofenArt of Adaptation: How Yue Opera Is Winning Over Young ChineseChinese Parents Turn to ‘Magic Potions’ to Help Kids Run FasterLove and Money: The Dating Scene for China’s MillionairesChinese Parents Falling Prey to Dubious Myopia ‘Miracle Cures’For Chinese Students, the New Tactic Against AI Checks: More AIHow DAN, ChatGPT’s Rogue Twin, Is Wooing Young ChineseOff the Books: Inside the Struggle to Save China’s PreschoolsMy Child Spent a Fortune on a Chinese Video Game. What Now?家 园In China, the Hottest Travel Accessory Is a Tenured Professor笑談國之怪現況 50 為人民服務我不敢Zen of One: A Canadian’s Pursuit of Ancient Chinese Aesthetic李宁公司的大股东创立合营企业,将瑞典百年户外品牌 Haglöfs 引入大中华区China’s ‘Supernanny’ Stirs Controversy With Ultra-Harsh MethodsHow a Student’s Fake Exercise Book Broke the Chinese InternetNot Just Toys: How Young Chinese Are ‘Parenting’ Dolls【求职战报】全球金融交易平台Deutsche Börse Systems销售运营面试邀约!The New Talent Show Striking Fear Into China’s Biggest Pop Stars看看AI startup Devin的创办人ABC小孩Scott Wu 小时候的数学有多牛吧!Chinese Soccer Has a New Hero: Singapore’s Veteran GoalkeeperThe Firefighter Documenting Sichuan’s Plateau Forest FiresOpenAI 宣布终止对中国、朝鲜、俄罗斯等地区提供 API 服务,大家怎么看?中國應該定都杭州挺突然呀,OpenAI CEO奥特曼和他的丈夫承诺捐出大部分财富Are Young Chinese Falling Out of Love With Love?StartupLast Stop: Looking Past the Stigma Facing China’s MorticiansPeople Mountain, People Sea: Labor Day Holiday in PhotosOpenAI releases realProperty to Virtual Goods, More Young Chinese Are Drafting WillsChinese Shopping Platforms Phase Out Unpopular Presale SchemesYoung Chinese Have Almost No Concerns About AI, Survey FindsUniversities Introduce Weight Loss Classes for Unfit StudentsIRS Tax Seminar - IRS Expert Reveals Tax Saving Secrets for You!How ‘Farming Literature’ Became China’s Hottest Genre
logo
联系我们隐私协议©2024 redian.news
Redian新闻
Redian.news刊载任何文章,不代表同意其说法或描述,仅为提供更多信息,也不构成任何建议。文章信息的合法性及真实性由其作者负责,与Redian.news及其运营公司无关。欢迎投稿,如发现稿件侵权,或作者不愿在本网发表文章,请版权拥有者通知本网处理。