The Chinese research team may have solved the problem of affordable high-resolution AI videos
- Researchers have developed a new AI video generator
- Generator creates several low-resolution stages before generating a high-resolution final
- The AI generator is made completely open source for individual and commercial use
A team of researchers from Beijing University, Kuaishou Technology and Beijing University of Posts and Telecommunications have made great progress in the field of AI video generation.
They have developed a new AI model, Pyramid Flow, that can generate high-resolution virtual video images in 768p.
Unlike many proprietary AI models developed in recent years, the creators of Pyramid Flow have made their model open source, giving developers and users free access to the underlying technology.
A breakthrough in AI video generation?
The Pyramid Flow model takes a different approach than many of its predecessors by generating low-resolution videos in multiple stages before producing a final high-resolution output.
This method reduces the computing power required to run the model, making it more cost-effective for users. The team claims that the model can produce a five-second video clip at 384p resolution in just 56 seconds.
Pyramid Flow is also available as open source software where the developers make the model freely accessible under the MIT license, meaning anyone can download, modify and use the code for personal or commercial purposes.
In addition to the model itself, the researchers have posted several example videos that demonstrate the quality of Pyramid Flow’s output. The results show highly realistic, detailed images, including examples such as underwater explosions that create bubbles and splashes. Additionally, the datasets used to train Pyramid Flow, which consist of approximately 10 million short videos, are also available to the public.
The use of open-source datasets in generating AI videos has been a contentious issue, with some critics claiming that such practices violate the rights of copyright holders.
The research team behind Pyramid Flow hasn’t specifically addressed these concerns, but they do suggest that the model could be a useful tool for refining open source material, potentially reducing the need to rely on third-party sources.
Via TechXplore