While OpenAI’s forthcoming Sora AI video generation tool has creatives both tantalized and horrified, a new player has emerged that has many AI enthusiasts crowing about its capabilities. Developed by Kuaishou, a Chinese tech giant and competitor to TikTok, Kling is also already available—with some caveats.
Kling's capabilities surpass the competition in several significant ways. Kling can generate videos up to two minutes long in 1080p resolution at 30 frames per second. This is a substantial improvement over Pika and RunwayML, which can generate only a few seconds each—and it’s twice as much as what OpenAI said Sora would handle.
Sora by OpenAI is insane.
But KWAI just dropped a Sora-like model called KLING, and people are going crazy over it.
Kling appears on the scene shortly after the release of Vidu, another Chinese AI video creation tool developed by Shengshu Technology and Tsinghua University. Vidu generates shorter videos, but can create highly realistic and detailed videos with Universal Vision Transformer (U-ViT) technology, which has already proven superior to Runway and Pika.
Midjourney, the generative image creation tool perhaps best known for running inside a Discord server, is spreading its AI wings. The creators of Midjourney announced on Tuesday that they plan to introduce a “text to video” model in the next few months.
The company will begin training its video models starting in January, CEO David Holz said during an "Office Hour" Discord session. This move represents a natural progression for the platform, building upon a mature image model to stir the competi...
For its part, Kling uses advanced 3D face and body reconstruction technology to generate realistic movements and limb movements based on a single, full-body picture. It also avoids other common problems that some AI video generators have when depicting people, such as extra or impossibly bent limbs.
— 木内翔大@SHIFT AI代表「日本をAI先進国に」𝕏 (@shota7180) June 9, 2024
Like Sora, Kling also has a better understanding of real-world physics, allowing for more accurate simulation of physical interactions between objects. It’s also designed to accurately follow prompts and create sequences of shots with multiple views, allowing for more complex and dynamic videos.
Image: Kling/Kuaishou
Kuaishou released a series of demo videos showing how Kling would render the scenes OpenAI demonstrated with Sora. The videos seemed very realistic, with good scene compositions and movements. The quality was superior to Runway and Pika in terms of realism and consistency, and was on par with Sora—even surpassing it in some generations.
Some users have shared other creations on social media, and they seem convincing enough to justify the hype.
A cute black and white border collie, with its head sticking out of the car window, its fur blown by the wind, smiling and sticking out its tongue pic.twitter.com/6cuJbPqsxB
Kling is currently available as a public demo in China through a waitlist, but Kuaishou is promising a wider global release.
The Kling website is in Mandarin and appears to require a Chinese phone number in order to register. You will also need to download the Kwaicut app and follow the steps shown on the official Kling-AI website before securing a place in line.
Until then, for those looking to generate AI videos, the best choices are Pika Labs, RunwayML and—for local generation—Stable Video Diffusion.
OpenAI has just overtaken the AI image generation race once more.
The tech giant's integration of native image generation directly into ChatGPT via its GPT-4o model is not an incremental change but a major overhaul of the model, vaulting it to the front of the class.
Within hours of its release yesterday, the model quickly went viral, with anime-style creations flooding social platforms and showcasing technical capabilities that leave DALL-E 3 in the dust.
ChatGPT when another Studio Ghibli req...
Clear communication is crucial in today’s global workforce, but for some, it's easier than for others.
On Tuesday, Krisp, an audio AI software developer, introduced an Accent Conversion tool to break down language barriers by altering how a person’s voice comes through to listeners.
While the technology promises inclusivity, it also raises concerns about cultural biases, cultural identity, and what it means to “sound professional.”
Krisp’s AI Accent Converter aims to enable more inclusive and eq...
Star Atlas, an upcoming Solana-based sci-fi video game, is collaborating with the Artificial Superintelligence Alliance founding member SingularityNET to integrate AI agents into its spacefaring online world, the two companies announced on Wednesday.
Star Atlas developer ATMTA said the collaboration aims to use AI agents to provide an immersive gaming experience using SingularityNET’s Autonomous Intelligent Reinforcement Interpreted Symbolism, or AIRIS, system, along with the OpenCog Hyperon fra...