NVIDIA Unveils PYoCo, a New Text-to-Video Diffusion Model

The model is capable of high-quality zero-shot video synthesis with superior photorealism and temporal consistency.

A team of researchers from NVIDIA, the University of Chicago, and the University of Maryland has unveiled PYoCo, a large-scale text-to-video diffusion model that builds on eDiff-I, a cutting-edge image generation model, and adds a novel video noise prior.

According to the developers, the model incorporates various effective techniques from prior studies, such as temporal attention, joint image-video fine-tuning, a cascaded generation architecture, and an ensemble of expert denoisers, surpassing other methods on numerous benchmark datasets. The paper shared by the team also highlighted the model's ability to achieve high-quality zero-shot video synthesis, boasting superior photorealism and temporal consistency.

"We propose a video diffusion noise prior tailored for fine-tuning text-to-image diffusion models for text-to-video synthesis," comments the team. "We show that fine-tuning a text-to-image diffusion model with this prior leads to better knowledge transfer and efficient training. On the small-scale unconditional generation benchmark, we achieve a new state-of-the-art with a 10× smaller model and 14× less training time. On the zero-shot MSR-VTT evaluation, our model achieves a new state-of-the-art FID of 9.73."
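To give a rough sense of what a video noise prior can mean in practice, here is a minimal sketch of one simple way to correlate noise across frames: each frame's noise is the sum of a component shared by all frames and a component drawn independently per frame. This is an illustrative assumption, not the team's implementation; the function name `mixed_noise` and the `alpha` parameter are hypothetical and do not come from the article.

```python
import math
import random


def mixed_noise(num_frames, frame_size, alpha, rng=None):
    """Return per-frame noise vectors that share a common component.

    Illustrative assumption only: each frame's noise is shared + independent,
    with variances alpha^2 / (1 + alpha^2) and 1 / (1 + alpha^2), so every
    element is still marginally N(0, 1). alpha = 0 gives independent frames;
    larger alpha makes frames more strongly correlated.
    """
    rng = rng or random.Random()
    shared_std = math.sqrt(alpha ** 2 / (1 + alpha ** 2))
    indep_std = math.sqrt(1 / (1 + alpha ** 2))
    # One noise vector shared by every frame in the clip.
    shared = [rng.gauss(0.0, shared_std) for _ in range(frame_size)]
    # Each frame adds its own independent noise on top of the shared part.
    return [
        [s + rng.gauss(0.0, indep_std) for s in shared]
        for _ in range(num_frames)
    ]


if __name__ == "__main__":
    frames = mixed_noise(num_frames=4, frame_size=8, alpha=2.0, rng=random.Random(0))
    print(len(frames), len(frames[0]))
```

Under this construction, the correlation between the noise of any two frames is alpha^2 / (1 + alpha^2), while each frame on its own still looks like standard Gaussian noise, which is what an image diffusion model expects as input.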

