Mar 28, 2023

Introducing Genmo Chat

Genmo Chat screenshot

Generative models have demonstrated incredible capabilities in synthesizing content across modalities, including text, images, videos, and beyond.

With Genmo, we're taking it a step further by providing a creative copilot that works hand-in-hand with users to bring their creative visions to life.

We are gradually rolling out alpha access to Genmo Chat to creatives from our waitlist. While there are limitations, we are scaling up capacity and continuously working to improve Genmo's capabilities, safety, and understanding of user intent.

What can you create with Genmo Chat?

Create 3D assets

Generate 3D meshes and 360-degree videos with Genmo. Ask for an object like an ice cream sundae, or upload a photo and turn it into 3D.

Upload an image and animate part of it

Genmo can animate existing images. In this example, the user uploads an image of a starry night and asks Genmo to animate the sky into a timelapse. The user controls the animation by asking Genmo to animate only the sky and not the mountain.

Generate and edit movies

Genmo can generate and edit movies from scratch. In this example, the user asks Genmo to create a movie with a title. Genmo proposes ideas that the user can critique iteratively, then takes it from there to generate an edited video.

Here, Genmo opted to use our V2 video generation model because it can generate coherent global motion. Genmo also automatically selects transitions and text overlays to match the plotline.

Write a script, then generate a trailer

As in the previous example, Genmo generates and edits a movie from scratch. Here, the user asks Genmo to generate a movie called “Godfather: The Lunar Family”.

Genmo helps the user refine their ideas into a proposed script. Genmo generates a variety of scenes and transitions. In this example, the user works with Genmo to create a poster photo.

Note: These images were created with our V2 image generator. Genmo's current V3 image generator has significantly improved quality.

Edit and create photos with words

Replace content and change image styles with natural language. Genmo allows users to direct the creative process at a high level, while the model suggests specific details and calls the necessary tools to get the job done.

Expect even higher image quality today. The demo uses our old Genmo V2 model, and we've since upgraded to a new V3 image generator.

Design a presentation with app icons

Genmo can generate app icons as well. Here, Genmo makes icons for a “creative copilot”.

In response to user feedback, Genmo generates variations of the user's favorite icon. Finally, Genmo combines all the images into a slide deck to share with the team.

Bridging Humans and Generative Models

To bridge the gap between humans and generative tools, we're working on improving our models' understanding of user intent and context. This will allow for more seamless collaboration between users and their creative copilot, ultimately leading to better and more useful results.

Current Capabilities

As a creative assistant, Genmo supports a wide range of tools, such as text-to-image, image editing, image enhancement, video generation, and more. By using natural language, users can instruct Genmo to perform various tasks, including generating new images from descriptions, editing existing images, or even creating looping videos.

A Collaborative Future for Creative General Intelligence

We believe that collaboration is the piece missing from current generative AI models. We are building Genmo to transform the way people create content across modalities. Here's what we envision for the future:

  • Empowering creators of all levels: Good ideas can come from anyone, but not everyone has the skills to bring them to life. Genmo Chat can help everyone realize their ideas and create the content that they would like to see.

  • Safe and responsible AI: Our creative copilot will actively steer users away from generating content that may be harmful.

  • Enabling superhuman storytelling: Musicians already use Genmo to create music videos. We envision a future where a creative copilot can help you augment your stories with generations across modalities.