Skip to main content

Nvidia's next AI can turn words into photorealistic images

Nvidia deep learning technologies continue to do wonderful and weird things. Just a few weeks ago we saw how the company can use AI to automatically match voice lines to 3D animated faces. This cool kind of tech that can help people create great things with ease, or in the case of Nvidia’s latest unveiling, potentially horrible things, but still with ease.

We first saw Nvidia's GauGAN a few years back. It was demonstrated to turn basic doodles into photorealistic images with a click of a few buttons. It's pretty neat stuff, and definitely worth playing with. Now GuaGAN2 is out and it doesn’t even need your sketches to make highly detailed landscape images.

Instead, the deep learning model can turn text into what appear to be surprisingly complex images. It's been trained on 10 million landscape images, so it seems to know what it’s doing. The example shows how the image changes to suit more text as it is added, implying changes are easily made on the fly.

Black Friday deals

Black Friday deals

Black Friday 2021 deals: the place to go for the all the best early Black Friday bargains.

GuaGAN 2 works by using the tech we saw in GauGAN that can turn shapes into images, combined with text-to-image generation. This means a mix of drawings and words can be used to create these images. Users can start with a general statement to get a broader image and then are able to go in and alter the little details by hand. 

If you want to play with some of this tech, you can try it out on Nvidia's interactive AI demos, though we had trouble getting it to work. Or, if you have an Nvidia RTX GPU, you can download Nvidia Canvas for free. It offers a nice taste of painting with AI, which is just impressive to see work in real time with your own sketches.

Insert quote here

As for game implications of this technology, basically anything deep learning is likely to cut down time for developers. People can quickly mock up test scenes with tech in seconds. Haven’t figured out quite how the lookout from the cliff of your alien world looks, need a reference image you can’t find online? It has so many practical uses for anyone looking to create.

Though personally I'm hoping someone makes use of this tech into a cool pseudo-text adventure where you write the visuals into life. It could even work in with Nvidia's planned AI haptics somehow to be even more wonderful. Or disturbing. 

Hope Corrigan

Hope’s been writing about games for about a decade, starting out way back when on the Australian Nintendo fan site Vooks.net. Since then, she’s talked far too much about games and tech for publications such as Techlife, Byteside, IGN, and GameSpot. Of course there’s also here at PC Gamer, where she gets to indulge her inner hardware nerd with news and reviews. You can usually find Hope fawning over some art, tech, or likely a wonderful combination of them both and where relevant she’ll share them with you here. When she’s not writing about the amazing creations of others, she’s working on what she hopes will one day be her own. You can find her fictional chill out ambient far future sci-fi radio show/album/listening experience podcast at BlockbusterStation.buzzsprout.com. No, sadly she’s not kidding.