Intel presents an AI model that generates 3D images from text

2023-06-26 10:56:40

Computing giant Intel has just unveiled LDM3D, the industry’s first generative AI model to provide depth mapping. It has the potential to revolutionize content creation, the metaverse, and digital experiences.

Focus on 3D visual content

Intel Labs, in collaboration with Blockade Labs, introduced the Latent Diffusion Model for 3D (LDM3D), a new diffusion model that uses generative AI to create realistic 3D visual content. LDM3D is the industry’s first model to generate a depth map using the diffusion process to create 3D images with vivid and immersive 360 ​​degree views.

Photo credit: Intel

LDM3D has the potential to revolutionize content creation, metaverse applications and digital experiences, transforming a wide range of industries, from entertainment and games to architecture and design.

AI at the service of creativity

“Generative AI technology aims to further augment and enhance human creativity and save time. However, most generative AI models today are limited to generating 2D images and very few can generate 3D images from text prompts. Unlike existing latent stable diffusion models, LDM3D allows users to generate an image and depth map from a given text prompt using almost the same number of parameters. It provides more accurate relative depth for each pixel in an image compared to standard post-processing methods for depth estimation and saves developers significant time developing scenes” said Vasudev Lal, AI researcher at Intel Labs.

Photo credit: Intel

A strong competitive advantage

Intel’s commitment to truly democratizing AI will enable broader access to its benefits through an open ecosystem. Indeed, many contemporary generative AI models are limited to generating only 2D images. Unlike existing broadcast models, which typically only generate 2D RGB images from text prompts, LDM3D allows users to generate both an image and a depth map from a given text prompt.

Related Articles:  EGS has released a first version of the shooter Witchfire from the creators of Painkiller and Bulletstorm. The developers released a trailer for the game

Photo credit: Intel

This research could revolutionize the way we interact with digital content by allowing users to experience their text prompts in ways previously inconceivable. This ability to capture depth information can instantly enhance realism and overall immersion, enabling innovative applications for industries ranging from entertainment and gaming to interior design and real estate listings, as well as virtual museums and immersive virtual reality (VR) experiences.

Photo credit: Intel
1687779173
#Intel #presents #model #generates #images #text

Leave a Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.