AMD has launched a new Stable Diffusion 3 Medium artificial intelligence (AI) model optimised for XDNA 2 neural processing units (NPUs). The chipmaker claimed that it is the world's first AI model that processes outputs in the BF16 format. The model will be supported on newer Ryzen AI laptops with at least 24GB of RAM, once users download Tensorstack's Amuse 3.1 beta software. Stable Diffusion 3 Medium is an on-device image generation model that does not require Internet connectivity.
AMD's Image Generation Model Can Generate Print-Ready Images
In a press release, the Santa Clara-based tech giant detailed the new image generation model. The AI model is based on Stable Diffusion 3 Medium and is optimised for the company's XDNA 2 NPUs, which are fitted in Ryzen AI laptops launched in 2024 and later.
The company claims the model can be used to generate stock-quality images from text prompts. The model generates images at 1024×1024 resolution, which are then upscaled to a print-ready 2048×2048 resolution using the NPU's capabilities.
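The resolution figures above can be worked through with a little arithmetic. The following is a minimal sketch, not anything taken from AMD or Amuse: the helper name `upscale_summary` is hypothetical, and the 300 DPI print density is an assumed common convention, not a figure from the announcement.

```python
def upscale_summary(base_side=1024, target_side=2048, dpi=300):
    """Return (linear scale, pixel-count ratio, print width in inches).

    base_side and target_side come from the article's 1024x1024 and
    2048x2048 figures; dpi=300 is an ASSUMED print density.
    """
    scale = target_side / base_side                    # linear upscale factor
    pixel_ratio = (target_side ** 2) / (base_side ** 2)  # growth in total pixels
    print_inches = target_side / dpi                   # side length when printed
    return scale, pixel_ratio, print_inches


if __name__ == "__main__":
    scale, pixel_ratio, inches = upscale_summary()
    print(f"{scale:.0f}x per side, {pixel_ratio:.0f}x the pixels, "
          f"about {inches:.2f} in per side at 300 DPI")
```

Under these assumptions, doubling each side quadruples the pixel count, and a 2048-pixel side prints at a little under 7 inches.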
The new AI model is part of AMD and Tensorstack's new Amuse 3.1 desktop app, which is free to download and install. Since the image generation model runs entirely locally, it works even when the device is not connected to the Internet. The data processing occurs on-device, powered by the XDNA 2 NPUs.
AMD said it has worked on the memory requirements of the AI model, which now requires 24GB of RAM instead of the 32GB that was necessary for the Stable Diffusion XL Turbo model. Additionally, the new image model consumes only 9GB of RAM while active. The company achieved this by using the memory-efficient block floating point 16, or block FP16 (BF16), format.
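To illustrate why a block floating point format saves memory, here is a minimal sketch of the general technique: a group of values shares a single exponent, and each value keeps only a small integer mantissa. This is a generic illustration, not AMD's actual block FP16 layout; the function names, the 8-bit mantissa, the 8-bit exponent, and the block size of 8 are all assumptions chosen for the example.

```python
import math


def bfp_quantize(block, mantissa_bits=8):
    """Quantize floats to one shared exponent plus signed integer mantissas.

    ASSUMPTION: mantissa_bits=8 is illustrative, not AMD's specification.
    """
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return 0, [0] * len(block)
    # frexp gives max_abs = m * 2**e with 0.5 <= m < 1; e becomes the shared exponent.
    _, shared_exp = math.frexp(max_abs)
    scale = 2.0 ** (shared_exp - (mantissa_bits - 1))
    limit = 2 ** (mantissa_bits - 1) - 1
    # Round each value to the nearest step and clamp to the signed range.
    mantissas = [max(-limit - 1, min(limit, round(x / scale))) for x in block]
    return shared_exp, mantissas


def bfp_dequantize(shared_exp, mantissas, mantissa_bits=8):
    """Rebuild approximate floats from the shared exponent and mantissas."""
    scale = 2.0 ** (shared_exp - (mantissa_bits - 1))
    return [m * scale for m in mantissas]


def bits_per_value(block_size=8, mantissa_bits=8, exponent_bits=8):
    """Average storage per value when one exponent is shared by the block."""
    return (block_size * mantissa_bits + exponent_bits) / block_size


if __name__ == "__main__":
    exp, mants = bfp_quantize([0.5, -0.25, 0.125, 1.0])
    print(exp, mants, bfp_dequantize(exp, mants))
    print(bits_per_value(), "bits per value versus 16 for plain FP16")
```

With the assumed parameters, each value costs 9 bits on average rather than 16, which is the kind of saving that lets a model fit in less RAM. The trade-off is that small values sharing a block with a large one lose precision.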
The tech giant highlighted that the Stable Diffusion 3 Medium AI model strictly follows the prompt, including its structure and order. AMD said users trying out the model should first describe the type of image, then the structural components, and finally the details and other context. Negative prompts can be used to remove elements from the image, and the placement of full stops can change how the model understands the context.
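The recommended prompt order can be sketched as a small helper that assembles the parts in sequence. This is a hypothetical illustration only: `build_prompt` is not part of Amuse or any AMD tooling, and the example prompt text is invented. The separator is left as a parameter because, as noted above, punctuation such as full stops can change how the model reads the prompt.

```python
def build_prompt(image_type, structure, details, separator=", "):
    """Join prompt parts in the order: image type, structure, then details.

    Hypothetical helper for illustration; empty parts are skipped.
    """
    parts = [image_type, *structure, *details]
    return separator.join(p.strip() for p in parts if p.strip())


if __name__ == "__main__":
    prompt = build_prompt(
        "Photograph",                       # 1. type of image
        ["a lighthouse on a cliff"],        # 2. structural components
        ["golden hour", "soft fog"],        # 3. details and other context
    )
    negative_prompt = "people, text"        # elements to keep out of the image
    print(prompt)
    print(negative_prompt)
```

Keeping the negative prompt as a separate string mirrors how image generation tools typically accept it as its own input.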