Poster
Effective and Efficient Masked Image Generation Models
Zebin You · Jingyang Ou · Xiaolu Zhang · Jun Hu · JUN ZHOU · Chongxuan Li
East Exhibition Hall A-B #E-3107
Our new AI, eMIGM, creates realistic pictures by learning to fill in missing parts of images, much like solving a puzzle. We discovered that by combining the strengths of two existing image generation methods into a unified system, and carefully refining how it learns and creates, we could significantly boost both image quality and generation speed. eMIGM learns more effectively when more of an image is hidden during its training. When generating new pictures, it cleverly predicts fewer details initially and only receives stronger guidance later on, making it much faster. As a result, eMIGM produces high-quality images, outperforming standard methods and matching top-tier ones with significantly less computational power, even on large images. This work makes high-quality AI image generation more effective and efficient, and our code is publicly available.