InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

InfiniteYou is a robust framework developed by ByteDance for identity-preserving image generation. Leveraging Diffusion Transformers (DiTs), specifically FLUX, it addresses challenges like insufficient identity similarity and poor text-image alignment. Central to InfiniteYou is InfuseNet, a component that injects identity features into the DiT base model via residual connections, enhancing identity similarity while maintaining generation capabilities. A multi-stage training strategy, including pretraining and supervised fine-tuning with synthetic single-person-multiple-sample (SPMS) data, further improves text-image alignment and image quality.

Key Features

  • Utilizes InfuseNet to integrate identity features into the DiT model.
  • Employs a multi-stage training strategy with pretraining and supervised fine-tuning.
  • Supports two model variants: aes_stage2 for better text-image alignment and aesthetics, and sim_stage1 for higher identity similarity.
  • Plug-and-play design ensures compatibility with various existing methods.
  • Official ComfyUI node available for seamless integration

Project Screenshots

Project Screenshot
Project Screenshot
Project Screenshot
Project Screenshot