Z-image reimagine project
A workflow for reimagining movie posters using a Python script, a vision-language model (qwen3-vl-8b), and the Z-Image generation model.
Why it matters
This workflow shows how a vision-language model and an image generation model can be chained to reimagine existing visual content: the first turns an image into a detailed text prompt, and the second renders a new interpretation from that prompt.
Key Points
1. Uses a Python script to scan a directory of movie posters and generate detailed descriptions using a vision-language model (a minimal sketch follows this list)
2. Passes the descriptions to the Z-Image model to generate reimagined versions of the posters
3. Includes tips for improving the results, such as telling the vision-language model to name each character and using a specific KSampler configuration
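As a rough illustration of the first two steps, here is a minimal sketch in Python. It assumes qwen3-vl-8b is served locally behind an OpenAI-compatible chat endpoint; the URL, model name, directory layout, and prompt wording are all placeholder assumptions, not taken from the author's actual script:

```python
import base64
from pathlib import Path

import requests

# Assumption: qwen3-vl-8b is served locally via an OpenAI-compatible
# API (e.g. LM Studio or a llama.cpp server); URL and model name are
# placeholders, not the author's actual setup.
VLM_URL = "http://localhost:1234/v1/chat/completions"
VLM_MODEL = "qwen3-vl-8b"

# Per the author's tip: asking the model to name each character
# helps avoid duplicate faces in the generated image.
PROMPT = (
    "Describe this movie poster in detail for an image generation "
    "model. Name each character in the scene individually."
)

def describe_image(path: Path) -> str:
    """Send one poster to the vision-language model and return its description."""
    b64 = base64.b64encode(path.read_bytes()).decode("ascii")
    payload = {
        "model": VLM_MODEL,
        "messages": [{
            "role": "user",
            "content": [
                {"type": "text", "text": PROMPT},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{b64}"}},
            ],
        }],
    }
    resp = requests.post(VLM_URL, json=payload, timeout=300)
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]

if __name__ == "__main__":
    for poster in sorted(Path("posters").glob("*.jpg")):
        description = describe_image(poster)
        # Save the description next to the poster for the generation step.
        poster.with_suffix(".txt").write_text(description)
        print(f"{poster.name}: {description[:80]}...")
```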
Details
The 'reimagine' workflow uses a Python script to scan a directory of movie posters (or any other images), generate a detailed description of each image with a vision-language model (qwen3-vl-8b), and pass that description to the Z-Image generation model to create a reimagined version of the poster. The author found that telling the vision-language model to name each character in the scene helps avoid duplicate faces, and that a KSampler configuration with 0.6 denoise and 2x contrast yields more variety from the image model. The author also chose not to use face detailers or upscalers, since they tend to increase skin noise in the generated images.
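For the generation step, one hedged sketch of how the saved descriptions could be handed to a ComfyUI KSampler with the 0.6 denoise the author mentions: export a Z-Image workflow in API format, then patch the prompt text and denoise before queuing it. The node IDs and filenames below are assumptions that must match your own export, and the 2x contrast setting (presumably a separate sampler or post-processing node) is not shown:

```python
import json
from pathlib import Path

import requests

# Assumption: a local ComfyUI instance; workflow.json is a Z-Image
# workflow exported via "Save (API Format)". Node IDs are placeholders.
COMFY_URL = "http://127.0.0.1:8188/prompt"
POSITIVE_NODE = "6"   # CLIPTextEncode node holding the positive prompt
SAMPLER_NODE = "3"    # KSampler node

workflow = json.loads(Path("workflow.json").read_text())

for txt in sorted(Path("posters").glob("*.txt")):
    description = txt.read_text()
    # Inject the VLM's description and the author's 0.6 denoise setting.
    workflow[POSITIVE_NODE]["inputs"]["text"] = description
    workflow[SAMPLER_NODE]["inputs"]["denoise"] = 0.6
    requests.post(COMFY_URL, json={"prompt": workflow}).raise_for_status()
    print(f"queued reimagining for {txt.stem}")
```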