FLUX.2 [klein] 9B vs GPT Image 1 Mini
Head-to-head across 3 challenges
FLUX.2 [klein] 9B: 33.3% win rate
Ties: 33.3%
GPT Image 1 Mini: 33.3% win rate
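As a rough sketch of the arithmetic behind a three-way split like this, the Python below computes win and tie rates from a list of per-challenge outcomes. The `tally_rates` helper and the `outcomes` list are hypothetical, chosen only to show how one win for each model plus one tie over three challenges gives 33.3% each. They are not taken from the page's scoring system.

```python
from collections import Counter


def tally_rates(outcomes):
    """Return each outcome label's share of the total, as a percentage
    rounded to one decimal place."""
    if not outcomes:
        return {}
    counts = Counter(outcomes)
    total = len(outcomes)
    return {label: round(100 * n / total, 1) for label, n in counts.items()}


# Hypothetical labels for illustration: one win per model and one tie.
outcomes = ["FLUX.2 [klein] 9B", "tie", "GPT Image 1 Mini"]
rates = tally_rates(outcomes)
print(rates)
```

With three challenges, each single outcome contributes one third of the total, which is why the displayed rates can only move in steps of about 33.3 percentage points.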
Challenge Results
Chalkboard Menu
Text-to-Image: “Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
FLUX.2 [klein] 9B
- + Excellent chalk texture and natural smudging on the board
- + Strong cursive title style as requested
- + Includes background café elements for better context
- − Several spelling errors including 'Riott', 'Risoto', 'Octoopus', and 'fress'
- − Repeated price tag on the second item creates visual clutter
GPT Image 1 Mini
- + Perfect spelling on all menu items
- + Clean and legible layout
- + Consistent chalk texture across all text
- − Failed to provide the requested 'elegant cursive' for the title
- − Text feels a bit too uniform, bordering on a digital font look in some areas
Verdict: GPT Image 1 Mini is the clear winner for its perfect spelling and adherence to the menu content, the area where FLUX.2 [klein] 9B fails critically. While FLUX.2 [klein] 9B captures a more authentic handwritten feel and better environmental context, its distracting spelling errors and repeated price make it less useful as a final output.
Pose & Character Mashup
Editing: “Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”
AI Judge Analysis
FLUX.2 [klein] 9B
- + Successfully replicates the complex crossed-leg pose from Image 1.
- + Excellent character preservation, capturing the face, scarf, and sunglasses from Image 2 accurately.
- + Perfectly matches the lighting and monochromatic yellow background of the source environment.
- − The left hand is rendered as a clenched fist instead of the open-fingered pose shown in Image 1.
- − The toes on the right foot are slightly distorted.
GPT Image 1 Mini
- + Cleverly integrates the character elements (scarf, sunglasses) onto the new body.
- + Good skin tone and facial feature matching from Image 2.
- − Fails the primary instruction by completely ignoring the crossed-leg pose from Image 1.
- − Lower hand position and finger count are anatomically flawed.
- − The composition is cropped tighter, losing the full vertical dynamic of the requested pose.
Verdict: FLUX.2 [klein] 9B followed the prompt with high precision, mapping the complex pose from Image 1 onto the character from Image 2 while maintaining perfect environmental consistency. GPT Image 1 Mini failed to replicate the specific leg positioning, resulting in a generic pose that did not fulfill the 'exact pose reference' requirement. Because FLUX.2 [klein] 9B handled both character consistency and geometric pose replication effectively, it is the clear winner.
Outfit Transfer Challenge
Editing: “Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”
AI Judge Analysis
FLUX.2 [klein] 9B
- + Excellent preservation of the subject's face, hair, and unique skin patterns.
- + Accurately recreates the plaid scarf pattern and navy peacoat from the reference.
- + Successfully integrates the clothing onto the original body pose.
- − Added several gold chains that were not present in the reference outfit image.
GPT Image 1 Mini
- + Successfully applies the requested outfit including the coat, scarf, and jeans.
- + Matches the hand/watch positioning well with the style of Image 2.
- − Fails to keep the person's face and hair 'completely unchanged', altering the subject's features significantly.
- − Does not preserve the specific vitiligo/marking pattern from the original person.
- − The scarf pattern is simplified compared to the source.
Verdict: FLUX.2 [klein] 9B followed the strict preservation instructions much better, maintaining the original person's identity, hair, and specific skin details perfectly while adding the new clothes. GPT Image 1 Mini failed at source preservation, essentially generating a new person who resembles the source but lacks his specific features and hair. Although FLUX.2 [klein] 9B added extra jewelry, it is the superior edit for maintaining the base image's integrity.
FLUX.2 [klein] 9B
Black Forest Labs' distilled 9-billion-parameter image generation model with sub-second inference and multi-reference support
GPT Image 1 Mini
OpenAI's cost-effective image generation model for when image quality isn't the top priority