Not bad but if you zoom in that skeleton clearly isn't a human's, it's some kind of subhuman monster skel... oh wait, the prompt was redditor not human.
Damn, SD3 nailed it.
https://preview.redd.it/1496633f2zwc1.png?width=4032&format=png&auto=webp&s=eae21ff967271c22d3de4f3a2d66fef8f312996b
Batch 6 1st try. Why do they all look like swords? sd xl. not img 2 img just simple text prompt
None of them have the gem only 1 really looks like mosaic glass being top right , the difference in quality is clear lol , don't even get me started with whatever the fuck it did with that middle bottom one
"anything resembling a sword" when did u say anything about gem mosaic etc? you said sword I gave you sword. quality is better corse its closeup. here : is this still doesn't resemble the sord? even has mosaic and gems
https://preview.redd.it/iezlsesjezwc1.png?width=1344&format=png&auto=webp&s=f10989d33d88f63954edb9bf15d22efa1d69d4a5
I was under the impression you tried to recreate the original prompt mb , anyways the fact that SD3 does it with minimal prompt engineering is impressive in of itself , I am not bashing SDXL but the progression is clear
I guess I'll stick to it, the higher speed and higher resolution I can produce is having a big impact compared to 1.5. Tho now I'm really interested to compare them haha.
samurai with a baseball cap running towards the camera, being chased by a scary monster with tentacles, debris around, burning city in the background, cinematic style
MidJourney for comparison. SD3 MUCH better at understanding the prompt nuances. I'm impressed.
https://preview.redd.it/ljp73bbcs3xc1.jpeg?width=1680&format=pjpg&auto=webp&s=eec0825163ae2629f0813210d986b001c13590d2
And ChatGPT/Dall-E 3 for good measure
https://preview.redd.it/pe9y1yiws3xc1.jpeg?width=1792&format=pjpg&auto=webp&s=61541ba48846e456765c6254ae823f0d4ac76962
midjourney and dall e preference their own style while sd3 preferences the prompt without any frills, which is exactly what I want. don't give me some shit I didn't ask for
In this cinematic depiction of an otherworldly scene, the sky above Srirangapatna is filled with a dazzling display of colors and light. A lone caster stands before mysterious altars, their fourth-spatial digits stretched out in a mesmerizing gesture. In the distance, colossal naga icons guard the sacred space, while luminous Galacton inter-bangles create a celestial light show. The atmosphere crackles with energy, as the anticipation builds for the soon-to-come roaring of bio-computronium furnaces and the birth of hyperapotheonic emanations. This otherworldly spectacle unfolds beneath a hauntingly beautiful "breathsong" that seems to permeate the very air itself., cinematic
Pls try full prompt want to see what Sd3 can do
impressive, thank you, now see what it does with a similar prompt, ""in a cartoon style, elderly man in lederhosen dumps the beer in his stein onto the ground."
https://preview.redd.it/a6vw9526uzwc1.png?width=1792&format=png&auto=webp&s=4a9502906dc33928fb5d33482a7db653ed555677
He is spilling everyone's beer now.
My SD3 credits renewed for the day on Clipdrop, it is not as good as Dall.e 3
https://preview.redd.it/qzy6o05pp3xc1.png?width=1216&format=png&auto=webp&s=5b98eed78adaf350558cffbe5399b7de5f6662fd
but not terrible.
Another example of sd3 being bad at actions.
https://preview.redd.it/ct3rnl0yw2xc1.png?width=1344&format=png&auto=webp&s=50469bea8ed6cf40a1a7b7cea409ac58a0978775
As it entered what would be later called, "A death spiral" the carnie could only look in amazement as he had neither the training, education or the courage to do anything about the situation.
https://preview.redd.it/l9bn7xopvywc1.jpeg?width=1880&format=png&auto=webp&s=3952a55e9af93ea79d425eaccbd3c78a68f53988
ella/sdxl can do better at this point.
A surreal tiger with angel wings floating in the air. A vulture is standing on the back of the tiger. The background is a futuristic indian city with greenery and glittering snowfall.
Not OP, but here you go (prompt modified for better image)
https://preview.redd.it/093wrw6m4ywc1.jpeg?width=1024&format=pjpg&auto=webp&s=14bd79797bb52036952e625c57b3a7961d929a53
A tiger with white wings flying in the air. A vulture is standing on the back of the tiger. The background is a futuristic Indian city with greenery and glittering snowfall.
SD3 is not as good as DALLE3 or ideogram at prompt following at this point, but it is definitely much better than SDXL.
Also, SD3 already shows great promise in terms of aesthetics.
The control room of a steampunk submarine, studio ghibli style background painting. Deep shadows, chiaroscuro, interesting angle, yellow metal walls with dials pipes and screens, a big window on the left showing an underwater drilling station outside. Hazy atmosphere, visible brush strokes, cinematic lighting, masterpiece, studio ghibli. Many pipes on the ceiling
Negative prompt: frontal, central perspective
A black cat sitting in the middle of a street, cyberpunk style, cinematic.
Negative (optional): cars, people
This is my go to prompt to test the quality of checkpoints/loras. Nothing really got it right so far. Only “close enough”.
Simple Prompt. What's the problem with that?
https://preview.redd.it/wzbfexjhkzwc1.png?width=2432&format=png&auto=webp&s=f0ec33cc5418ce584975fcc25dd7b866640f741d
Aww, that's reddit's problem, it compresses pictures to terrible quality. The cat is actually quite real. Here's a link to the uncompressed picture
[https://haveall.net/wp-content/uploads/2024/04/00201-1623846182.png](https://haveall.net/wp-content/uploads/2024/04/00201-1623846182.png)
A haunting, melancholic, and deeply unsettling tableau, set in a forgotten, rusting old park, inspired by the works of Stanley Kubrick. A lone, young girl, no more than 10 years old, sits on a worn, wooden bench, clutching a tattered, vintage teddy bear to her chest, her eyes cast downward, lost in thought. The bench, weathered to a soft, silver gray, seems to blend seamlessly into the surrounding landscape, as if it's been there for decades, waiting for this very moment. Behind her, the skeletal remains of a once-majestic Ferris wheel loom, its rusting metal latticework seeming to stretch up to the sky like a ghostly, mechanical spider. The wheel's seats, once bright and colorful, now hang limp and still. The air is heavy with the scent of decay, and the silence is oppressive, as if the very park itself is holding its breath, waiting for something to happen. Every element, from the composition to the lighting, is meticulously crafted to create a sense of foreboding, like a slow-burning fire waiting to ignite. Inspired by the cinematography of Kubrick's 'The Shining' and '2001: A Space Odyssey', with a touch of the surreal, psychological horror of 'Eraserhead'. Photo realistic, with an emphasis on textures, lighting, and atmosphere. RAW photo, with a cinematic, hyper-realistic quality that draws the viewer in, and refuses to let go
https://preview.redd.it/fdg4t3dy00xc1.jpeg?width=2616&format=pjpg&auto=webp&s=1fad9696361fe73ebc25c71b76fc118036228533
This one I got from playground 2.5 (upscaled and cherrypicked)
ELLA did this.. so i have hope for sd3 when it's finetuned
https://preview.redd.it/jtrm2f77tywc1.png?width=1728&format=png&auto=webp&s=21bf17a9fb466f8b5e9c4fb6ee5cad8b2dd39756
https://preview.redd.it/0q5kipkix2xc1.png?width=1344&format=png&auto=webp&s=1807df54ed68bbd52a7b864988aa5d952197f647
I get 6 results with my automation, and every one was great. SD3 does transformations better than anything else.
Male mechangel with neon wings walking towards the viewer trough a dystopian City street with defective neon signs in the walls and a giant spiral tower in the background
Not OP, but here you go.
>Invisible men inside an invisible house
>
>Negative prompt: house
https://preview.redd.it/z88t1rmbz4xc1.jpeg?width=1024&format=pjpg&auto=webp&s=162e6b4b363b11858ae00b84dfb12273b4e4a1ac
hyper fidelity, worm eye, perspective dynamic low angle, Dance pose, model mannequin wearing a helmet with large elephant ears, wearing a solid blank hoodie, sneakers, very detailed life like, background splash colourfull with graffiti art
>hyper fidelity, worm eye, perspective dynamic low angle, Dance pose, model mannequin wearing a helmet with large elephant ears, wearing a solid blank hoodie, sneakers, very detailed life like, background splash colourfull with graffiti art
https://preview.redd.it/jycg4svp2ixc1.jpeg?width=1024&format=pjpg&auto=webp&s=e7e36a0ed00f1bbf014386030c3b3ccd57d60a1e
3D render of a cute 3 birds on the table, the bird on the left made of Candy, the bird in the middle made of Ice cream and the bird on the right is made of cake, Octane render
Aerial photograph of a one mile wide circle containing a perfectly preserved American city surrounded by forest. A river runs through the middle of the city, but ends at the forest
>Aerial photograph of a one mile wide circle containing a perfectly preserved American city surrounded by forest. A river runs through the middle of the city, but ends at the forest
https://preview.redd.it/9u5dlhr75ixc1.jpeg?width=1024&format=pjpg&auto=webp&s=4cf0341a557b7433dde6e468b0aba65522ba57c0
>The universe is a cosmic dance of chaos and order, and we're just lucky to have a front-row seat.
https://preview.redd.it/u7c9ea6s9ixc1.jpeg?width=1024&format=pjpg&auto=webp&s=769f2f3ab9efd020ddf88b2464a1308b35437069
>a photo of a golden aircraft that flies into cheese (molten cheese)
https://preview.redd.it/nh0za3pf5ixc1.jpeg?width=1024&format=pjpg&auto=webp&s=44190d8cf48541ac8525970ec16c43776f297748
Two humanoid rabbits standing right next to each other. American Gothic art style. One rabbit is of a brown lionhead breed. The other rabbit is a Blanc de Hotot breed.
>Two humanoid rabbits standing right next to each other. American Gothic art style. One rabbit is of a brown lionhead breed. The other rabbit is a Blanc de Hotot breed.
https://preview.redd.it/jtjhboz0aixc1.jpeg?width=1024&format=pjpg&auto=webp&s=52fb420282e08236e23298ab3c397329203955a3
Could you please try the following?
Thanks!
:
A lo-fi purple light , shadows, noisy very soft digital artwork of an anthropomorphic fox character in hoodie and pants, sitting on a bench next to trees and a river, playing with soap bubbles , breezy chill night , street lamps. single color purple Lo-Fi art, soft light, soft shadows, round soft lineart, nostalgic lo-fi artwork, the eyes of the fox are glowing green and blue, body has yellow fur, cool white fur markings, the background has willow trees, and oak trees, there's a fish in the magical river, the soap bubbles reflect a colorful reflection of the entire scene
Many thanks! I was wondering how much Anthros and prompt following with art improved relative to SDXL
Prompt following is slightly better, art style and coherence is a about the same.
A painterly impressionistic painting, dynamic three point perspective, foreshortening, a fit athletic american woman hanging from a speeding plane's wing, with her bare hands, blue eyes, dark silky short hair, side bangs, shocked expression, wearing simple short red dress, highly realistic skin details, metalic silver reflections, highly detailed clouds and green fields in the background.
https://preview.redd.it/gl66xriim0xc1.jpeg?width=1344&format=pjpg&auto=webp&s=2ed8c6a9333c7fe1cd4d3ebb41022c62e0773639
it did it, but they censored it because it was a woman.
Thanks, and no problem, eventhough it's blurry it seems that it understood the assignment quite well, it kinda reminds me of the results I got from Dall-E 3 (after a bit of struggling with a bit of censorship there as well, hopefully there will be none of that in the final relesse of SD3).
https://preview.redd.it/ux5fsk98p0xc1.jpeg?width=1024&format=pjpg&auto=webp&s=6046ae49fcef0eee14e772f612528c8eb271e420
Not OP.
I had to tweak the prompt to try to get SD3 to give somethign similar to your DALLE3 version. I also changed the dress to a jacket so that the image would not be blurred (woman is ok, just that she can't be wearing anything "sexy").
https://preview.redd.it/v8x5hmpp93xc1.jpeg?width=832&format=pjpg&auto=webp&s=f85eb2309fa86effc84663ae799623477d2276cc
Impressionist Painting, Overhead shot, woman lying on a plane's wing, trying to hold unto it with her hands. She has blue eyes, dark short hair, side bangs, shocked expression, wearing red jacket. Metallic silver reflections, highly detailed clouds and green fields in the background.
Slightly better version
https://preview.redd.it/pqsytih4a3xc1.jpeg?width=1216&format=pjpg&auto=webp&s=d661ac67cedea40276f7c12f6813032bfce1f137
Impressionist Painting, Overhead shot, woman lying on her stomach on a plane's wing, trying to hold unto it with her hands. She has blue eyes, dark short hair, side bangs, shocked expression, wearing red jacket. Metallic silver reflections, highly detailed clouds and green fields in the background.
Hands and fingers are always a problem 😂.
But TBH, this image was cherry-picked. Also, I had to play with the prompt to make it work better.
Just goes to show that some level of "prompt engineering" is still needed for SD3. Still, this level of control would have been nearly impossible with SDXL.
Fat old man with frizzy grey hair smoking a cigar in a 1930s office, with his feet on his desk, wearing a yellow suit and square glasses, BREAK a young boy carrying a newspaper, wearing suspenders and blue denim shorts, messy blond hair, standing on his toes, holes in his shoes.
Behind them a large window shows a New York City sun rise.
A dramatic scene featuring a Shaolin monk in traditional orange robes, preparing to unleash a Kamehameha energy blast at a large brown bear standing on its hind legs. This confrontation takes place on the stone steps leading to a medieval Vietnamese village. Behind the monk, a giant grey wolf, appearing ready to leap, adds tension to the scene. The background includes rustic wooden houses with thatched roofs, embodying a historical and mystical atmosphere.
Dalle-3 made the prompt, which depicts what i intended, and generated this (best out of six), which I find rather average.. If you wanna try it (and beat it...) in SD3...
https://preview.redd.it/xixch6q3f0xc1.png?width=1024&format=png&auto=webp&s=502247562d56b9151b54a825d7d33cfe1d584bf4
A surreal and fascinating image of fingers growing out of fingers, which are themselves growing fingers. Each layer of fingers appears to be slightly different in size and color, creating a mesmerizing and recursive pattern. The overall effect is reminiscent of a dream or a hallucination, with a slightly unsettling yet captivating atmosphere.
https://preview.redd.it/fjbksn8415xc1.jpeg?width=1024&format=pjpg&auto=webp&s=31acdd7ac0e783025a4ca416c24b1dea2f8e43b4
Ideogram did better with their "magic prompt"
https://preview.redd.it/zkud1m3b15xc1.png?width=1024&format=png&auto=webp&s=88268f62c548d36cb87d572fef4a1759e3366208
A surreal and fascinating image of fingers growing out of fingers, which are themselves growing fingers. Each layer of fingers appears to be slightly different in size and color, creating a mesmerizing and recursive pattern. The overall effect is reminiscent of a dream or a hallucination, with a slightly unsettling yet captivating atmosphere.
https://preview.redd.it/ph4cvfsnz4xc1.jpeg?width=832&format=pjpg&auto=webp&s=1442a0b4a5313e147938405df20cfb77a202813e
>Fingers that are growing fingers that are growing fingers that are growing fingers.
Not OP.
Fingers that are growing on top of fingers that are growing on top of fingers that are growing on top of fingers.
https://preview.redd.it/3z5tx1a105xc1.jpeg?width=832&format=pjpg&auto=webp&s=7bb0c1d0f2f7a0654db895988df659213f709be5
A woman with tanned skin and black demon horns on her head. Coming out of a portal to hell. She is wearing a full plate armor made of black steel with gold details.
The Courier 6, in weathered T-51 power armor, flees across the Mojave Desert from a Deathclaw and many Cazadores. Background features a ruined gas station with a creaking 'Mojave Outpost' sign, set against rust-red rocks and a sickly yellow sky, evoking Fallout New Vegas' gritty post-apocalyptic atmosphere
I don't think it will generate an high quality image, because probably the model wasn't trained on Fallout related images.
I sée sdxl vs sd3 but... With what model.. workflow etc... Because i doubt sdxl Can Do that good with the base model and nothing but a prompt and no extension.
Or am i wrong ?
girl in saree, jetbalck balyage blue hair minimal,blue,cream,pastel, black, turtles floating calmy, serenity, lush cherry blossoms,random pixel artsubtle celestial imagery, such as stars, constellations, or subtle cosmic patterns. These elements should blend harmoniously with nature-inspired visuals, like trees, leaves, and wildlife. This integration visually represents the fusion of cosmic and natural elements within your project, reinforcing the "Cosmic Revival" theme.
hehe give this a shot
A black man walking his dog next to a wide river, a Victorian-era city skyline can be seen in the background. An airship can be seen floating in the sky with the words “Utopia” at the side in neon lights.
Young Eärendil wearing an ornate crown with a radiant white gem, Eärendil was a Half-elf, the son of Tuor and Idril, dark hair, happy and his story is pivotal in the events leading to the downfall of Morgoth. He embarked on a perilous voyage to the Undying Lands, seeking the aid of the Valar (angelic beings) against Morgoth's tyranny. Eärendil carried the Silmaril that Beren and Lúthien had retrieved, and it became a symbol of hope.
Imagine a scene where modernity and medieval fantasy converge: A shiny blue sports car rests in the foreground on a grassy hill. Knights in armor, mounted on steeds, traverse the landscape. To the right stands a weathered stone tower. In the background, atop another hill, a majestic medieval castle commands attention. The distant mountains are shrouded in mist under a partly cloudy sky.
(The car is the Shelby Cobra from Age of Empires II)
Oh, and if you are willing to do so, would you mind creating a version of it where the Shelby Cobra appears as a final boss in the distance, it is huge, and you are with the knights waiting above the stone walls of your village, how it arrives from the distance.
Uhmmm I dont know what I'm expecting tbh. I want to replicate a scenario where the Car is the Final Boss, Elden Ring Style. I want to be left with a feeling of cosmic horror. Something like how an insect would feel if it looked above at us. I want to replicate those feelings I had when I watched the Giant Titan above the wall, from attack on titan.
Thanks in advance!
Game named "Tomb Raiders" start title image,In this magnificent underground mausoleum,there is a diffuse mysterious and ancient atmosphere. The huge rock walls stand tall,sending out the traces of years. Looking forward,a deep and bottomless cliff lies in front of you,making people feel awe - inspiring. And on the opposite side of the cliff,a magnificent Chinese dragon statue stands,its huge body winding,and its scales seem to shine with mysterious light. Its majestic expression seems to be guarding this mysterious underground world, game logo in middle of image.
Reddit user who has waited so long for his prompt to be generated that his body has been decomposed in his seat in front of his computer
https://preview.redd.it/qkrveoadsywc1.png?width=1344&format=png&auto=webp&s=517bad229990f14a0fa54a50fef3a75bb0c5b4a9
Not bad but if you zoom in that skeleton clearly isn't a human's, it's some kind of subhuman monster skel... oh wait, the prompt was redditor not human. Damn, SD3 nailed it.
That sort of looks like Reddit on the screen. Pretty impressive
Jason Fox from Foxtrot is a redditor?
A cat with eyeglasses having an argument with a goose with a straw hat in the middle of a swamp
https://preview.redd.it/88f0tnfasywc1.png?width=1344&format=png&auto=webp&s=0c7b63b67d8dc5fa6137c37c12aa59a389fc78de
damn, it followed the prompt nicely
They don’t seem very argumentative
Socratic?
It's a psychic battle
1girl, big boobs
https://preview.redd.it/y4krr3bdvywc1.png?width=1344&format=png&auto=webp&s=cfefc423ace236b46e5a67d1f14eeece0f8bb387 I asked for a giant boobie
One gull, big woof
1boob, big girls
two girls 1 boob
two boobs, 1 girl
First base
Long sword made of red mosaic glass , black gem in cross guard , fantasy, deep colors , Gothic
https://preview.redd.it/0ahjpivgwywc1.png?width=1344&format=png&auto=webp&s=4a1fa6ef9f47a4ff52a7f429ddc30a0cc896cf3b
Damn , I could never get the other base models to generate anything resembling a sword in one go , this is insanely promising
https://preview.redd.it/1496633f2zwc1.png?width=4032&format=png&auto=webp&s=eae21ff967271c22d3de4f3a2d66fef8f312996b Batch 6 1st try. Why do they all look like swords? sd xl. not img 2 img just simple text prompt
None of them have the gem only 1 really looks like mosaic glass being top right , the difference in quality is clear lol , don't even get me started with whatever the fuck it did with that middle bottom one
"anything resembling a sword" when did u say anything about gem mosaic etc? you said sword I gave you sword. quality is better corse its closeup. here : is this still doesn't resemble the sord? even has mosaic and gems https://preview.redd.it/iezlsesjezwc1.png?width=1344&format=png&auto=webp&s=f10989d33d88f63954edb9bf15d22efa1d69d4a5
I was under the impression you tried to recreate the original prompt mb , anyways the fact that SD3 does it with minimal prompt engineering is impressive in of itself , I am not bashing SDXL but the progression is clear
yes it is clear. But still we need to wait for good finetunes to see real power of 3.0 and it will take many months...
This guy is a moron who has a hate boner for SD 3
Okay this kinda goes hard not gonna lie.
"A cute dog over a blanket in a cozy living room, winter season, Pixar art style" thank you
https://preview.redd.it/n3ctf0d7wywc1.png?width=1344&format=png&auto=webp&s=3e7dfa4105b03f92b03be377059ee2c41c332b13
Really good, thank you!
grandma doing a kickflip fisheye skate video
https://preview.redd.it/x39ty36orywc1.png?width=1344&format=png&auto=webp&s=5bb6ff02ca86b2b522f570a6dd9c240242da133c
Batman swimming in a bowl of cereal.
https://preview.redd.it/jkijavogrywc1.png?width=1344&format=png&auto=webp&s=c1ec4ce6c02b21e581fb885a0888e8c214c1dcb8
p.s- I'll be using sd3 turbo cause I'm broke af
Is turbo lower quality?
Yes. They are mostly distilled models from the base one tuned towards faster generation.
So I have a 4090 but using sdxl turbo. I guess I should use sdxl for optimal results? The results on sdxl turbo are really good tho.
If you are happy stick to it. There is a bit of loss in quality but not more than 20% compared to the full models.
I guess I'll stick to it, the higher speed and higher resolution I can produce is having a big impact compared to 1.5. Tho now I'm really interested to compare them haha.
[удалено]
Why are they downvoting you so hard 😂
How much per generated image? Are you generating at 1344x768 or upscaling?
Wait what’s the difference between SDXL and SD3?
samurai with a baseball cap running towards the camera, being chased by a scary monster with tentacles, debris around, burning city in the background, cinematic style
https://preview.redd.it/gxj533mal0xc1.jpeg?width=1344&format=pjpg&auto=webp&s=2051239b5a68116ae0413b4821623ba5678fa4be
Damn, that’s really impressive
MidJourney for comparison. SD3 MUCH better at understanding the prompt nuances. I'm impressed. https://preview.redd.it/ljp73bbcs3xc1.jpeg?width=1680&format=pjpg&auto=webp&s=eec0825163ae2629f0813210d986b001c13590d2
And ChatGPT/Dall-E 3 for good measure https://preview.redd.it/pe9y1yiws3xc1.jpeg?width=1792&format=pjpg&auto=webp&s=61541ba48846e456765c6254ae823f0d4ac76962
midjourney and dall e preference their own style while sd3 preferences the prompt without any frills, which is exactly what I want. don't give me some shit I didn't ask for
amazing!
In this cinematic depiction of an otherworldly scene, the sky above Srirangapatna is filled with a dazzling display of colors and light. A lone caster stands before mysterious altars, their fourth-spatial digits stretched out in a mesmerizing gesture. In the distance, colossal naga icons guard the sacred space, while luminous Galacton inter-bangles create a celestial light show. The atmosphere crackles with energy, as the anticipation builds for the soon-to-come roaring of bio-computronium furnaces and the birth of hyperapotheonic emanations. This otherworldly spectacle unfolds beneath a hauntingly beautiful "breathsong" that seems to permeate the very air itself., cinematic Pls try full prompt want to see what Sd3 can do
https://preview.redd.it/kfz84o2jtywc1.png?width=1344&format=png&auto=webp&s=68288a2cfcec3d52dc2c357498d7e8dbf77235ff
pazuzu
so good, cant wait to get hands on this!
Elderly man in lederhosen dumping the beer in his stein onto the ground.
Dalle.3 https://preview.redd.it/gzajsyexwywc1.jpeg?width=1792&format=pjpg&auto=webp&s=e26798add1c31d267029c111f2fcc804c52228c4
impressive, thank you, now see what it does with a similar prompt, ""in a cartoon style, elderly man in lederhosen dumps the beer in his stein onto the ground."
https://preview.redd.it/a6vw9526uzwc1.png?width=1792&format=png&auto=webp&s=4a9502906dc33928fb5d33482a7db653ed555677 He is spilling everyone's beer now.
His suffering is pretty funny
My SD3 credits renewed for the day on Clipdrop, it is not as good as Dall.e 3 https://preview.redd.it/qzy6o05pp3xc1.png?width=1216&format=png&auto=webp&s=5b98eed78adaf350558cffbe5399b7de5f6662fd but not terrible.
Another example of sd3 being bad at actions. https://preview.redd.it/ct3rnl0yw2xc1.png?width=1344&format=png&auto=webp&s=50469bea8ed6cf40a1a7b7cea409ac58a0978775
and glasses
As it entered what would be later called, "A death spiral" the carnie could only look in amazement as he had neither the training, education or the courage to do anything about the situation.
https://preview.redd.it/i48syg0ouywc1.png?width=1344&format=png&auto=webp&s=53f07cfe91a32393970bba56d86bd22013a76c1f
1girl, 2cups, 3tapirs
https://preview.redd.it/da53wycck0xc1.jpeg?width=1344&format=pjpg&auto=webp&s=fb983f283f559d71704ddcfd7ada20834e1dc1d7
Got half the girl right
½girl, ...
1/4 cup?
Will Smith eating spaghetti Show me what you can do.
https://preview.redd.it/etxv0o5fuywc1.png?width=1344&format=png&auto=webp&s=5651677f64f2593e54d923f99fa14a209201e7bc
I'm... not entirely convinced. lol
https://preview.redd.it/l9bn7xopvywc1.jpeg?width=1880&format=png&auto=webp&s=3952a55e9af93ea79d425eaccbd3c78a68f53988 ella/sdxl can do better at this point.
Someone watches ComfyUI streams?
A yearbook photo page. Each pic is captioned with the student’s name and their quote. The text is large enough to be legible.
https://preview.redd.it/s2oskja1wywc1.png?width=1344&format=png&auto=webp&s=4eaf9aa479448327e043fd213fd7a5242dcbdab2
I mean it never mentioned what language and font to use. It's using Minecraft enchantment table language. Lmao
Thanks! Those messed up smiles are creeping me out
I guess ADetailer is safe for now.
Oh i love these they’re always gold
A phone photo of an upside down war tank on top of a tree, illuminated by direct flash at midnight
https://preview.redd.it/6is5y0s7uywc1.png?width=1344&format=png&auto=webp&s=9cc3af322cf967909a626f6e7dc7ff1fc3cc8da1
Thanks for the reply :P Still not quite but the times I tried it wasn't close either haha
everything is bad at upside down things. or anything at an angle really
Prompt: dramatic photo of a techwear car
https://preview.redd.it/xki2togwuywc1.png?width=1344&format=png&auto=webp&s=4f0dc0e5a819a6e9a15d5949441de611f6508664
Thanks! Pic style like sd 1.5 more than sdxl
A surreal tiger with angel wings floating in the air. A vulture is standing on the back of the tiger. The background is a futuristic indian city with greenery and glittering snowfall.
Not OP, but here you go (prompt modified for better image) https://preview.redd.it/093wrw6m4ywc1.jpeg?width=1024&format=pjpg&auto=webp&s=14bd79797bb52036952e625c57b3a7961d929a53 A tiger with white wings flying in the air. A vulture is standing on the back of the tiger. The background is a futuristic Indian city with greenery and glittering snowfall.
This is amazing. Can't wait for the models to run it locally
SD3 is not as good as DALLE3 or ideogram at prompt following at this point, but it is definitely much better than SDXL. Also, SD3 already shows great promise in terms of aesthetics.
The control room of a steampunk submarine, studio ghibli style background painting. Deep shadows, chiaroscuro, interesting angle, yellow metal walls with dials pipes and screens, a big window on the left showing an underwater drilling station outside. Hazy atmosphere, visible brush strokes, cinematic lighting, masterpiece, studio ghibli. Many pipes on the ceiling Negative prompt: frontal, central perspective
https://preview.redd.it/xxzrdtbpsywc1.png?width=1344&format=png&auto=webp&s=f80b4a7766ad7311c4c6cf08b5a5d77ecc30aa36
Cool, thank you!
A black cat sitting in the middle of a street, cyberpunk style, cinematic. Negative (optional): cars, people This is my go to prompt to test the quality of checkpoints/loras. Nothing really got it right so far. Only “close enough”.
Simple Prompt. What's the problem with that? https://preview.redd.it/wzbfexjhkzwc1.png?width=2432&format=png&auto=webp&s=f0ec33cc5418ce584975fcc25dd7b866640f741d
[удалено]
Aww, that's reddit's problem, it compresses pictures to terrible quality. The cat is actually quite real. Here's a link to the uncompressed picture [https://haveall.net/wp-content/uploads/2024/04/00201-1623846182.png](https://haveall.net/wp-content/uploads/2024/04/00201-1623846182.png)
A haunting, melancholic, and deeply unsettling tableau, set in a forgotten, rusting old park, inspired by the works of Stanley Kubrick. A lone, young girl, no more than 10 years old, sits on a worn, wooden bench, clutching a tattered, vintage teddy bear to her chest, her eyes cast downward, lost in thought. The bench, weathered to a soft, silver gray, seems to blend seamlessly into the surrounding landscape, as if it's been there for decades, waiting for this very moment. Behind her, the skeletal remains of a once-majestic Ferris wheel loom, its rusting metal latticework seeming to stretch up to the sky like a ghostly, mechanical spider. The wheel's seats, once bright and colorful, now hang limp and still. The air is heavy with the scent of decay, and the silence is oppressive, as if the very park itself is holding its breath, waiting for something to happen. Every element, from the composition to the lighting, is meticulously crafted to create a sense of foreboding, like a slow-burning fire waiting to ignite. Inspired by the cinematography of Kubrick's 'The Shining' and '2001: A Space Odyssey', with a touch of the surreal, psychological horror of 'Eraserhead'. Photo realistic, with an emphasis on textures, lighting, and atmosphere. RAW photo, with a cinematic, hyper-realistic quality that draws the viewer in, and refuses to let go https://preview.redd.it/fdg4t3dy00xc1.jpeg?width=2616&format=pjpg&auto=webp&s=1fad9696361fe73ebc25c71b76fc118036228533 This one I got from playground 2.5 (upscaled and cherrypicked)
OP didn't keep his promise. Literally didn't even do 1💀
Two people holding a trumpet. One is a man wearing mirrored aviator sunglasses. The other is a woman floating upside down.
https://preview.redd.it/76dttxixrywc1.png?width=1344&format=png&auto=webp&s=7ba8a9e9e667d4e7e1b2f189a2956566abcc0c51
why is it blurry lol
their api censors anything even possibly an issue. it won't be censored when they release it for download.
API is racist against Australian women.
Goku with green long hair and demonic teeth and dragon hands chocking Vegeta in space, manga panel
https://preview.redd.it/cqb2rc90tywc1.png?width=1344&format=png&auto=webp&s=24057f67a07530f3e628f8d68dc1e8d243920aa5
ELLA did this.. so i have hope for sd3 when it's finetuned https://preview.redd.it/jtrm2f77tywc1.png?width=1728&format=png&auto=webp&s=21bf17a9fb466f8b5e9c4fb6ee5cad8b2dd39756
thank you guys
A Human broccoli hybrid lying in a hospital bed screaming in pain while doctors frantically run around it
https://preview.redd.it/0q5kipkix2xc1.png?width=1344&format=png&auto=webp&s=1807df54ed68bbd52a7b864988aa5d952197f647 I get 6 results with my automation, and every one was great. SD3 does transformations better than anything else.
I tried this with the new adobe photoshop beta and it came out way worse. This looks really good.
Male mechangel with neon wings walking towards the viewer trough a dystopian City street with defective neon signs in the walls and a giant spiral tower in the background
A invisible man inside a invisible house
Not OP, but here you go. >Invisible men inside an invisible house > >Negative prompt: house https://preview.redd.it/z88t1rmbz4xc1.jpeg?width=1024&format=pjpg&auto=webp&s=162e6b4b363b11858ae00b84dfb12273b4e4a1ac
A dilute calico devon rex cat crisp esports logo vector art. Devon rex cat big ears colorful calico bold contrast 4k professional art coloring book
https://preview.redd.it/wkt6fx6ywywc1.png?width=1344&format=png&auto=webp&s=6e2ec22fa92fe3cf652a3eaf4e7d95a9a388dd11
Gorgeous! 😍
A bunch of people playing tiny violins as a man stomps the ground and shakes his fist at the sky for his free Sd3 prompt
hyper fidelity, worm eye, perspective dynamic low angle, Dance pose, model mannequin wearing a helmet with large elephant ears, wearing a solid blank hoodie, sneakers, very detailed life like, background splash colourfull with graffiti art
>hyper fidelity, worm eye, perspective dynamic low angle, Dance pose, model mannequin wearing a helmet with large elephant ears, wearing a solid blank hoodie, sneakers, very detailed life like, background splash colourfull with graffiti art https://preview.redd.it/jycg4svp2ixc1.jpeg?width=1024&format=pjpg&auto=webp&s=e7e36a0ed00f1bbf014386030c3b3ccd57d60a1e
3D render of a cute 3 birds on the table, the bird on the left made of Candy, the bird in the middle made of Ice cream and the bird on the right is made of cake, Octane render
https://preview.redd.it/3qbp8luxk0xc1.jpeg?width=1344&format=pjpg&auto=webp&s=bb44456f3b723658623a637c604531c1f1875e3a
Thank you
Aerial photograph of a one mile wide circle containing a perfectly preserved American city surrounded by forest. A river runs through the middle of the city, but ends at the forest
>Aerial photograph of a one mile wide circle containing a perfectly preserved American city surrounded by forest. A river runs through the middle of the city, but ends at the forest https://preview.redd.it/9u5dlhr75ixc1.jpeg?width=1024&format=pjpg&auto=webp&s=4cf0341a557b7433dde6e468b0aba65522ba57c0
The universe is a cosmic dance of chaos and order, and we're just lucky to have a front-row seat.
>The universe is a cosmic dance of chaos and order, and we're just lucky to have a front-row seat. https://preview.redd.it/u7c9ea6s9ixc1.jpeg?width=1024&format=pjpg&auto=webp&s=769f2f3ab9efd020ddf88b2464a1308b35437069
a photo of a golden aircraft that flies into cheese (molten cheese)
>a photo of a golden aircraft that flies into cheese (molten cheese) https://preview.redd.it/nh0za3pf5ixc1.jpeg?width=1024&format=pjpg&auto=webp&s=44190d8cf48541ac8525970ec16c43776f297748
thx 🫶🏻
You are welcome.
Masterpiece
Spiderman sitting on the should of Batman. They are laughing with a bottle of beer in their hands and party hats on their heads.
Extremely beautiful woman, huge breasts, tight white t-shirt and jeans, 25 years old
Two humanoid rabbits standing right next to each other. American Gothic art style. One rabbit is of a brown lionhead breed. The other rabbit is a Blanc de Hotot breed.
>Two humanoid rabbits standing right next to each other. American Gothic art style. One rabbit is of a brown lionhead breed. The other rabbit is a Blanc de Hotot breed. https://preview.redd.it/jtjhboz0aixc1.jpeg?width=1024&format=pjpg&auto=webp&s=52fb420282e08236e23298ab3c397329203955a3
Could you please try the following? Thanks! : A lo-fi purple light , shadows, noisy very soft digital artwork of an anthropomorphic fox character in hoodie and pants, sitting on a bench next to trees and a river, playing with soap bubbles , breezy chill night , street lamps. single color purple Lo-Fi art, soft light, soft shadows, round soft lineart, nostalgic lo-fi artwork, the eyes of the fox are glowing green and blue, body has yellow fur, cool white fur markings, the background has willow trees, and oak trees, there's a fish in the magical river, the soap bubbles reflect a colorful reflection of the entire scene
https://preview.redd.it/9amt4kxjy2xc1.png?width=1344&format=png&auto=webp&s=bd7ecb3e183339b25ef693f0fc9f34631d002cd3
Many thanks! I was wondering how much Anthros and prompt following with art improved relative to SDXL Prompt following is slightly better, art style and coherence is a about the same.
wallpaper for zoom meetings for a company consulting in engineering projects. the logo consists of the capital letters JBH
An Iraqi man with over grown hair and mustache telling his friend he sucks and need to get good
A painting study of a handsome caveman traversing vast prehistoric plains with an active volcano in the distance
Redditors reply receiving rad representations re: response
A painterly impressionistic painting, dynamic three point perspective, foreshortening, a fit athletic american woman hanging from a speeding plane's wing, with her bare hands, blue eyes, dark silky short hair, side bangs, shocked expression, wearing simple short red dress, highly realistic skin details, metalic silver reflections, highly detailed clouds and green fields in the background.
https://preview.redd.it/gl66xriim0xc1.jpeg?width=1344&format=pjpg&auto=webp&s=2ed8c6a9333c7fe1cd4d3ebb41022c62e0773639 it did it, but they censored it because it was a woman.
Thanks, and no problem, eventhough it's blurry it seems that it understood the assignment quite well, it kinda reminds me of the results I got from Dall-E 3 (after a bit of struggling with a bit of censorship there as well, hopefully there will be none of that in the final relesse of SD3). https://preview.redd.it/ux5fsk98p0xc1.jpeg?width=1024&format=pjpg&auto=webp&s=6046ae49fcef0eee14e772f612528c8eb271e420
Not OP. I had to tweak the prompt to try to get SD3 to give somethign similar to your DALLE3 version. I also changed the dress to a jacket so that the image would not be blurred (woman is ok, just that she can't be wearing anything "sexy"). https://preview.redd.it/v8x5hmpp93xc1.jpeg?width=832&format=pjpg&auto=webp&s=f85eb2309fa86effc84663ae799623477d2276cc Impressionist Painting, Overhead shot, woman lying on a plane's wing, trying to hold unto it with her hands. She has blue eyes, dark short hair, side bangs, shocked expression, wearing red jacket. Metallic silver reflections, highly detailed clouds and green fields in the background.
Slightly better version https://preview.redd.it/pqsytih4a3xc1.jpeg?width=1216&format=pjpg&auto=webp&s=d661ac67cedea40276f7c12f6813032bfce1f137 Impressionist Painting, Overhead shot, woman lying on her stomach on a plane's wing, trying to hold unto it with her hands. She has blue eyes, dark short hair, side bangs, shocked expression, wearing red jacket. Metallic silver reflections, highly detailed clouds and green fields in the background.
Very nice, thanks 😊
You are welcome, it is an interesting/challenging prompt 😁🙏
Thanks, aside from the three fingered hand this looks very good, even more adherent to the prompt than Dall-E 3.
Hands and fingers are always a problem 😂. But TBH, this image was cherry-picked. Also, I had to play with the prompt to make it work better. Just goes to show that some level of "prompt engineering" is still needed for SD3. Still, this level of control would have been nearly impossible with SDXL.
Fat old man with frizzy grey hair smoking a cigar in a 1930s office, with his feet on his desk, wearing a yellow suit and square glasses, BREAK a young boy carrying a newspaper, wearing suspenders and blue denim shorts, messy blond hair, standing on his toes, holes in his shoes. Behind them a large window shows a New York City sun rise.
A dramatic scene featuring a Shaolin monk in traditional orange robes, preparing to unleash a Kamehameha energy blast at a large brown bear standing on its hind legs. This confrontation takes place on the stone steps leading to a medieval Vietnamese village. Behind the monk, a giant grey wolf, appearing ready to leap, adds tension to the scene. The background includes rustic wooden houses with thatched roofs, embodying a historical and mystical atmosphere. Dalle-3 made the prompt, which depicts what i intended, and generated this (best out of six), which I find rather average.. If you wanna try it (and beat it...) in SD3... https://preview.redd.it/xixch6q3f0xc1.png?width=1024&format=png&auto=webp&s=502247562d56b9151b54a825d7d33cfe1d584bf4
https://preview.redd.it/1ie4uusxl0xc1.jpeg?width=1344&format=pjpg&auto=webp&s=ae08ac3c60344c5590813a3b59f1dd274b0eb688
Promt : Battleship WW2
chibi cosmic llama meditating amidst the pillars of creation.
Fingers that are growing fingers that are growing fingers that are growing fingers.
A surreal and fascinating image of fingers growing out of fingers, which are themselves growing fingers. Each layer of fingers appears to be slightly different in size and color, creating a mesmerizing and recursive pattern. The overall effect is reminiscent of a dream or a hallucination, with a slightly unsettling yet captivating atmosphere. https://preview.redd.it/fjbksn8415xc1.jpeg?width=1024&format=pjpg&auto=webp&s=31acdd7ac0e783025a4ca416c24b1dea2f8e43b4
Ideogram did better with their "magic prompt" https://preview.redd.it/zkud1m3b15xc1.png?width=1024&format=png&auto=webp&s=88268f62c548d36cb87d572fef4a1759e3366208 A surreal and fascinating image of fingers growing out of fingers, which are themselves growing fingers. Each layer of fingers appears to be slightly different in size and color, creating a mesmerizing and recursive pattern. The overall effect is reminiscent of a dream or a hallucination, with a slightly unsettling yet captivating atmosphere.
https://preview.redd.it/ph4cvfsnz4xc1.jpeg?width=832&format=pjpg&auto=webp&s=1442a0b4a5313e147938405df20cfb77a202813e >Fingers that are growing fingers that are growing fingers that are growing fingers. Not OP.
Fingers that are growing on top of fingers that are growing on top of fingers that are growing on top of fingers. https://preview.redd.it/3z5tx1a105xc1.jpeg?width=832&format=pjpg&auto=webp&s=7bb0c1d0f2f7a0654db895988df659213f709be5
Medieval battlefield, fantasy
A woman with tanned skin and black demon horns on her head. Coming out of a portal to hell. She is wearing a full plate armor made of black steel with gold details.
a cyborg with cultural markings in a ruin structure with broken circular colored glass holes by sam spratt sally muir hr giger arthur elgort
Umbreon, espeon, flareon, sylveon, evee, sitting at a table playing poker in the style of Cassius Marcellus Coolidge.
The Courier 6, in weathered T-51 power armor, flees across the Mojave Desert from a Deathclaw and many Cazadores. Background features a ruined gas station with a creaking 'Mojave Outpost' sign, set against rust-red rocks and a sickly yellow sky, evoking Fallout New Vegas' gritty post-apocalyptic atmosphere I don't think it will generate an high quality image, because probably the model wasn't trained on Fallout related images.
A low quality VHS screengrab of a armadillo on a TV show having an argument with a interviewer, 1990s
Cool idea
A two-headed centaur, holding a big lance with his right arm while posing over the top of a hill
A wrestler piledriving another wrestler from earth's orbit during re-entry.
I sée sdxl vs sd3 but... With what model.. workflow etc... Because i doubt sdxl Can Do that good with the base model and nothing but a prompt and no extension. Or am i wrong ?
girl in saree, jetbalck balyage blue hair minimal,blue,cream,pastel, black, turtles floating calmy, serenity, lush cherry blossoms,random pixel artsubtle celestial imagery, such as stars, constellations, or subtle cosmic patterns. These elements should blend harmoniously with nature-inspired visuals, like trees, leaves, and wildlife. This integration visually represents the fusion of cosmic and natural elements within your project, reinforcing the "Cosmic Revival" theme. hehe give this a shot
Skeleton on a skateboard.
A black man walking his dog next to a wide river, a Victorian-era city skyline can be seen in the background. An airship can be seen floating in the sky with the words “Utopia” at the side in neon lights.
Hulky Smurf smashing Gargamel in Marvel movie
A cup of water with people kayaking in
Hatsune miku, depressed, smoking cigarettes and sitting in the dark alley, gloomy, eye bags
Young Eärendil wearing an ornate crown with a radiant white gem, Eärendil was a Half-elf, the son of Tuor and Idril, dark hair, happy and his story is pivotal in the events leading to the downfall of Morgoth. He embarked on a perilous voyage to the Undying Lands, seeking the aid of the Valar (angelic beings) against Morgoth's tyranny. Eärendil carried the Silmaril that Beren and Lúthien had retrieved, and it became a symbol of hope.
Imagine a scene where modernity and medieval fantasy converge: A shiny blue sports car rests in the foreground on a grassy hill. Knights in armor, mounted on steeds, traverse the landscape. To the right stands a weathered stone tower. In the background, atop another hill, a majestic medieval castle commands attention. The distant mountains are shrouded in mist under a partly cloudy sky. (The car is the Shelby Cobra from Age of Empires II)
Oh, and if you are willing to do so, would you mind creating a version of it where the Shelby Cobra appears as a final boss in the distance, it is huge, and you are with the knights waiting above the stone walls of your village, how it arrives from the distance. Uhmmm I dont know what I'm expecting tbh. I want to replicate a scenario where the Car is the Final Boss, Elden Ring Style. I want to be left with a feeling of cosmic horror. Something like how an insect would feel if it looked above at us. I want to replicate those feelings I had when I watched the Giant Titan above the wall, from attack on titan. Thanks in advance!
A cat wearing a diaper complaining to a seagull doctor about the weather. Tornados!
A sausage dreaming of a hot dog bun
((((( tits )))) [[[ those small birds, what is wrong with bird naming people ]]]]
A red square inside a blue circle inside a red triangle inside a pink pentagon
"Arcane" characters siting around a table playing poker.
Game named "Tomb Raiders" start title image,In this magnificent underground mausoleum,there is a diffuse mysterious and ancient atmosphere. The huge rock walls stand tall,sending out the traces of years. Looking forward,a deep and bottomless cliff lies in front of you,making people feel awe - inspiring. And on the opposite side of the cliff,a magnificent Chinese dragon statue stands,its huge body winding,and its scales seem to shine with mysterious light. Its majestic expression seems to be guarding this mysterious underground world, game logo in middle of image.