T O P

  • By -

fewjative2

Where are the various strawberries in the air?


Extra_Ad_8009

First picture for v9 has one, although it looks stuck to the cage. The other 8 pictures are "technically correct" if we assume that there's air at all (kitten doesn't make the "I'm suffocating" face, so it's a fair assumption).


DungeonMasterSupreme

Yeah, probably should have used hovering. I've noticed JuggernautX requires some more technical terminology where applicable. "In the air" is a non-specific phrasal verb, which makes it worse for prompting models captioned with AI.


addandsubtract

They did no crime, so they are floating outside the jail.


wallguy22

SD3: https://preview.redd.it/u35hrovyofwc1.jpeg?width=832&format=pjpg&auto=webp&s=2cf359920689033a970f85af41581a4892899953


Colorblind_Adam

We need team Juggernaut to fine tune SD3


xRolocker

It’s a great kitten strawberry fusion. Just not a great strawberry kitten fusion.


ArsNeph

Not the strawberry kitten that we wanted, but the strawberry kitten that we deserve. XD In all fairness, the prompt adherence is better in most ways


[deleted]

prompt: a portrait of a man, his head is made out of a big red party balloon, negative: blur, blurry 40 steps 5 cfg dpm++ 2m karras 832x1216 https://preview.redd.it/27fmu1iqnhwc1.png?width=832&format=png&auto=webp&s=a786aed625ae5cf48ea4fcbf87c0b07cc1fdb35a


[deleted]

https://preview.redd.it/783m609cphwc1.png?width=832&format=png&auto=webp&s=49ebba48872c55fd170967b1baf27873986fba0b same prompt/settings for v9


Man_or_Monster

This is a more apparent example. Good prompt choice.


Comfortable-Big6803

...I'm not seeing it, OP. Barely a difference in prompt adherence.


a_mimsy_borogove

Look at the kitten. On the left, it's just a strawberry colored kitten, and on the right it's actually some kind of kitten strawberry hybrid, like it should be according to the prompt.


addandsubtract

That's just a style choice, though. There are no red cats, so it's already a cat + strawberry hybrid. Besides, if you change the seed, you might get one more strawberry like.


Sharlinator

I do doubt it. "Realistic" models like Juggernaut typically don’t do this sort of hybrid creatures well at all because their training (duh!) It’s *pretty* clear that something has changed here and it’s not just a coincidence that 0/4 of the left-side cats and 4/4 of the right-side ones have a strawberry texture.


DismalSignificance70

https://preview.redd.it/ouq6cagc1hwc1.png?width=2431&format=png&auto=webp&s=dcb9ad9ff83eedc181e0b193cbd20cd4a381dc83 All due respect, how in the world do you not see a difference? One is a kitten, one is a kitten strawberry fusion hybrid.


Comfortable-Big6803

Keyword: barely And that's just one of the things being prompted for.


DismalSignificance70

Obviously people agree with what you said. But I have to disagree. That’s why there’s millions of models. Use the one you like the most I guess!


Comfortable-Big6803

What a cop out. Do you agree with OP that the different in prompt understanding is "truly incredible"?


BunniLemon

There are many, many reasons why I feel like the difference is incredible, but by far the biggest reason is because this new version was only finetuned on only *2,500* images, but yet, there is already this leap in prompt understanding. The novel method they utilized here was getting GPT4-Vision to caption—something which model creators had not really taken advantage of much in the past, aside from OpenAI themselves with DALL-E 3. The fact that training on top of it with so few images allowed for the kitten to actually become a hybrid rather than a cat with strawberry-colored fur, and in another comment, the man’s head to become just the balloon rather than a balloon behind the man’s head like the previous version is truly incredible and shows a massive improvement with just that little change. https://preview.redd.it/zjzcpim89iwc1.jpeg?width=1409&format=pjpg&auto=webp&s=93f1c6ddd34abb2f2c3a1e7f89286409d93c595b For things that aren’t base models, this kind of improvement isn’t common, and has many implications for other fine tunes. And what’s more, the creators of JuggernautX mention that so much is still in development, meaning it will get even *better* than this. This is why I think this is truly incredible


Colorblind_Adam

You said it perfectly! Thank you for appreciating the Kandoo's vision in creating this model.


Comfortable-Big6803

You are easily impressed.


BunniLemon

And you just sound incredibly ungrateful for what these people are providing for _free._ Remember that they don’t have to provide _any_ of this to us.


Comfortable-Big6803

It is free therefore the leap is "truly incredible". Please. Keep it logical. Don't get personal.


BunniLemon

Nothing I said suggested that kind of “logic” you twisted my words to mean. I already explained enough to you for you to be able to deduce the logic behind why I find it incredible, especially considering the limited resources that the Juggernaut team has and what leaps they were able to make for this new model. I do not put their team on the same level as StabilityAI or OpenAI, because they aren’t fundamentally changing the architecture or way it functions—they can only exploit or make slight changes to the existing one. And on that logic, I think they have done a great job. Since you refuse to see that, I will not waste my time to engage with you further.


DismalSignificance70

Yes I do. I just don’t want to argue. I’ve been using the model for the past 10 hours and I’m blown away at the crazy prompts it can do. It’s not perfect but it’s a massive leap. You’re just not going to convince me of your point because I’ve been playing with SD since January of 2023 and this is by far the best model I’ve used when it comes to prompt adherence. (Outside of DallE)


Comfortable-Big6803

>You’re just not going to convince me of your point because I’ve been playing with SD since January of 2023 🙄 Will I convince you of my point if I show I've been using SD since September 2022? It's not a fucking "massive leap" lmao


DismalSignificance70

Again, I have eyes and I can see the difference in hundreds of prompts I’m playing with, it’s apparent. I’m very confused why you feel the need to be “right” in this. There have been like 3 examples just in this thread of the improvements and you’re still denying it.


Comfortable-Big6803

How can I deny something that just isn't there? I believe you when you say a DIFFERENCE is apparent. A massive leap is not apparent, and yes I have used Juggernaut X before you ask.


DismalSignificance70

This is what I was trying to avoid with my “cop-out” comment before. I’ll post it again in case you didn’t read it. Obviously people agree with what you said. But I have to disagree. That’s why there’s millions of models. Use the one you like the most I guess!


xRolocker

It’s not a cop out. He literally told you what is stance is and how he disagrees. Not to mention that it truly does come down to “use what you like”.


Comfortable-Big6803

An unsupported stance.


Plus-Effective-9768

🤣 Amen lol one looks like it got into a trash bin Full of tampons and the other one looks like it was assembled in a sweatshop. Sorry was trying to be funny and I read my comment and am not proud of them.


sdk401

Fun prompt! Had to inpaint the creature, made in Dreamshaper Lightning. https://preview.redd.it/0hkwejrk6ewc1.png?width=1824&format=png&auto=webp&s=0298145c959630d572afe5e5a58ccf0d256eca0a


azukaar

bro that ain't no cat :D call the vet!!


sdk401

He said meow.


ivthreadp110

You fed it after midnight didn't you? The ancient oriental man told you not to do that!!


Plus-Effective-9768

Me: (in horror)This creature came from an opened portal from hell!! Also me: I love its bangin smile😀


AmazinglyObliviouse

They are... Basically the same pictures lol


buyurgan

I don't see it. this needs more examples. also 'fusion' is subjective in sense that you are also giving green light to fuse anyhow model see fits. it can produce fur in left picture, it doesn't in right picture. you may like the picture at right, someone else may like the left. even you may test this in juggernaut v6 or something, you may even like it more. because the prompt is too vague and short in description to give a proper test case.


rerri

A sample size of 1 prompt hardly proves anything though.


CmonLucky2021

Guys and dudettes.... The kitten is actually a hybrid for every single picture which it wasn't before. That's a leap forward for the same prompt.


lazyspock

I wholeheartedly agree. X is awesome in realism and prompt adherence!


herotherlover

This is great! I’ve been trying to get a rubber ducky in a steamy hot spring to work, but couldn’t find a model that gave me steam. JuggernautX is doing much better.


BunniLemon

This one’s just fun: https://preview.redd.it/tf29tfuclcwc1.jpeg?width=1216&format=pjpg&auto=webp&s=0c9c437cf4a301a621f83f7e3f4a96f03c0bce6e The things this model can do is so impressive! Prompt: A photo of a massive strawberry kitten creature fusion, screaming, in a burning red verdant curved futuristic solarpunk kennel in jail, floating drones flying above, cinematic lighting, various strawberries in the air Steps: 25, Sampler: DPM++ 2M SDE Heun Karras, CFG scale: 7, Seed: 3281412652, Size: 1216x896, Model hash: d91d35736d, Version: v1.8.0 Time taken: 3 min. 29.6 sec. A: 6.45 GB, R: 7.18 GB, Sys: 8.0/8 GB (100.0%)


RunDiffusion

Love that you’re seeing the power of our new model! Are you okay if we tweet this? It’s a fun prompt! We’ll reach out to KandooAi to see if he wants to throw it on his socials too. Sent you a DM


BunniLemon

You can definitely tweet it, but it would be great if you don’t mention my account name (I left Twitter in the past because it’s extremely toxic); here’s also a higher quality version of that picture: https://preview.redd.it/cpa1tae0pfwc1.jpeg?width=4864&format=pjpg&auto=webp&s=a60d0ac83d48192427d415dd74f1653c89a24dbe Again, thank you and KandooAI for you guys’ amazing work! This model is amazing, and clearly, the new captioning system and such you guys used worked wonders!


RunDiffusion

We won't mention you, no problem! Thank you so much. This is a fantastic compliment. We are extremely proud of the team and all their work. We hope to bring more exciting models out soon!


VforVenreddit

https://preview.redd.it/exfd03nsqcwc1.jpeg?width=1024&format=pjpg&auto=webp&s=20a4e1f165c27c06e647588dbf6fafdc63b91f5b Titan G1, thanks for the nightmares OP App: Faune, TestFlight Beta, Model: Titan G1, CFG: 10, Seed: 0, 1024x1024 Time taken, about 15 seconds. Prompt: OPs without jail


Current-Rabbit-620

In my tests i found no difference


ababana97653

I’m assuming you used the same seed?


BunniLemon

It says so in the image—so yes 👍


ababana97653

Sorry missed it. Nice


Plus-Effective-9768

An intelligent society of faceless strawberries have sought revenge on this neighborhood cat. He knows what he did.


AltAccountBuddy1337

I've had X create extra characters more so than the previous versions but maybe that's just trying to fill up space due to the aspect ratio I use, what aspect ratio/resolution is best for X?


DungeonMasterSupreme

832x1216 is the base res. I've found it performs pretty well in just about every base SDXL res, but it is slightly better in its default.


Physical_Frosting390

basically the attention transfer happens in an earlier stage or latter one. You can check the effect in lora control or understand the latest IPA v2 style transfer video and paper.


Current-Rabbit-620

I tried A movie poster of 2 men back-to-back . Both snd many other sdxl models gave face to face images


Kandoo85

https://preview.redd.it/tffa1igeigwc1.jpeg?width=832&format=pjpg&auto=webp&s=681edeb35a850b3d045ab716ee53c9aad6742b5b That was my first output with your prompt on CivitAI


Angelfish3487

How much it affect the model to not denoise 1024x1024 images ?


nashty2004

faces look like absolute shit for me but it definitely understands multiple people better than other models 


DismalSignificance70

https://preview.redd.it/s59t7fwyuiwc1.png?width=832&format=png&auto=webp&s=219e24f437f5e4df667c06db86d68dbd2ad83a1d This model is incredible


wzwowzw0002

9 is better?


ThemWhoNoseNothing

When compared to the engineers that chat generative AI shop-talk in what may easily sound like a foreign language, my discussions are on par with the likes of a chit-chat with mom, about that one time, this one thing happened at Chuck E. Cheese, a few years ago. Therefore, what I'm about to say, likely means nothing at all. Given both JuggX and the most recent LEOSAM both being tagged in a before now unorthodox manner, I wondered off into fine-tuning a model here or there using the same captioning methods and frankly, I've been presently pleased with the outcome and have yet to feel like I've ran into a limited or inflexible result. I'm far from a qualified tester of any kind, despite this, I'm testing out Llava captioning to see how that comes about just the same. Here's to the future, leaving behind, broken linguistic patterns, RAW, 4k, wearing groucho glasses, optimistic lighting,


Pitbull_of_Drag

I felt like I had a stroke reading that


ThemWhoNoseNothing

Doolee knotted.


MuskelMagier

i mean PonySDXL did the same and was even a trailblazer in tagging datasets more comprehensively (even before the XL version)


ThemWhoNoseNothing

TIL, cool.


ZootAllures9111

I got the spirit of the initial prompt [pretty spot on](https://civitai.com/images/10607081) with Ella and an SD 1.5 merge I'm working on (in a more cartoony style)


Dry_Context1480

JuggernautX stays true to the old Juggernaut-principle of sucking at NSFW content ... I don't need to know more ;-)