Instance VS Class Token with filewords ? #667
Replies: 2 comments 2 replies
-
So, it works like this... If you, say, have 50 images of a boxer in various poses, and want to teach the robot how to make more pictures of boxes, I'd probably recommend you use some kind of tool to caption all of your photos, and ensure "boxer" is in each prompt. Then, you just use [filewords] for your instance/sample prompt, and start training. That's for "classic fine-tuning", where you're just training on pictures with captions. "Dreambooth" training is more specific to a unique subject, like a person or a specific pet. In this example, say you have 50 images of a person. I would still recommend to caption all of your images, and again, use [filewords], but this time for the class/instance/sample prompts. What makes this different is we now add in the class/instance tokens. You'd want to ensure that all of your captions have man or woman or person in them, as long as it's the same throughout. With me so far? What this does on the back end is makes it so we can prepend "xabyzz" to the prompts when training on the subject - "a photo of xabyzz person on a sailboat", and use the clean prompt "a photo of a person on a sailboat" when generating class images. Also, if your prompts already had the subject in them "a photo of xabyzz on a sailboat", it will automagically swap xabyzz with "person" when generating class prompts, and inject "person" after xabyzz for instance prompts. |
Beta Was this translation helpful? Give feedback.
-
Still a bit confused about this. I want to try and make a model that will mimic a specific character/person. @d8ahazard Let's say we have a Dog called Dandy. Is this correct? I do not want/use prompts for each picture. |
Beta Was this translation helpful? Give feedback.
-
From the readme :
Instance Token The unique identifier for your subject. (sks, xyz). Leave blank for fine-tuning.
I'm not training a person (which is pretty straight forward) - i want to train a new concept (like a bear with horns) or an action (like running / dancing / boxing).
Until today i knew that dreambooth required an instance token and class token.
But what does "Leave blank for fine tuning means" ?
I mean i'm using filewords already - so if i leave the instance token blank, do i have to enter something in the class token or leave that blank as well? Or maybe use filewords in there?
Can this DB implementation now be used to properly fine tune SD? Is this what this means?
For the prompts i already use [filewords], this is pretty straight forward, but for the instance token and class token things are a bit unclear.
If @d8ahazard or anyone else can help, i'm sure everyone would benefit from this knowledge.
Beta Was this translation helpful? Give feedback.
All reactions