Skip to content

Results between training previews and after merge are really different #5

Description

@scarbain

Hi!

I'm training using the default values you set.
After a few thousands of steps, the training previews are starting to be overfitted but if I take the corresponding attention processor and merge it with my base model, the inference images in automatic1111 are really undertrained. I can see that the base model have changed (comparing with same seed) and it's "starting" to converge to my concept but it's still WAY undertrained, even after 10000 steps at LR1e-4.

Any idea to what this is due ? Is it because of a configuration in my automatic1111 or the conversion script ?

How many steps have you used in your tests ? I've made some tests with 5 and with 200 images with different concepts, same problem.

Thanks

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions