-
-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BLIP 2 caption fail with fresh install #3037
Comments
Tip: If your (large long time) training works with without captions (Which it will), then DON'T uninstall and try again. I did that and have not been able to get it to work again in 2 weeks. I can train (maybe if i am lucky) small (max 30 min trains), but it crash in training randomly so i don't dare setup longer trains. Also, the setup misses a lot.. a lot of times i have to press install 2-5 times before all is installed. The same when stating train, it will not load all, but repetetive presses on train before it finally start. Kohya is so very very buggy. Or if it is bmaltais code that is buggy i don't know. Cant even get it to work in pinokio. Use WD14 instead. for captioning, or one of the other. PS: |
I tried for several hours to determine the issue, it just seems to be because BLIP-2 requires older Transformers, (possibly) anyio and Gradio versions to work correctly. WD14 Moat Tagger v2 or HWTagger work best. BLIP-1 works fine, but is practically useless for tagging. It looks like BLIP-3 (xgen-mm) is here is already, according to the developer. So Blip-2 is probably just outdated. https://huggingface.co/Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 Kohya might implement it in the future. I personally think the easiest way to tag is with HWTagger or with the WD14 extension in Forge though. Again HWTagger is a very promising contender and the developers are friendly (they even implemented a couple ideas I had). I highly recommend giving them a try! https://github.com/HaW-Tagger/HWtagger |
GIT Caption:
c:\Users\ZeroTwo\AppData\Local\Programs\Python\Python3109\python.exe: can't open file 'C:\\Users\\ZeroTwo\\Downloads\\Kohya_ss-GUI-LoRA-Portable-main\\kohya_ss-masterREINSTALL\\kohya_ss-master\\sd-scripts\\finetune\\make_captions_by_git.py': [Errno 2] No such file or directory21:22:44-176572 INFO ...captioning done
BLIP Caption:
C:\Users\ZeroTwo\AppData\Local\Programs\Python\Python3109\python.exe: can't open file 'C:\\Users\\ZeroTwo\\Downloads\\Kohya_ss-GUI-LoRA-Portable-main\\kohya_ss-masterREINSTALL\\kohya_ss-master\\sd-scripts\\finetune\\make_captions.py': [Errno 2] No such file or directory
nvidia-smi:
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 551.61 Driver Version: 551.61 CUDA Version: 12.4 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name TCC/WDDM | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 4070 ... WDDM | 00000000:01:00.0 On | N/A |
| 0% 38C P0 50W / 285W | 5845MiB / 16376MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
I use a fresh python 3.10.9 install and a fresh koyha ss setup
The text was updated successfully, but these errors were encountered: