diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
Patrick von Platen	29f15673ed	Release: v0.21.0	2023-09-13 15:58:24 +02:00
Will Berman	d73e6ad050	guard save model hooks to only execute on main process (#4929 )	2023-09-08 10:30:06 -07:00
Mario Namtao Shianti Larcher	87ae330056	[Examples] Save SDXL LoRA weights with chosen precision (#4791 ) * Increase min accelerate ver to avoid OOM when mixed precision * Rm re-instantiation of VAE * Rm casting to float32 * Del unused models and free GPU * Fix style	2023-08-28 13:57:40 +05:30
Mario Namtao Shianti Larcher	c25c46137d	[Examples] Add madebyollin VAE to SDXL LoRA example, along with an explanation (#4762 ) Add madebyollin VAE to LoRA example, along with an explenation	2023-08-25 09:34:32 +05:30
Sayak Paul	4909b1e3ac	[Examples] fix checkpointing and casting bugs in `train_text_to_image_lora_sdxl.py` (#4632 ) * fix: casting issues. * fix checkpointing. * tests * fix: bugs	2023-08-23 10:58:54 +05:30
Sayak Paul	d0c30cfd37	make post-release (#4650 )	2023-08-17 14:16:25 +05:30
Sayak Paul	5175d3d7a5	add: train to text image with sdxl script. (#4505 ) * add: train to text image with sdxl script. Co-authored-by: CaptnSeraph <s3raph1m@gmail.com> * fix: partial func. * fix: default value of output_dir. * make style * set num inference steps to 25. * remove mentions of LoRA. * up min version * add: ema cli arg * run device placement while running step. * precompute vae encodings too. * fix * debug * should work now. * debug * debug * goes alright? * style * debugging * debugging * debugging * debugging * fix * reinit scheduler if prediction_type was passed. * akways cast vae in float32 * better handling of snr. Co-authored-by: bghira <bghira@users.github.com> * the vae should be also passed * add: docs. * add: sdlx t2i tests * save the pipeline * autocast. * fix: save_model_card * fix: save_model_card. --------- Co-authored-by: CaptnSeraph <s3raph1m@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: bghira <bghira@users.github.com>	2023-08-16 09:02:49 +05:30
Wang Qiang	078df46bc9	An invalid clerical error in sdxl finetune (#4608 )	2023-08-15 10:41:51 +05:30
Sayak Paul	d67eba0f31	[Utility] adds an image grid utility (#4576 ) * add: utility for image grid. * add: return type. * change necessary places. * add to utility page.	2023-08-12 10:34:51 +05:30
Sayak Paul	d5983a6779	[Examples] fix: network_alpha -> network_alphas (#4572 ) network_alpha	2023-08-11 14:18:49 +05:30
Rastislav Švarba	6c5b5b260e	Fix push_to_hub in train_text_to_image_lora_sdxl.py example (#4535 ) fix: push_to_hub in train text2image lora sdxl	2023-08-09 11:48:24 +05:30
takuoko	9c29bc2df8	[Examples] Support train_text_to_image_lora_sdxl.py (#4365 ) * add train_text_to_image_lora_sdxl.py * add train_text_to_image_lora_sdxl.py * add test and minor fix * Update examples/text_to_image/README_sdxl.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * fix unwrap_model rule * add invisible-watermark in requirements * del invisible-watermark * Update examples/text_to_image/README_sdxl.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/text_to_image/README_sdxl.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/text_to_image/train_text_to_image_lora_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * del comment & update readme --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-06 13:47:20 +05:30
AisingioroHao	70d098540d	Add a data_dir parameter to the load_dataset method. (#4482 ) Co-authored-by: AisingioroHao0 <1286098622@qq.com>	2023-08-06 08:45:48 +05:30
Patrick von Platen	20e92586c1	0.20.0dev0 (#4299 ) * 0.20.0dev0 * make style	2023-07-26 23:06:18 +02:00
Ruoxi	ece55227ff	Multiply lr scheduler steps by `num_processes`. (#3983 ) * Multiply lr scheduler steps by `num_processes`. * Stop multiplying steps by gradient accumulation.	2023-07-13 17:50:25 +05:30
Patrick von Platen	b9feed8795	move to 0.19.0dev (#4048 )	2023-07-11 22:49:12 +02:00
takuoko	cdf2ae8a84	[Enhance] Add LoRA rank args in train_text_to_image_lora (#3866 ) * add rank args in lora finetune * del network_alpha	2023-06-29 17:09:59 +05:30
Sayak Paul	4870626728	[Examples] Improve the model card pushed from the `train_text_to_image.py` script (#3810 ) * refactor: readme serialized from the example when push_to_hub is True. * fix: batch size arg. * a bit better formatting * minor fixes. * add note on env. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * condition wandb info better * make mixed_precision assignment in cli args explicit. * separate inference block for sample images. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * address more comments. * autocast mode. * correct none image type problem. * ifx: list assignment. * minor fix. --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-06-20 08:59:41 +05:30
Will Berman	3ddc2b7395	[train text to image] add note to loading from checkpoint (#3806 ) add note to loading from checkpoint	2023-06-16 11:54:49 +05:30
Will Berman	d49e2dd54c	manual check for checkpoints_total_limit instead of using accelerate (#3681 ) * manual check for checkpoints_total_limit instead of using accelerate * remove controlnet_conditioning_embedding_out_channels	2023-06-15 15:38:54 -07:00
Patrick von Platen	908e5e9cc6	Fix some bad comment in training scripts (#3798 ) * relax tolerance slightly * correct incorrect naming	2023-06-15 15:07:51 +02:00
Patrick von Platen	c42f6ee43e	Post 0.17.0 release (#3721 ) * Post release * Post release	2023-06-08 18:08:49 +02:00
Zachary Mueller	79fa94ea8b	Apply deprecations from Accelerate (#3714 ) Apply deprecations	2023-06-08 16:44:22 +02:00
Alex McKinney	cd9d0913d9	Fixes eval generator init in `train_text_to_image_lora.py` (#3678 )	2023-06-07 15:37:13 +05:30
Max-We	12a232efa9	Fix schedulers zero SNR and rescale classifier free guidance (#3664 ) * Implement option for rescaling betas to zero terminal SNR * Implement rescale classifier free guidance in pipeline_stable_diffusion.py * focus on DDIM * make style * make style * make style * make style * Apply suggestions from Peter Lin * Apply suggestions from Peter Lin * make style * Apply suggestions from code review * Apply suggestions from code review * make style * make style --------- Co-authored-by: MaxWe00 <gitlab.9v1lq@slmail.me> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-06-07 10:57:10 +01:00
Kashif Rasul	f1d4743394	fixed typo in example train_text_to_image.py (#3608 ) fixed typo	2023-06-02 20:54:54 +05:30
wfng92	2faf91dbde	Add min snr to text2img lora training script (#3459 ) add min snr to text2img lora training script	2023-05-17 16:37:45 +05:30
Sayak Paul	3a237f4fa2	fix: deepseepd_plugin retrieval from accelerate state (#3410 )	2023-05-12 10:02:22 +01:00
Stas Bekman	af2a237676	[deepspeed] partial ZeRO-3 support (#3076 ) * [deepspeed] partial ZeRO-3 support * cleanup * improve deepspeed fixes * Improve * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-11 16:59:20 +01:00
Isamu Isozaki	fa9e35fca4	Added input pretubation (#3292 ) * Added input pretubation * Fixed spelling	2023-05-04 18:12:32 +05:30
Sayak Paul	71de5b7051	[LoRA] quality of life improvements in the loading semantics and docs (#3180 ) * 👽 qol improvements for LoRA. * better function name? * fix: LoRA weight loading with the new format. * address Patrick's comments. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * change wording around encouraging the use of load_lora_weights(). * fix: function name. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-28 11:36:49 +05:30
Patrick von Platen	f842396367	Post release for 0.16.0 (#3244 ) * Post release * fix more	2023-04-26 17:43:09 +01:00
Patrick von Platen	6ba0efb9a1	Release: v0.16.0	2023-04-26 13:35:01 +02:00
Sayak Paul	3b641eabe9	feat: verfication of multi-gpu support for select examples. (#3126 ) * feat: verfication of multi-gpu support for select examples. * add: multi-gpu training sections to the relvant doc pages.	2023-04-18 08:36:13 +05:30
YiYi Xu	1bd4c9e93d	remvoe one line as requested by gc team (#3077 ) remvoe one line	2023-04-14 06:39:25 -10:00
Patrick von Platen	0a73b4d3cd	[Post release] v0.16.0dev (#3072 )	2023-04-12 17:18:30 +01:00
Patrick von Platen	e7534542a2	Release: v0.15.0	2023-04-12 15:15:31 +00:00
Will Berman	67ec9cf513	accelerate min version for ProjectConfiguration import (#3042 )	2023-04-11 10:12:28 -07:00
Patrick von Platen	8b451eb63b	Fix config prints and save, load of pipelines (#2849 ) * [Config] Fix config prints and save, load * Only use potential nn.Modules for dtype and device * Correct vae image processor * make sure in_channels is not accessed directly * make sure in channels is only accessed via config * Make sure schedulers only access config attributes * Make sure to access config in SAG * Fix vae processor and make style * add tests * uP * make style * Fix more naming issues * Final fix with vae config * change more	2023-04-11 13:35:42 +02:00
Sayak Paul	24947317a6	[Examples] Add support for Min-SNR weighting strategy for better convergence (#2899 ) * improve stable unclip doc. * feat: support for applying min-snr weighting for faster convergence. * add: support for validation logging with wandb * make not a required arg. * fix: arg name. * fix: cli args. * fix: tracker config. * fix: loss calculation. * fix: validation logging. * fix: unwrap call. * fix: validation logging. * fix: internval. * fix: checkpointing push to hub. * fix: `c8a2856c6d`\#commitcomment-106913193 * fix: norm group test for UNet3D. * address PR comments. * remove unneeded code. * add: entry in the readme and docs. * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> --------- Co-authored-by: Suraj Patil <surajp815@gmail.com>	2023-04-06 19:08:40 +05:30
Lucain	a87e88b783	Use `upload_folder` in training scripts (#2934 ) use upload folder in training scripts Co-authored-by: testbot <lucainp@hf.co>	2023-04-04 16:19:12 +01:00
Naoki Ainoya	14e3a28c12	Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' (#2732 ) The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.	2023-03-23 13:49:22 +01:00
Mishig	8e35ef0142	[doc wip] literalinclude (#2718 )	2023-03-23 13:42:54 +01:00
Haofan Wang	e0d8c9ef83	Support for Offset Noise in examples (#2753 ) * add noise offset * make style	2023-03-23 09:36:17 +05:30
Patrick von Platen	e828232780	Rename attention (#2691 ) * rename file * rename attention * fix more * rename more * up * more deprecation imports * fixes	2023-03-16 00:35:54 +01:00
Will Berman	ebd44957fc	image generation main process checks (#2631 )	2023-03-14 01:28:03 -07:00
zxypro	f0b661b8fb	[Docs]Fix invalid link to Pokemons dataset (#2583 )	2023-03-07 14:26:09 +01:00
Pedro Cuenca	d3ce6f4b1e	Support revision in Flax text-to-image training (#2567 ) Support revision in Flax text-to-image training.	2023-03-07 08:16:31 +01:00
Patrick von Platen	3d2648d743	[Post release] Push post release (#2546 )	2023-03-03 18:11:01 +01:00
Patrick von Platen	f20c8f5a1a	Release: v0.14.0	2023-03-03 16:45:08 +01:00

1 2 3

114 Commits