diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Sayak Paul	1f0705adcf	[Big refactor] move unets to `unets` module 🦋 (#6630 ) * move unets to module 🦋 * parameterize unet-level import. * fix flax unet2dcondition model import * models __init__ * mildly depcrecating models.unet_2d_blocks in favor of models.unets.unet_2d_blocks. * noqa * correct depcrecation behaviour * inherit from the actual classes. * Empty-Commit * backwards compatibility for unet_2d.py * backward compatibility for unet_2d_condition * bc for unet_1d * bc for unet_1d_blocks	2024-01-23 08:57:58 +05:30
Sayak Paul	da95a28ff6	[Diffusion DPO] apply fixes from #6547 (#6668 ) apply fixes from #6547	2024-01-22 20:14:54 +05:30
HelloWorldBeginner	f95615b823	Fixed the bug related to saving DeepSpeed models. (#6628 ) * Fixed the bug related to saving DeepSpeed models. * Add information about training SD models using DeepSpeed to the README. * Apply suggestions from code review --------- Co-authored-by: mhh001 <mahonghao1@huawei.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-19 19:21:57 +05:30
SangKim	a9288b49c9	Modularize InstructPix2Pix SDXL inferencing during and after training in examples (#6569 )	2024-01-19 15:47:34 +05:30
Aryan V S	6382663dc8	[Community] Experimental AnimateDiff Image to Video (open to improvements) (#6509 ) * add animatediff img2vid * fix * Update examples/community/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix code snippet between ip adapter face id and animatediff img2vid --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2024-01-19 12:05:41 +02:00
Linoy Tsaban	619e3ab6f6	[bug fix] advanced dreambooth lora sdxl - fixes bugs described in #6486 (#6599 ) * fixes bugs: 1. redundant retraction 2. param clone 3. stopping optimization of text encoder params * param upscaling * style	2024-01-17 20:11:45 +05:30
Steve Rhoades	dce06680d2	Fixes torch.compile() compatible training (#6589 ) resolve conflicts	2024-01-17 07:47:03 +05:30
Steve Rhoades	181280baba	Fixes training resuming: Advanced Dreambooth LoRa Training (#6566 ) * Fixes #6418 Advanced Dreambooth LoRa Training * change order of import to fix nit * fix nit, use cast_training_params * remove torch.compile fix, will move to a new PR * remove unnecessary import	2024-01-16 14:30:49 +05:30
SangKim	96d6e16550	Enable image resizing to adjust its height and width in StableDiffusionXLInstructPix2PixPipeline (#6581 ) * Enable image resizing to adjust its height and width in StableDiffusionXLInstructPix2PixPipeline * Ensure that validation is performed at every 'validation_step', not at every step	2024-01-16 07:50:34 +05:30
Aryan V S	c11de13588	[training] fix training resuming problem for fp16 (SD LoRA DreamBooth) (#6554 ) * fix training resume * update * update	2024-01-16 07:27:06 +05:30
Fabio Rigano	f825221b5d	[Community Pipeline] IPAdapter FaceID (#6276 ) * Add support for IPAdapter FaceID * Add docs * Move subfolder to kwargs * Fix quality * Fix image encoder loading * Fix loading + add test * Move to community folder * Fix style * Revert constant update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-15 16:43:54 +02:00
Aryan V S	119d734f6e	[AnimateDiff+Controlnet] Fix multicontrolnet support (#6551 ) * fix multicontrolnet support * update README with multicontrolnet example	2024-01-15 16:36:54 +02:00
Haofan Wang	3d574b3bbe	Fix a bug of flip in SDXL training script (#6547 ) * Update train_text_to_image_sdxl.py * Update train_text_to_image_lora_sdxl.py	2024-01-15 16:28:04 +02:00
Charchit Sharma	09903774d9	Make T2I Adapter SDXL Training Script torch.compile compatible (#6577 ) update for t2i_adapter	2024-01-15 19:42:56 +05:30
dependabot[bot]	d6a70d8ba8	Bump jinja2 from 3.1.2 to 3.1.3 in /examples/research_projects/realfill (#6539 ) Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.2 to 3.1.3. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/jinja/compare/3.1.2...3.1.3) --- updated-dependencies: - dependency-name: jinja2 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-01-15 16:10:10 +02:00
Charchit Sharma	e3103e171f	Make InstructPix2Pix SDXL Training Script torch.compile compatible (#6576 ) * changes for pix2pix_sdxl * style fix	2024-01-15 17:54:03 +05:30
Charchit Sharma	b053053ac9	Make InstructPix2Pix Training Script torch.compile compatible (#6558 ) * added torch.compile for pix2pix * required changes	2024-01-15 17:03:22 +05:30
Vinh H. Pham	08702fc1cb	Make text-to-image SDXL LoRA Training Script torch.compile compatible (#6556 ) make compile compatible	2024-01-15 16:58:16 +05:30
Vinh H. Pham	7ce89e979c	Make text-to-image SD LoRA Training Script torch.compile compatible (#6555 ) make compile compatible	2024-01-15 16:55:08 +05:30
gzguevara	05faf3263b	SDXL text-to-image torch compatible (#6550 ) * torch compatible * code quality fix * ruff style * ruff format	2024-01-15 16:49:11 +05:30
Sayak Paul	a080f0d3a2	[Training Utils] create a utility for casting the lora params during training. (#6553 ) create a utility for casting the lora params during training.	2024-01-15 13:51:13 +05:30
Sayak Paul	79df50388d	[Training] fix training resuming problem when using FP16 (SDXL LoRA DreamBooth) (#6514 ) * fix: training resume from fp16. * add: comment * remove residue from another branch. * remove more residues. * thanks to Younes; no hacks. * style. * clean things a bit and modularize _set_state_dict_into_text_encoder * add comment about the fix detailed.	2024-01-12 17:11:06 +05:30
Vinh H. Pham	7d631825b0	Make Dreambooth SD Training Script `torch.compile` compatible (#6532 ) * support compile * make style * move unwrap_model inside function * change unwrap call * run make style * Update examples/dreambooth/train_dreambooth.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Revert "Update examples/dreambooth/train_dreambooth.py" This reverts commit `70ab09732e`. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-12 12:50:15 +05:30
gzguevara	33d2b5b087	SD text-to-image torch compile compatible (#6519 ) * added unwrapper * fiz typo	2024-01-12 09:28:35 +05:30
Suvaditya Mukherjee	f486d34b04	Make ControlNet SD Training Script `torch.compile` compatible (#6525 ) * update: make controlnet script torch compile compatible Signed-off-by: Suvaditya Mukherjee <suvadityamuk@gmail.com> * update: correct earlier mistakes for compilation Signed-off-by: Suvaditya Mukherjee <suvadityamuk@gmail.com> * update: fix code style issues Signed-off-by: Suvaditya Mukherjee <suvadityamuk@gmail.com> --------- Signed-off-by: Suvaditya Mukherjee <suvadityamuk@gmail.com>	2024-01-12 09:27:26 +05:30
Charchit Sharma	e44b205e0b	Make ControlNet SDXL Training Script torch.compile compatible (#6526 ) * make torch.compile compatible * fix quality	2024-01-12 09:25:09 +05:30
Vinh H. Pham	60cb44323d	Make Dreambooth SD LoRA Training Script torch.compile compatible (#6534 ) support compile	2024-01-12 09:24:03 +05:30
Radamés Ajna	1dd0ac9401	[DPO Training] pass tracker name as argument (#6542 ) pass tracker name as argumentw	2024-01-12 09:15:39 +05:30
Aryan V S	9df566e6da	[Community] StyleAligned Pipeline (#6489 ) * add stylealigned sdxl pipeline * bugfix * update docs * remove einops dependency * update README * update example docstring	2024-01-11 14:35:55 +01:00
Sayak Paul	be0b425762	[Training] make checkpointing compatible when using `torch.compile` (part II) (#6511 ) make checkpointing compatible when using torch.compile.	2024-01-11 18:37:30 +05:30
dg845	17cece072a	Fix bug in LCM Distillation Scripts when args.unet_time_cond_proj_dim is used (#6523 ) * Fix bug where unet's time_cond_proj_dim is not set correctly if using args.unet_time_cond_proj_dim. * make style	2024-01-11 08:21:07 +05:30
Rahul Raman	2d1f2182cc	example: Train Instruct pix2 pix with lora implementation (#6469 ) * base template file - train_instruct_pix2pix.py * additional import and parser argument requried for lora * finetune only instructpix2pix model -- no need to include these layers * inject lora layers * freeze unet model -- only lora layers are trained * training modifications to train only lora parameters * store only lora parameters * move train script to research project * run quality and style code checks * move train script to a new folder * add README * update README * update references in README --------- Co-authored-by: Rahul Raman <rahulraman@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-10 06:38:19 +05:30
Sayak Paul	4497b3ec98	[Training] make DreamBooth SDXL LoRA training script compatible with torch.compile (#6483 ) * make it torch.compile comaptible * make the text encoder compatible too. * style	2024-01-09 20:11:26 +05:30
Yifan Zhou	fc63ebdd3a	[Community Pipeline] Rerender-A-Video: Zero-Shot Video-to-Video Translation (#6332 ) * upload codes and doc * lint * lint * lint * update code * remove blank lines * Fix load url	2024-01-09 14:55:34 +01:00
jiqing-feng	aa1797e109	enable stable-xl textual inversion (#6421 ) * enable stable-xl textual inversion * check if optimizer_2 exists * check text_encoder_2 before using * add textual inversion for sdxl in a single file * fix style * fix example style * reset for error changes * add readme for sdxl * fix style * disable autocast as it will cause cast error when weight_dtype=bf16 * fix spelling error * fix style and readme and 8bit optimizer * add README_sdxl.md link * add tracker key on log_validation * run style * rm the second center crop --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-09 15:12:33 +05:30
Sayak Paul	a483a8eddf	Rename REAMDE.md to README.md	2024-01-05 20:49:44 +05:30
Vinh H. Pham	3848606c7e	[Community Pipeline] Add gluegen (#6433 ) * init works * add gluegen pipeline * add gluegen code * add another way to load language adapter * make style * Update README.md * change doc	2024-01-05 13:05:40 +01:00
Sayak Paul	2a97067b84	[Experimental] Diffusion LoRA DPO training (#6422 ) * add: experimental script for diffusion dpo training. * random_crop cli. * fix: caption tokenization. * fix: pixel_values index. * fix: grad? * debug * fix: reduction. * fixes in the loss calculation. * style * fix: unwrap call. * fix: validation inference. * add: initial sdxl script * debug * make sure images in the tuple are of same res * fix model_max_length * report print * boom * fix: numerical issues. * fix: resolution * comment about resize. * change the order of the training transformation. * save call. * debug * remove print * manually detaching necessary? * use the same vae for validation. * add: readme.	2024-01-05 16:40:06 +05:30
Sayak Paul	9d945b2b90	0.25.0 post release (#6358 ) * post release * style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2024-01-05 16:13:27 +05:30
Junsheng121	d184291c7d	null-text-inversion-pipeline-implementation (#6329 ) * null-text-inversion-implementation * edited * edited * edited * edited * edited * edit * makestyle --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-05 11:35:21 +01:00
Linoy Tsaban	2fada8dc1b	[bug fix] fixes #6444 - checkpointing save issue in advanced dreambooth lora sdxl script (#6464 ) * unwrap text encoder when saving hook only for full text encoder tuning * unwrap text encoder when saving hook only for full text encoder tuning * save embeddings in each checkpoint as well * save embeddings in each checkpoint as well * save embeddings in each checkpoint as well * Update examples/advanced_diffusion_training/train_dreambooth_lora_sdxl_advanced.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-05 15:35:24 +05:30
jiqing-feng	f2d51a28f7	Intel Gen 4 Xeon and later support bf16 (#6367 ) * Intel Gen 4 Xeon and later support bf16 * fix bf16 notes	2024-01-05 11:47:28 +05:30
dg845	f3d1333e02	Improve LCM(-LoRA) Distillation Scripts (#6420 ) * Make WDS pipeline interpolation type configurable. * Make the VAE encoding batch size configurable. * Make lora_alpha and lora_dropout configurable for LCM LoRA scripts. * Generalize scalings_for_boundary_conditions function and make the timestep scaling configurable. * Make LoRA target modules configurable for LCM-LoRA scripts. * Move resolve_interpolation_mode to src/diffusers/training_utils.py and make interpolation type configurable in non-WDS script. * apply suggestions from review	2024-01-05 06:55:13 +05:30
Sayak Paul	aad18faa3e	Update README_sdxl.md to update the LR (#6432 ) Update README_sdxl.md	2024-01-03 20:55:51 +05:30
Sayak Paul	d700140076	[LoRA deprecation] handle rest of the stuff related to deprecated lora stuff. (#6426 ) * handle rest of the stuff related to deprecated lora stuff. * fix: copies * don't modify the uNet in-place. * fix: temporal autoencoder. * manually remove lora layers. * don't copy unet. * alright * remove lora attn processors from unet3d * fix: unet3d. * styl * Empty-Commit	2024-01-03 20:54:09 +05:30
Aryan V S	e30b661437	Update lpw_xl pipeline to latest diffusers (#6411 ) * add clip_skip, freeu, qkv * fix * add ip-adapter support * callback on step end * update * fix NoneType bug * fix * add guidance scale embedding * add textual inversion	2024-01-02 16:28:45 +01:00
Linoy Tsaban	b4077af212	[bug fix] using snr gamma and prior preservation loss in the dreambooth lora sdxl training scripts (#6356 ) * change timesteps used to calculate snr when --with_prior_preservation is enabled * change timesteps used to calculate snr when --with_prior_preservation is enabled (canonical script) * style * revert canonical script to before snr gamma change --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-02 09:21:39 -06:00
2510	8a366b835c	Fix gradient-checkpointing option is ignored in SDXL+LoRA training. (#6388 ) (#6402 ) * Fix gradient-checkpointing option is ignored in SDXL+LoRA training. (#6388) * Fix gradient-checkpointing option is ignored in SD+LoRA training. * Fix gradient checkpoint is not applied to text encoders. (SDXL+LoRA) --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-01 08:51:04 +05:30
apolinário	1622265e13	Add WebUI format support to Advanced Training Script (#6403 ) * Add WebUI format support to Advanced Training Script * style --------- Co-authored-by: multimodalart <joaopaulo.passos+multimodal@gmail.com>	2023-12-30 08:45:49 -06:00
gzguevara	9f283b01d2	changed w&b report link (#6387 )	2023-12-29 19:49:11 +05:30

1 2 3 4 5 ...

771 Commits