diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Sayak Paul	cc12f3ec92	[Examples] Update with HFApi (#5393 ) * update training examples to use HFAPI. * update training example. * reflect the changes in the korean version too. * Empty-Commit	2023-10-16 19:34:46 +05:30
Heinz-Alexander Fuetterer	0ea78f9707	chore: fix typos (#5386 ) * chore: fix typos * Update src/diffusers/pipelines/shap_e/renderer.py Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com> --------- Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>	2023-10-16 15:23:37 +02:00
Kashif Rasul	d03c9099bc	[Wuerstchen] text to image training script (#5052 ) * initial script * formatting * prior trainer wip * add efficient_net_encoder * add CLIPTextModel * add prior ema support * optimizer * fix typo * add dataloader * prompt_embeds and image_embeds * intial training loop * fix output_dir * fix add_noise * accelerator check * make effnet_transforms dynamic * fix training loop * add validation logging * use loaded text_encoder * use PreTrainedTokenizerFast * load weigth from pickle * save_model_card * remove unused file * fix typos * save prior pipeilne in its own folder * fix imports * fix pipe_t2i * scale image_embeds * remove snr_gamma * format * initial lora prior training * log_validation and save * initial gradient working * remove save/load hooks * set set_attn_processor on prior_prior * add lora script * typos * use LoraLoaderMixin for prior pipeline * fix usage * make fix-copies * yse repo_id * write_lora_layers is a staitcmethod * use defualts * fix defaults * undo * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_prior.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/loaders.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/loaders.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/modeling_wuerstchen_prior.py * Update src/diffusers/loaders.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/loaders.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * add graident checkpoint support to prior * gradient_checkpointing * formatting * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/train_text_to_image_lora_prior.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update src/diffusers/loaders.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/train_text_to_image_prior.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * use default unet and text_encoder * fix test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-10-16 15:00:33 +02:00
Sayak Paul	93df5bb670	[Examples] fix unconditioning generation training example for mixed-precision training (#5407 ) * fix: unconditional generation example * fix: float in loss. * apply styling.	2023-10-16 14:11:35 +05:30
Sayak Paul	0fa32bd674	[Examples] use loralinear instead of depecrecated lora attn procs. (#5331 ) * use loralinear instead of depecrecated lora attn procs. * fix parameters() * fix saving * add back support for add kv proj. * fix: param accumul,ation. * propagate the changes.	2023-10-11 13:02:42 +02:00
ssusie	aea73834f6	Adding PyTorch XLA support for sdxl inference (#5273 ) * Added mark_step for sdxl to run with pytorch xla. Also updated README with instructions for xla * adding soft dependency on torch_xla * fix some styling --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2023-10-11 12:35:37 +02:00
Humphrey009	9c82b68f07	fix problem of 'accelerator.is_main_process' to run in mutiple GPUs (#5340 ) fix problem of 'accelerator.is_main_process' to run in mutiple GPUs or NPUs Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>	2023-10-10 15:39:22 +05:30
Julien Simon	d3e0750d5d	Add missing dependency in requirements file (#5345 ) Update requirements_sdxl.txt Add missing 'datasets' Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-10-10 15:37:58 +05:30
Pu Cao	cc2c4ae759	fix inference in custom diffusion (#5329 ) * Update train_custom_diffusion.py * make style * Empty-Commit --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-10-09 10:08:01 +02:00
chuzh	6bd55b54bc	Fix [core/GLIGEN]: TypeError when iterating over 0-d tensor with In-painting mode when EulerAncestralDiscreteScheduler is used (#5305 ) * fix(gligen_inpaint_pipeline): 🐛 Wrap the timestep() 0-d tensor in a list to convert to 1-d tensor. This avoids the TypeError caused by trying to directly iterate over a 0-dimensional tensor in the denoising stage * test(gligen/gligen_text_image): unit test using the EulerAncestralDiscreteScheduler --------- Co-authored-by: zhen-hao.chu <zhen-hao.chu@vitrox.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-10-09 09:54:01 +02:00
Zeng Xian	0513a8cfd8	fix typo in train dreambooth lora description (#5332 )	2023-10-08 14:54:33 +02:00
Bagheera	02a8d664a2	Min-SNR Gamma: correct the fix for SNR weighted loss in v-prediction … (#5238 ) Min-SNR Gamma: correct the fix for SNR weighted loss in v-prediction by adding 1 to SNR rather than the resulting loss weights Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-10-05 20:52:27 +02:00
Patrick von Platen	dfcce3ca6e	[Research folder] Add SDXL example (#5275 ) * [SDXL Flax] Add research folder * Add co-author Co-authored-by: Juan Acevedo <jfacevedo@google.com> --------- Co-authored-by: Juan Acevedo <jfacevedo@google.com>	2023-10-03 07:51:46 +02:00
Patrick von Platen	bdd16116f3	[Schedulers] Fix callback steps (#5261 ) * fix all * make fix copies * make fix copies	2023-10-02 19:52:53 +02:00
Sayak Paul	d56825e4b4	fix: how print training resume logs. (#5117 ) * fix: how print training resume logs. * propagate changes to text-to-image scripts. * propagate changes to instructpix2pix. * propagate changes to dreambooth * propagate changes to custom diffusion and instructpix2pix * propagate changes to kandinsky * propagate changes to textual inv. * debug * fix: checkpointing. * debug * debug * debug * back to the square * debug * debug * change condition order. * debug * debug * debug * debug * revert to original * clean --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-10-02 18:29:52 +02:00
Nicholas Bardy	1c4c4c48d9	Correct file name in t2i adapter training readme (#5207 ) Update README_sdxl.md	2023-09-28 14:51:02 +02:00
Benjamin Paine	693a0d08e4	Remove Offensive Language from Community Pipelines (#5206 ) * Update run_onnx_controlnet.py * Update run_tensorrt_controlnet.py	2023-09-27 22:02:25 +02:00
Sayak Paul	cdcc01be0e	[Examples] add `compute_snr()` to training utils. (#5188 ) add compute_snr() to training utils.	2023-09-27 21:42:20 +05:30
Bagheera	4a06c74547	Min-SNR Gamma: follow-up fix for zero-terminal SNR models on v-prediction or epsilon (#5177 ) * merge with main * fix flax example * fix onnx example --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-26 18:14:52 +05:30
Bagheera	89d8f84893	Timestep bias for fine-tuning SDXL (#5094 ) * Timestep bias for fine-tuning SDXL * Adjust parameter choices to include "range" and reword the help statements * Condition our use of weighted timesteps on the value of timestep_bias_strategy * style --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-26 13:45:37 +05:30
Bagheera	539846a7d5	SDXL microconditioning documentation should indicate the correct default order of parameters, so that developers know (#5155 ) * SDXL microconditioning documentation should indicate the correct default order of parameters, so that developers know * SDXL microconditioning documentation should indicate the correct default order of parameters, so that developers know * empty --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-25 20:22:09 +02:00
Anh71me	28254c79b6	Fix type annotation (#5146 ) * Fix type annotation on Scheduler.from_pretrained * Fix type annotation on PIL.Image	2023-09-25 19:26:39 +02:00
Bagheera	d558811b26	Min-SNR gamma support for Dreambooth training (#5107 ) * min-SNR gamma for Dreambooth training * Align the mse_loss_weights style with SDXL training example --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-21 22:53:06 +01:00
Bagheera	24563ca654	SNR gamma fixes for v_prediction training (#5106 ) Co-authored-by: bghira <bghira@users.github.com>	2023-09-20 21:18:56 +01:00
Bagheera	74e43a4fbd	Resolve v_prediction issue for min-SNR gamma weighted loss function (#5096 ) * Resolve v_prediction issue for min-SNR gamma weighted loss function * Combine MSE loss calculation of epsilon and velocity, with a note about the application of the epsilon code to sample prediction * style --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-19 17:31:27 +01:00
Bagheera	81331f3b7d	Add x-prediction / prediction_type=sample support for SDXL fine-tuning (#5095 ) Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-19 16:57:44 +01:00
maksymbekuzarovSC	65c162a5b3	Fixed `get_word_inds` mistake/typo in P2P community pipeline (breaking code examples) (#5089 ) Fixed `get_word_inds` mistake/typo in P2P community pipeline The function `get_word_inds` was taking a string of text and either a word (str) or a word index (int) and returned the indices of token(s) the word would be encoded to. However, there was a typo, in which in the second `if` branch the word was checked to be a `str` again, not `int`, which resulted in an [example code from the docs](https://github.com/huggingface/diffusers/tree/main/examples/community#prompt2prompt-pipeline) to result in an error	2023-09-19 11:34:49 +02:00
Ruoxi	16b9a57d29	Implement `CustomDiffusionAttnProcessor2_0`. (#4604 ) * Implement `CustomDiffusionAttnProcessor2_0` * Doc-strings and type annotations for `CustomDiffusionAttnProcessor2_0`. (#1) * Update attnprocessor.md * Update attention_processor.py * Interops for `CustomDiffusionAttnProcessor2_0`. * Formatted `attention_processor.py`. * Formatted doc-string in `attention_processor.py` * Conditional CustomDiffusion2_0 for training example. * Remove unnecessary reference impl in comments. * Fix `save_attn_procs`.	2023-09-18 14:49:00 +02:00
Lee Dong Joo	b089102a8e	fix guidance_rescale docstring (#5063 )	2023-09-18 13:39:12 +02:00
Kashif Rasul	73bb97adfc	[LoRA] fix typo in attention_processor.py (#5066 ) * [LoRA] fix typo in attention_processor.py fixes #5062 * make style * make fix-copies, logger comented for torch compile	2023-09-16 14:43:18 +02:00
Sayak Paul	38a664a3d6	fix: validation_image arg (#5053 )	2023-09-15 12:20:50 +01:00
Gang Wu	9f40d7970e	[FIX BUG] type of args in train_instruct_pix2pix_sdxl.py (#4955 )	2023-09-15 12:53:07 +02:00
dotieuthien	941473a12f	Fix import in examples (#5048 ) * convert tensorrt controlnet * Fix code quality * Fix code quality * Fix code quality * Fix code quality * Fix code quality * Fix code quality * Fix number controlnet condition * Add convert SD XL to onnx * Add convert SD XL to tensorrt * Add convert SD XL to tensorrt * Add examples in comments * Add examples in comments * Add test onnx controlnet * Add tensorrt test * Remove copied * Move file test to examples/community * Remove script * Remove script * Remove text * Fix import --------- Co-authored-by: dotieuthien <thien.do@mservice.com.vn> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-15 12:48:06 +02:00
YiYi Xu	e70cb1243f	[WIP] adding Kandinsky training scripts (#4890 ) * Add files via upload Co-authored-by: Shahmatov Arseniy <62886550+cene555@users.noreply.github.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-09-14 06:58:20 -10:00
Patrick von Platen	342c5c02c0	[Release 0.21] Bump version (#5018 ) * [Release 0.21] Bump version * fix & remove * fix more * fix all, upload	2023-09-14 18:28:57 +02:00
UmerHA	169fc4add5	Add Prompt2Prompt pipeline (#4563 ) * Initial commit P2P * Replaced CrossAttention, added test skeleton * bug fixes * Updated docstring * Removed unused function * Created tests * improved tests - made fast inference tests faster - corrected image shape assertions * Corrected expected output shape in tests * small fix: test inputs * Update tests - used conditional unet2d - set expected image slices - edit_kwargs are now not popped, so pipe can be run multiple times * Fixed bug in int tests * Fixed tests * Linting * Create prompt2prompt.md * Added to docs toc * Ran make fix-copies * Fixed code blocks in docs * Using same interface as StableDiffusionPipeline * Fixed small test bug * Added all options SDPipeline.__call_ has * Fixed docstring; made __call__ like in SD * Linting * Added test for multiple prompts * Improved docs * Incorporated feedback * Reverted formatting on unrelated files * Moved prompt2prompt to community - Moved prompt2prompt pipeline from main to community - Deleted tests - Moved documentation to community and shorted it * Update src/diffusers/utils/dummy_torch_and_transformers_objects.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-14 16:39:59 +02:00
Patrick von Platen	1037287e2b	examples fix t2i training (#5001 ) * examples fix t2i training * make style	2023-09-12 23:52:41 +02:00
Kashif Rasul	73bf620dec	fix E721 Do not compare types, use `isinstance()` (#4992 )	2023-09-12 16:52:25 +02:00
Dhruv Nair	b6e0b016ce	Lazy Import for Diffusers (#4829 ) * initial commit * move modules to import struct * add dummy objects and _LazyModule * add lazy import to schedulers * clean up unused imports * lazy import on models module * lazy import for schedulers module * add lazy import to pipelines module * lazy import altdiffusion * lazy import audio diffusion * lazy import audioldm * lazy import consistency model * lazy import controlnet * lazy import dance diffusion ddim ddpm * lazy import deepfloyd * lazy import kandinksy * lazy imports * lazy import semantic diffusion * lazy imports * lazy import stable diffusion * move sd output to its own module * clean up * lazy import t2iadapter * lazy import unclip * lazy import versatile and vq diffsuion * lazy import vq diffusion * helper to fetch objects from modules * lazy import sdxl * lazy import txt2vid * lazy import stochastic karras * fix model imports * fix bug * lazy import * clean up * clean up * fixes for tests * fixes for tests * clean up * remove import of torch_utils from utils module * clean up * clean up * fix mistake import statement * dedicated modules for exporting and loading * remove testing utils from utils module * fixes from merge conflicts * Update src/diffusers/pipelines/kandinsky2_2/__init__.py * fix docs * fix alt diffusion copied from * fix check dummies * fix more docs * remove accelerate import from utils module * add type checking * make style * fix check dummies * remove torch import from xformers check * clean up error message * fixes after upstream merges * dummy objects fix * fix tests * remove unused module import --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-11 09:56:22 +02:00
Will Berman	d73e6ad050	guard save model hooks to only execute on main process (#4929 )	2023-09-08 10:30:06 -07:00
Sayak Paul	d0cf681a1f	[Tests] add: tests for t2i adapter training. (#4947 ) add: tests for t2i adapter training.	2023-09-08 19:45:39 +05:30
Suraj Patil	dfec61f4b3	[examples] T2IAdapter training script (#4934 ) * add t2i_example script * remove in channels logic * remove comments * remove use_euler arg * add requirements * only use canny example * use datasets * comments * make log_validation consistent with other scripts * add readme * fix title in readme * update check_min_version * change a few minor things. * add doc entry * add: test for t2i adapter training * remove use_auth_token * fix: logged info. * remove tests for now. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-08 10:03:02 +05:30
Bagheera	cfdfcf2018	Add --vae_precision option to the SDXL pix2pix script so that we have… (#4881 ) * Add --vae_precision option to the SDXL pix2pix script so that we have the option of avoiding float32 overhead * style --------- Co-authored-by: bghira <bghira@users.github.com>	2023-09-05 09:04:06 +02:00
Isamu Isozaki	3201903d94	Retrieval Augmented Diffusion Models (#3297 ) * Resetting rdm pr * Fixed styles * Fixed style * Moved to rdm folder+fixed slight errors * Removed config diff * Started adding tests * Adding retrieved images * Fixed faiss import * Fixed import errors * Fixing tests * Added require_faiss * Updated dependency table * Attempt solving consistency test * Fixed truncation and vocab size issue * Passed common tests * Finished up cpu testing on pipeline * Passed all tests locally * Removed some slow tests * Removed diffs from test_pipeline_common * Remove logs * Removed diffs from test_pipelines_common * Fixed style * Fully fixed styles on diffs * Fixed name * Proper rename * Fixed dummies * Fixed issue with dummyonnx * Fixed black style * Fixed dummies * Changed ordering * Fixed logging * Fixing * Fixing * quality * Debugging regex * Fix dummies with guess * Fixed typo * Attempt fix dummies * black * ruff * fixed ordering * Logging * Attempt fix * Attempt fix dummy * Attempt fixing styles * Fixed faiss dependency * Removed unnecessary deprecations * Finished up main changes * Added doc * Passed tests * Fixed tests * Remove invisible watermark * Fixed ruff errors * Added prompt embed to tests * Added tests and made retriever an optional component * Fixed styles * Made faiss a dependency of pipeline * Logging * Fixed dummies * Make pipeline test work * Fixed style * Moved to research projects * Remove diff * Fixed style error --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-04 09:42:04 +02:00
Yukun Huang	85b3f08c26	Fix potential type mismatch errors in SDXL pipelines (#4796 ) * Fix potential type conversion errors in SDXL pipelines * make sure vae stays in fp16 --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-31 09:22:18 +02:00
Patrick von Platen	d1e20be664	make style	2023-08-30 14:13:14 +02:00
Anatoly Belikov	af3854d6ad	sketch inpaint from a1111 for non-inpaint models (#4824 ) * Create masked_stable_diffusion_img2img.py * add MaskedIm2ImPipeline to readme * Update README.md	2023-08-30 09:51:28 +02:00
Mario Namtao Shianti Larcher	87ae330056	[Examples] Save SDXL LoRA weights with chosen precision (#4791 ) * Increase min accelerate ver to avoid OOM when mixed precision * Rm re-instantiation of VAE * Rm casting to float32 * Del unused models and free GPU * Fix style	2023-08-28 13:57:40 +05:30
Patrick von Platen	1b46c66132	make style	2023-08-28 07:17:21 +00:00
Yead	031358988b	Fix save_path bug in textual inversion training script (#4710 ) * Update textual_inversion.py fixed safe_path bug in textual inversion training * Update test_examples.py update test_textual_inversion for updating saved file's name * Update textual_inversion.py fixed some formatting issues --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-28 09:17:08 +02:00

1 2 3 4 5 ...

610 Commits