diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Linoy Tsaban	3003ff4947	[bug fix] fix small bug in readme template of sdxl lora training script (#5914 ) readme improvement and metadata fix	2023-11-23 19:08:49 +01:00
Linoy Tsaban	5ffa603244	[bug fix] fix small bug in readme template of sdxl lora training script (#5906 ) * readme bug fix * style fix --------- Co-authored-by: Linoy Tsaban <linoy@huggingface.co>	2023-11-23 12:11:50 +01:00
Linoy Tsaban	0eeee618cf	Adds an advanced version of the SD-XL DreamBooth LoRA training script supporting pivotal tuning (#5883 ) * sdxl dreambooth lora training script with pivotal tuning * bug fix - args missing from parse_args * code quality fixes * comment unnecessary code from TokenEmbedding handler class * fixup --------- Co-authored-by: Linoy Tsaban <linoy@huggingface.co>	2023-11-22 16:27:56 +01:00
Andrés Romero	93f1a14cab	ControlNet+Adapter pipeline, and ControlNet+Adapter+Inpaint pipeline (#5869 ) * ControlNet+Adapter pipeline, and +Inpaint pipeline --------- Co-authored-by: andres <andres@hax.ai>	2023-11-21 08:59:29 -10:00
Patrick von Platen	13d73d9303	[Lora] Seperate logic (#5809 ) * [Lora] Seperate logic * [Lora] Seperate logic * [Lora] Seperate logic * add comments to explain the code better * add comments to explain the code better	2023-11-21 18:58:37 +01:00
Linoy Tsaban	6fac1369d0	Add features to the Dreambooth LoRA SDXL training script (#5508 ) * Additions: - support for different lr for text encoder - support for Prodigy optimizer - support for min snr gamma - support for custom captions and dataset loading from the hub * adjusted --caption_column behaviour (to -not- use the second column of the dataset by default if --caption_column is not provided) * fixed --output_dir / --model_dir_name confusion * added --repeats, --adam_weight_decay_text_encoder + some fixes * Update examples/dreambooth/train_dreambooth_lora_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * - import compute_snr from diffusers/training_utils.py - cluster adamw together - when using 'prodigy', if --train_text_encoder == True and --text_encoder_lr != --learning rate, changes the lr of the text encoders optimization params to be --learning_rate (otherwise errors) * shape fixes when custom captions are used * formatting and a little cleanup * code styling * --repeats default value fixed, changed to 1 * bug fix - removed redundant lines of embedding concatenation when using prior_preservation (that duplicated class_prompt embeddings) * changed dataset loading logic according to the following usecases (to avoid unnecessary dependency on datasets)- 1. user provides --dataset_name 2. user provides local dir --instance_data_dir that contains a metadata .jsonl file 3. user provides local dir --instance_data_dir that contains only images in cases [1,2] we import datasets and use load_dataset method, in case [3] we process the data same as in the original script setting * styling fix * arg name fix * adjusted the --repeats logic * -removed redundant arg and 'if' when loading local folder with prompts -updated readme template -some default val fixes -custom caption tests * image path fix for readme * code style * bug fix * --caption_column arg * readme fix --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Linoy Tsaban <linoy@huggingface.co>	2023-11-21 17:38:43 +01:00
co63oc	ee519cfef5	Update README.md (#5855 )	2023-11-21 11:56:13 +01:00
Patrick von Platen	3303aec5f8	make style	2023-11-20 12:54:52 +01:00
ginjia	4abbbff618	fix an issue that ipex occupy too much memory, it will not impact per… (#5625 ) * fix an issue that ipex occupy too much memory, it will not impact performance * make style --------- Co-authored-by: root <jun.chen@intel.com> Co-authored-by: Meng Guoqing <guoqing.meng@intel.com>	2023-11-20 12:43:29 +01:00
Aryan V S	3ab921166d	[Community] [WIP] LCM Interpolation Pipeline (#5767 ) * wip: add interpolate pipeline for lcm * update documentation * update documentation	2023-11-20 12:18:17 +01:00
Kashif Rasul	6b04d61cf6	[Styling] stylify using ruff (#5841 ) * ruff format * not need to use doc-builder's black styling as the doc is styled in ruff * make fix-copies * comment * use run_ruff	2023-11-20 11:48:34 +01:00
Lucain	c896b841e4	Set `usedforsecurity=False` in hashlib methods (FIPS compliance) (#5790 ) * Set usedforsecurity=False in hashlib methods (FIPS compliance) * update version dependency * bump hfh version * bump hfh version	2023-11-17 14:56:58 +01:00
MilkClouds	3517fb9430	fix: enabled num_images_per_prompt>1 for lpw_stable_diffusion_xl (community pipeline) (#5807 ) * fix: enabled num_images_per_prompt>1 for lpw_stable_diffusion_xl * style: fixed isort	2023-11-15 07:40:29 -10:00
Kadir Nar	c7260ce253	🔧 Fix import codes in diffusers library (#5792 ) * 🔧 Fix import codes in diffusers library * ✨Refactor imports in community examples	2023-11-14 11:37:59 -08:00
Sayak Paul	ded93f798c	[Refactor] refactor `loaders.py` to make it cleaner and leaner. (#5771 ) * refactor loaders.py to make it cleaner and leaner. * refactor loaders init * inits. * textual inversion to the init. * inits. * remove certain modules from the main init. * AttnProcsLayers * fix imports * avoid circular import. * fix circular import pt 2. * address PR comments * imports * fix: imports. * remove from main init for avoiding circular deps. * remove spurious deps. * fix-copies. * fix imports. * more debug * more debug * Apply suggestions from code review * Apply suggestions from code review --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-11-14 12:54:28 +01:00
co63oc	069123f66e	Update checkpoint_merger.py (#5780 )	2023-11-14 10:46:11 +01:00
Long(Tony) Lian	5b231aa38b	Fix the pipeline name in the examples for LMD+ pipeline. Add a colab link to pipeline README. (#5775 ) * Fix the pipeline name in the examples for LMD+ pipeline * Add LMD+ colab link * Apply code formatting --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-11-14 07:43:37 +05:30
Thuan H. Nguyen	8fcd52febb	Correct code for distributed training of RealFill (#5740 ) Correct code for distributed training	2023-11-13 19:01:15 +01:00
Nicolas Hug	0488810f61	Fix realfill example compatibility with latest torchvision version (#5736 )	2023-11-13 18:55:17 +01:00
Patrick von Platen	ef7787ea59	make style	2023-11-13 18:54:53 +01:00
Jianqi Pan	1ce4b5f3e3	fix: fix forward function signature of controlnet reference_only pipeline example (#5717 ) fix: ignore other args Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-11-13 18:54:21 +01:00
Younes Belkada	8789d0b6c7	fix styling issues on main (#5754 ) fix styling issues	2023-11-13 20:05:15 +05:30
Long(Tony) Lian	b1fbef544c	Add LLM-grounded Diffusion (LMD+) pipeline (#5634 ) * Add LLM-grounded Diffusion (LMD+) pipeline * Update the formatting * Applied formatting	2023-11-11 12:51:05 +05:30
Sayak Paul	1477865e48	post release v0.23.0 (#5730 ) * post release * fix: variant test * up * fix: test	2023-11-10 16:35:44 +05:30
aihao	1f87f83e68	add load_datasete data_dir parameter (#5747 )	2023-11-09 23:03:14 -10:00
Suraj Patil	db2d8e76f8	Add LCM Scripts (#5727 ) * add lcm scripts * Co-authored-by: dgu8957@gmail.com	2023-11-09 17:29:12 +01:00
Sayak Paul	64603389da	post release (v0.22.0) (#5658 ) post release	2023-11-06 16:23:38 +01:00
Patrick von Platen	4f2bf67355	Revert "Fix the order of width and height of original size in SDXL training script" (#5614 ) Revert "Fix the order of width and height of original size in SDXL training script (#5382)" This reverts commit `45db049973`.	2023-11-01 22:04:47 +01:00
M. Tolga Cangöz	442017ccc8	[Docs] Fix typos (#5583 ) * Add Copyright info * Fix typos, improve, update * Update deepfloyd_if.md * Update ldm3d_diffusion.md * Update opt_overview.md	2023-10-31 10:04:08 -07:00
YiYi Xu	ce9484b139	fix a mistake in text2image training script for kandinsky2.2 (#5244 ) fix Co-authored-by: yiyixuxu <yixu@Yis-MacBook-Pro.local>	2023-10-30 23:06:16 -10:00
Jincheng Miao	ed00ead345	[Community Pipelines] add textual inversion support for stable_diffusion_ipex (#5571 )	2023-10-31 11:54:16 +05:30
Thuan H. Nguyen	5b087e82d1	Add realfill (#5456 ) * Add realfill * Move realfill folder * Fix some format issues	2023-10-30 15:21:40 +01:00
jiaqiw09	e140c0562e	fix error reported 'find_unused_parameters' running in mutiple GPUs (#5355 ) * fix error reported 'find_unused_parameters' running in mutiple GPUs or NPUs * fix code check of importing module by its alphabetic order --------- Co-authored-by: jiaqiw <wangjiaqi50@huawei.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2023-10-27 22:49:14 +05:30
nickkolok	0d4b459be6	Update train_dreambooth.py - fix typos (#5539 )	2023-10-26 13:35:05 -07:00
Ran Ran	8959c5b9de	Add from_pt flag to enable model from PT (#5501 ) * Add from_pt flag to enable model from PT * Format the file * Reformat the file	2023-10-25 23:07:34 +02:00
Patrick von Platen	d420d71398	make style	2023-10-25 16:12:14 +02:00
Logan	a1fad8286f	Add a new community pipeline (#5477 ) * Add a new community pipeline examples/community/latent_consistency_img2img.py which can be called like this import torch from diffusers import DiffusionPipeline pipe = DiffusionPipeline.from_pretrained( "SimianLuo/LCM_Dreamshaper_v7", custom_pipeline="latent_consistency_txt2img", custom_revision="main") # To save GPU memory, torch.float16 can be used, but it may compromise image quality. pipe.to(torch_device="cuda", torch_dtype=torch.float32) img2img=LatentConsistencyModelPipeline_img2img( vae=pipe.vae, text_encoder=pipe.text_encoder, tokenizer=pipe.tokenizer, unet=pipe.unet, #scheduler=pipe.scheduler, scheduler=None, safety_checker=None, feature_extractor=pipe.feature_extractor, requires_safety_checker=False, ) img = Image.open("thisismyimage.png") result = img2img(prompt,img,strength,num_inference_steps=4) * Apply suggestions from code review Fix name formatting for scheduler Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * update readme (and run formatter on latent_consistency_img2img.py) --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-10-25 16:11:56 +02:00
Patrick von Platen	1ade42f729	make style	2023-10-23 19:43:54 +02:00
Shyam Marjit	677df5ac12	fixed SDXL text encoder training bug #5016 (#5078 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-10-23 19:43:43 +02:00
Andrei Filatov	16851efa0f	Update README.md (#5497 ) Right now, only "main" branch has this community pipeline code. So, adding it manually into pipeline	2023-10-23 18:57:43 +02:00
linjiapro	45db049973	Fix the order of width and height of original size in SDXL training script (#5382 ) * wip * wip --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-10-18 22:38:52 +02:00
Liang Hou	a68f5062fb	style(sdxl): remove identity assignments (#5418 )	2023-10-18 22:38:20 +02:00
Patrick von Platen	85dccab7fd	Add latent consistency (#5438 ) * Add latent consistency * Update examples/community/README.md * Add latent consistency * make fix copies * Apply suggestions from code review	2023-10-18 14:18:31 +02:00
Susheel Thapa	324d18fba2	Chore: Typo fixed in multiple files (#5422 )	2023-10-17 08:17:03 -07:00
Sayak Paul	cc12f3ec92	[Examples] Update with HFApi (#5393 ) * update training examples to use HFAPI. * update training example. * reflect the changes in the korean version too. * Empty-Commit	2023-10-16 19:34:46 +05:30
Heinz-Alexander Fuetterer	0ea78f9707	chore: fix typos (#5386 ) * chore: fix typos * Update src/diffusers/pipelines/shap_e/renderer.py Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com> --------- Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>	2023-10-16 15:23:37 +02:00
Kashif Rasul	d03c9099bc	[Wuerstchen] text to image training script (#5052 ) * initial script * formatting * prior trainer wip * add efficient_net_encoder * add CLIPTextModel * add prior ema support * optimizer * fix typo * add dataloader * prompt_embeds and image_embeds * intial training loop * fix output_dir * fix add_noise * accelerator check * make effnet_transforms dynamic * fix training loop * add validation logging * use loaded text_encoder * use PreTrainedTokenizerFast * load weigth from pickle * save_model_card * remove unused file * fix typos * save prior pipeilne in its own folder * fix imports * fix pipe_t2i * scale image_embeds * remove snr_gamma * format * initial lora prior training * log_validation and save * initial gradient working * remove save/load hooks * set set_attn_processor on prior_prior * add lora script * typos * use LoraLoaderMixin for prior pipeline * fix usage * make fix-copies * yse repo_id * write_lora_layers is a staitcmethod * use defualts * fix defaults * undo * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_prior.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/loaders.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/loaders.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/modeling_wuerstchen_prior.py * Update src/diffusers/loaders.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/loaders.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * add graident checkpoint support to prior * gradient_checkpointing * formatting * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/train_text_to_image_lora_prior.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update src/diffusers/loaders.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/wuerstchen/text_to_image/train_text_to_image_prior.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * use default unet and text_encoder * fix test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-10-16 15:00:33 +02:00
Sayak Paul	93df5bb670	[Examples] fix unconditioning generation training example for mixed-precision training (#5407 ) * fix: unconditional generation example * fix: float in loss. * apply styling.	2023-10-16 14:11:35 +05:30
Sayak Paul	0fa32bd674	[Examples] use loralinear instead of depecrecated lora attn procs. (#5331 ) * use loralinear instead of depecrecated lora attn procs. * fix parameters() * fix saving * add back support for add kv proj. * fix: param accumul,ation. * propagate the changes.	2023-10-11 13:02:42 +02:00
ssusie	aea73834f6	Adding PyTorch XLA support for sdxl inference (#5273 ) * Added mark_step for sdxl to run with pytorch xla. Also updated README with instructions for xla * adding soft dependency on torch_xla * fix some styling --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2023-10-11 12:35:37 +02:00

1 2 3 4 5 ...

654 Commits