diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
ちくわぶ	896fb6d8d7	Fix duplicate variable assignments in SD3's JointAttnProcessor (#8516 ) * Fix duplicate variable assignments. * Fix duplicate variable assignments.	2024-06-12 21:52:35 -10:00
Beinsezii	7f51f286a5	Add Hunyuan AutoPipe mapping (#8505 )	2024-06-12 16:11:55 -10:00
kkj15dk	829f6defa4	Fix spelling in scheduling_flow_match_euler_discrete.py (#8497 ) Update scheduling_flow_match_euler_discrete.py Spelling: Foward -> Forward Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-06-12 12:37:47 -10:00
Beinsezii	24bdf4b215	Add SD3 AutoPipeline mappings (#8489 )	2024-06-12 12:31:36 -10:00
Sayak Paul	6cf0be5d3d	fix warning log for Transformer SD3 (#8496 ) fix warning log	2024-06-12 12:25:18 -10:00
Sayak Paul	ec068f9b5b	fix dual transformer2d import (#8491 ) fix	2024-06-12 21:10:27 +01:00
Dhruv Nair	04717fd861	Add Stable Diffusion 3 (#8483 ) * up * add sd3 * update * update * add tests * fix copies * fix docs * update * add dreambooth lora * add LoRA * update * update * update * update * import fix * update * Update src/diffusers/pipelines/stable_diffusion_3/pipeline_stable_diffusion_3.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * import fix 2 * update * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * update * update * update * fix ckpt id * fix more ids * update * missing doc * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update 
docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update' * fix * update * Update src/diffusers/models/autoencoders/autoencoder_kl.py * Update src/diffusers/models/autoencoders/autoencoder_kl.py * note on gated access. * requirements * licensing --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-06-12 20:44:00 +01:00
Greg Hunkins	1066fe4cbc	🤫 Quiet IP Adapter Mask Warning (#8475 ) * quiet attn parameters * fix lint * make style && make quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-12 16:50:13 +01:00
Sayak Paul	d38f69ea25	change max_shard_size to 10GB (#8445 ) * change max_shard_size to 10GB * add notes to the documentation * Update src/diffusers/models/modeling_utils.py Co-authored-by: Lucain <lucainp@gmail.com> * change to abs limit --------- Co-authored-by: Lucain <lucainp@gmail.com>	2024-06-12 13:49:13 +01:00
Patrick	0a1c13af79	image_processor.py: Fixed an error in ValueError's message (#8447 ) * image_processor.py: Fixed an error in ValueError's message , as the string's join method tried to join types, instead of strings Bug that occurred: f"Input is in incorrect format. Currently, we only support {', '.join(supported_formats)}" TypeError: sequence item 0: expected str instance, type found * Fixed: C417 Unnecessary `map` usage (rewrite using a generator expression) --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-11 08:09:24 -10:00
YiYi Xu	0028c34432	fix SEGA pipeline (#8467 ) * fix * style --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-11 06:37:49 -10:00
Jianqi Pan	1d9a6a81b9	🔧 chore: use modeling_outputs.Transformer2DModelOutput (#8436 ) * 🔧 chore: use modeling_outputs.Transformer2DModelOutput * 🔧 chore: isort * 🔧 chore: isort * style --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2024-06-10 12:11:41 +01:00
Lucain	0d68ddf327	Move away from `cached_download` (#8419 ) * Move away from * unused constant * Add custom error	2024-06-07 15:43:00 +05:30
Sayak Paul	7d887118b9	[Core] support saving and loading of sharded checkpoints (#7830 ) * feat: support saving a model in sharded checkpoints. * feat: make loading of sharded checkpoints work. * add tests * cleanse the loading logic a bit more. * more resilience while loading from the Hub. * parallelize shard downloads by using snapshot_download()/ * default to a shard size. * more fix * Empty-Commit * debug * fix * uality * more debugging * fix more * initial comments from Benjamin * move certain methods to loading_utils * add test to check if the correct number of shards are present. * add a test to check if loading of sharded checkpoints from the Hub is okay * clarify the unit when passed as an int. * use hf_hub for sharding. * remove unnecessary code * remove unnecessary function * lucain's comments. * fixes * address high-level comments. * fix test * subfolder shenanigans./ * Update src/diffusers/utils/hub_utils.py Co-authored-by: Lucain <lucainp@gmail.com> * Apply suggestions from code review Co-authored-by: Lucain <lucainp@gmail.com> * remove _huggingface_hub_version as not needed. * address more feedback. * add a test for local_files_only=True/ * need hf hub to be at least 0.23.2 * style * final comment. * clean up subfolder. * deal with suffixes in code. * _add_variant default. * use weights_name_pattern * remove add_suffix_keyword * clean up downloading of sharded ckpts. * don't return something special when using index.json * fix more * don't use bare except * remove comments and catch the errors better * fix a couple of things when using is_file() * empty --------- Co-authored-by: Lucain <lucainp@gmail.com>	2024-06-07 14:49:10 +05:30
Sayak Paul	a3faf3f260	[Core] fix: legacy model mapping (#8416 ) * fix: legacy model mapping * remove print	2024-06-06 20:35:05 +05:30
Tolga Cangöz	98730c5dd7	Errata (#8322 ) * Fix typos * Trim trailing whitespaces * Remove a trailing whitespace * chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0 * Revert "chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0" This reverts commit `fd742b30b4`. * pokemon -> naruto * `DPMSolverMultistep` -> `DPMSolverMultistepScheduler` * Improve Markdown stylization * Improve style * Improve style * Refactor pipeline variable names for consistency * up style	2024-06-05 13:59:09 -07:00
Sayak Paul	48207d6689	[Scheduler] fix: EDM schedulers when using the exp sigma schedule. (#8385 ) * fix: euledm when using the exp sigma schedule. * fix-copies * remove print. * reduce friction * yiyi's suggestioms	2024-06-04 19:31:43 -10:00
Sayak Paul	2f6f426f66	[Hunyuan] allow Hunyuan DiT to run under 6GB for GPU VRAM (#8399 ) * allow hunyuan dit to run under 6GB for GPU VRAM * add section in the docs/	2024-06-05 08:24:19 +04:00
Sayak Paul	a0542c1917	[LoRA] Remove legacy LoRA code and related adjustments (#8316 ) * remove legacy code from load_attn_procs. * finish first draft * fix more. * fix more * add test * add serialization support. * fix-copies * require peft backend for lora tests * style * fix test * fix loading. * empty * address benjamin's feedback.	2024-06-05 08:15:30 +04:00
Sayak Paul	a8ad6664c2	[Hunyuan] feat: support chunked ff. (#8397 ) feat: support chunked ff.	2024-06-05 08:12:18 +04:00
Sayak Paul	14f7b545bd	[Hunyuan DiT] feat: enable fusing qkv projections when doing attention (#8396 ) * feat: introduce qkv fusion for Hunyuan * fix copies	2024-06-05 07:58:03 +04:00
leaps	07cd20041c	Update code example in pipeline_stable_unclip_img2img.py EXAMPLE_DOC_STRING (#8401 ) Update code example in pipeline_stable_unclip_img2img.py Previous code caused an error when run	2024-06-04 17:22:46 -10:00
Sayak Paul	6ddbf6222c	[Transformer2DModel] Handle `norm_type` safely while remapping (#8370 ) * handle norm_type of transformer2d_model safely. * log an info when old model class is being returned. * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * remove extra stuff --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-06-04 13:39:19 +04:00
Sayak Paul	3ff39e8e86	[HunyuanDiT] minor docs changes in hunyuandit (#8395 ) minor docs changes in hunyuandit	2024-06-04 12:18:53 +04:00
townwish4git	6be43bd855	Fix AsymmetricAutoencoderKL forward (#8378 )	2024-06-03 17:25:11 -10:00
XCL	413604405f	Tencent Hunyuan Team: add HunyuanDiT related updates (#8240 ) * Hunyuan Team: add HunyuanDiT related updates --------- Co-authored-by: XCLiu <liuxc1996@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-06-01 12:41:21 -10:00
39th president of the United States, probably	bc108e1533	Fix DREAM training (#8302 ) Co-authored-by: Jimmy <39@🇺🇸.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-06-01 11:27:57 +04:00
Sayak Paul	983dec3bf7	[Core] Introduce class variants for `Transformer2DModel` (#7647 ) * init for patches * finish patched model. * continuous transformer * vectorized transformer2d. * style. * inits. * fix-copies. * introduce DiTTransformer2DModel. * fixes * use REMAPPING as suggested by @DN6 * better logging. * add pixart transformer model. * inits. * caption_channels. * attention masking. * fix use_additional_conditions. * remove print. * debug * flatten * fix: assertion for sigma * handle remapping for modeling_utils * add tests for dit transformer2d * quality * placeholder for pixart tests * pixart tests * add _no_split_modules * add docs. * check * check * check * check * fix tests * fix tests * move Transformer output to modeling_output * move errors better and bring back use_additional_conditions attribute. * add unnecessary things from DiT. * clean up pixart * fix remapping * fix device_map things in pixart2d. * replace Transformer2DModel with appropriate classes in dit, pixart tests * empty * legacy mixin classes./ * use a remapping dict for fetching class names. * change to specifc model types in the pipeline implementations. * move _fetch_remapped_cls_from_config to modeling_loading_utils.py * fix dependency problems. * add deprecation note.	2024-05-31 13:40:27 +05:30
Dhruv Nair	f9fa8a868c	Change checkpoint key used to identify CLIP models in single file checkpoints (#8319 ) update	2024-05-31 11:20:31 +05:30
Jonah	05be622b1c	Fix depth pipeline "input/weight type should be the same" error at fp16 (#8321 ) Fix "input/weight type should be the same" Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-05-30 13:59:49 -10:00
Dhruv Nair	42cae93b94	Fix StableDiffusionPipeline when `text_encoder=None` (#8297 ) * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-29 09:00:51 -10:00
Tolga Cangöz	a2ecce26bc	Fix Copying Mechanism typo/bug (#8232 ) * Fix copying mechanism typos * fix copying mecha * Revert, since they are in TODO * Fix copying mechanism	2024-05-29 09:37:18 -07:00
Tolga Cangöz	f4a44b7707	Simplify `platform_info` assignment in `diffusers-cli env` (#8298 ) chore: Simplify `platform_info` assignment	2024-05-29 17:57:42 +05:30
Sayak Paul	581d8aacf7	post release v0.28.0 (#8286 ) * post release v0.28.0 * style	2024-05-29 07:13:22 +05:30
Sayak Paul	ba1bfac20b	[Core] Refactor `IPAdapterPlusImageProjection` a bit (#7994 ) * use IPAdapterPlusImageProjectionBlock in IPAdapterPlusImageProjection * reposition IPAdapterPlusImageProjection * refactor complete? * fix heads param retrieval. * update test dict creation method.	2024-05-29 06:30:47 +05:30
Sayak Paul	5edd0b34fa	move `vqmodel` to `models.autoencoders`. (#8292 ) move vqmodel to models.autoencoders.	2024-05-29 06:30:35 +05:30
Sayak Paul	3a28e36aa1	[Post release 0.28.0] remove deprecated blocks. (#8291 ) * remove deprecated blocks. * update the location paths.	2024-05-29 06:29:43 +05:30
Vladimir Mandic	3393c01c9d	fix pixart-sigma negative prompt handling (#8299 ) * fix negative prompt * fix --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-05-28 13:10:35 -10:00
Álvaro Somoza	b2030a249c	Fix object has no attribute 'flush' when using without a console (#8271 ) fix	2024-05-28 11:19:01 -10:00
Sayak Paul	e6df8edadc	[LoRA] attempt at fixing onetrainer lora. (#8242 ) * attempt at fixing onetrainer lora. * fix	2024-05-28 08:25:54 -10:00
Álvaro Somoza	ba82414106	[docs] Add controlnet example to marigold (#8289 ) * initial doc * fix wrong LCM sentence * implement binary colormap without requiring matplotlib update section about Marigold for ControlNet update formatting of marigold_usage.md * fix indentation --------- Co-authored-by: anton <anton.obukhov@gmail.com>	2024-05-28 11:58:06 -04:00
Anton Obukhov	b3d10d6d65	[Pipeline] Marigold depth and normals estimation (#7847 ) * implement marigold depth and normals pipelines in diffusers core * remove bibtex * remove deprecations * remove save_memory argument * remove validate_vae * remove config output * remove batch_size autodetection * remove presets logic move default denoising_steps and processing_resolution into the model config make default ensemble_size 1 * remove no_grad * add fp16 to the example usage * implement is_matplotlib_available use is_matplotlib_available, is_scipy_available for conditional imports in the marigold depth pipeline * move colormap, visualize_depth, and visualize_normals into export_utils.py * make the denoising loop more lucid fix the outputs to always be 4d tensors or lists of pil images support a 4d input_image case attempt to support model_cpu_offload_seq move check_inputs into a separate function change default batch_size to 1, remove any logic to make it bigger implicitly * style * rename denoising_steps into num_inference_steps * rename input_image into image * rename input_latent into latents * remove decode_image change decode_prediction to use the AutoencoderKL.decode method * move clean_latent outside of progress_bar * refactor marigold-reusable image processing bits into MarigoldImageProcessor class * clean up the usage example docstring * make ensemble functions members of the pipelines * add early checks in check_inputs rename E into ensemble_size in depth ensembling * fix vae_scale_factor computation * better compatibility with torch.compile better variable naming * move export_depth_to_png to export_utils * remove encode_prediction * improve visualize_depth and visualize_normals to accept multi-dimensional data and lists remove visualization functions from the pipelines move exporting depth as 16-bit PNGs functionality from the depth pipeline update example docstrings * do not shortcut vae.config variables * change all asserts to raise ValueError * rename 
output_prediction_type to output_type * better variable names clean up variable deletion code * better variable names * pass desc and leave kwargs into the diffusers progress_bar implement nested progress bar for images and steps loops * implement scale_invariant and shift_invariant flags in the ensemble_depth function add scale_invariant and shift_invariant flags readout from the model config further refactor ensemble_depth support ensembling without alignment add ensemble_depth docstring * fix generator device placement checks * move encode_empty_text body into the pipeline call * minor empty text encoding simplifications * adjust pipelines' class docstrings to explain the added construction arguments * improve the scipy failure condition add comments improve docstrings change the default use_full_z_range to True * make input image values range check configurable in the preprocessor refactor load_image_canonical in preprocessor to reject unknown types and return the image in the expected 4D format of tensor and on right device support a list of everything as inputs to the pipeline, change type to PipelineImageInput implement a check that all input list elements have the same dimensions improve docstrings of pipeline outputs remove check_input pipeline argument * remove forgotten print * add prediction_type model config * add uncertainty visualization into export utils fix NaN values in normals uncertainties * change default of output_uncertainty to False better handle the case of an attempt to export or visualize none * fix `output_uncertainty=False` * remove kwargs fix check_inputs according to the new inputs of the pipeline * rename prepare_latent into prepare_latents as in other pipelines annotate prepare_latents in normals pipeline with "Copied from" annotate encode_image in normals pipeline with "Copied from" * move nested-capable `progress_bar` method into the pipelines revert the original `progress_bar` method in pipeline_utils * minor message improvement 
* fix cpu offloading * move colormap, visualize_depth, export_depth_to_16bit_png, visualize_normals, visualize_uncertainty to marigold_image_processing.py update example docstrings * fix missing comma * change torch.FloatTensor to torch.Tensor * fix importing of MarigoldImageProcessor * fix vae offloading fix batched image encoding remove separate encode_image function and use vae.encode instead * implement marigold's intial tests relax generator checks in line with other pipelines implement return_dict __call__ argument in line with other pipelines * fix num_images computation * remove MarigoldImageProcessor and outputs from import structure update tests * update docstrings * update init * update * style * fix * fix * up * up * up * add simple test * up * update expected np input/output to be channel last * move expand_tensor_or_array into the MarigoldImageProcessor * rewrite tests to follow conventions - hardcoded slices instead of image artifacts write more smoke tests * add basic docs. * add anton's contribution statement * remove todos. * fix assertion values for marigold depth slow tests * fix assertion values for depth normals. * remove print * support AutoencoderTiny in the pipelines * update documentation page add Available Pipelines section add Available Checkpoints section add warning about num_inference_steps * fix missing import in docstring fix wrong value in visualize_depth docstring * [doc] add marigold to pipelines overview * [doc] add section "usage examples" * fix an issue with latents check in the pipelines * add "Frame-by-frame Video Processing with Consistency" section * grammarly * replace tables with images with css-styled images (blindly) * style * print * fix the assertions. * take from the github runner. * take the slices from action artifacts * style. * update with the slices from the runner. * remove unnecessary code blocks. * Revert "[doc] add marigold to pipelines overview" This reverts commit a505165150afd8dab23c474d1a054ea505a56a5f. 
* remove invitation for new modalities * split out marigold usage examples * doc cleanup --------- Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2024-05-27 17:21:49 +05:30
Tolga Cangöz	db33af065b	Fix a grammatical error in the `raise` messages (#8272 ) Fix grammatical error	2024-05-24 11:15:00 -07:00
Lucain	edf5ba6a17	Respect `resume_download` deprecation V2 (#8267 ) * Fix resume_downoad FutureWarning * only resume download	2024-05-24 12:11:03 +02:00
Dhruv Nair	370146e4e0	Use `freedesktop_os_release()` in diffusers cli for Python >=3.10 (#8235 ) * update * update	2024-05-24 13:30:40 +05:30
Dhruv Nair	67b3fe0aae	Fix resize issue in SVD pipeline with VideoProcessor (#8229 ) update Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-23 11:57:34 +05:30
BootesVoid	509741aea7	fix: Attribute error in Logger object (logger.warning) (#8183 )	2024-05-22 12:29:11 +05:30
Steven Liu	fdb1baa05c	[docs] VideoProcessor (#7965 ) * fix? * fix? * fix	2024-05-21 08:18:21 +05:30
Vinh H. Pham	6529ee67ec	Make VAE compatible to torch.compile() (#7984 ) make VAE compatible to torch.compile() Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-05-20 13:43:59 -04:00
Sai-Suraj-27	df2bc5ef28	fix: Fixed few `docstrings` according to the Google Style Guide (#7717 ) Fixed few docstrings according to the Google Style Guide.	2024-05-20 10:26:05 -07:00

2292 Commits