diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
yiyixuxu	42c06e90f4	update doc	2025-06-23 17:55:32 +02:00
yiyixuxu	085ade03be	add doc (developer guide)	2025-06-23 16:12:31 +02:00
yiyixuxu	78d2454c7c	fix	2025-06-23 16:06:17 +02:00
yiyixuxu	19545fd3e1	update components manager __repr__	2025-06-22 12:59:19 +02:00
yiyixuxu	d12531ddf7	lora: only remove hooks that we add back	2025-06-22 12:32:04 +02:00
yiyixuxu	4751d456f2	shorten loop subblock name	2025-06-22 12:31:16 +02:00
yiyixuxu	083479c365	ordereddict -> insertableOrderedDict; make sure loader to method works	2025-06-21 04:28:10 +02:00
yiyixuxu	04c16d0a56	update	2025-06-21 04:25:12 +02:00
yiyixuxu	9e58856b7a	add __repr__ method for InsertableOrderedDict	2025-06-21 04:24:44 +02:00
yiyixuxu	45392cce11	update the description of StableDiffusionXLDenoiseLoopWrapper	2025-06-20 07:46:54 +02:00
yiyixuxu	8913d59bf3	add to method to modular loader, copied from DiffusionPipeline, not tested yet	2025-06-20 07:46:53 +02:00
yiyixuxu	5a8c1b5f19	add block mappings to modular_diffusers.stable_diffusion_xl.__init__	2025-06-20 07:46:53 +02:00
yiyixuxu	7ad01a6350	rename modular_pipeline_block_mappings.py to modular_block_mapping	2025-06-20 07:46:45 +02:00
YiYi Xu	a8e853b791	[modular diffusers] more refactor (#11235 ) * add componentspec and configspec * up * up * move methods to blocks * Modular Diffusers Guiders (#11311) * cfg; slg; pag; sdxl without controlnet * support sdxl controlnet * support controlnet union * update * update * cfg zero* * use unwrap_module for torch compiled modules * remove guider kwargs * remove commented code * remove old guider * fix slg bug * remove debug print * autoguidance * smoothed energy guidance * add note about seg * tangential cfg * cfg plus plus * support cfgpp in ddim * apply review suggestions * refactor * rename enable/disable * remove cfg++ for now * rename do_classifier_free_guidance->prepare_unconditional_embeds * remove unused * [modular diffusers] introducing ModularLoader (#11462) * cfg; slg; pag; sdxl without controlnet --------- Co-authored-by: Aryan <aryan@huggingface.co> * make loader optional * remove lora step and ip-adapter step -> no longer needed * rename pipeline -> components, data -> block_state * seperate controlnet step into input + denoise * refactor controlnet union * reefactor pipeline/block states so that it can dynamically accept kwargs * remove controlnet union denoise step, refactor & reuse controlnet denoisee step to accept aditional contrlnet kwargs * allow input_fields as input & update message * update input formating, consider kwarggs_type inputs with no name, e/g _controlnet_kwargs refactor the denoiseestep using LoopSequential! also add a new file for denoise step * change warning to debug * fix get_execusion blocks with loopsequential * fix auto denoise so all tests pass * update imports on guiders * remove modular reelated change from pipelines folder * made a modular_pipelines folder! * update __init__ * add notes * add block state will also make sure modifed intermediates_inputs will be updated * move block mappings to its own file * make inputs truly immutable, remove the output logic in sequential pipeline, and update so that intermediates_outputs are only new variables * decode block, if skip decoding do not need to update latent * fix imports * fix import * fix more * remove the output step * make generator intermediates (it is mutable) * after_denoise -> decoders * add a to-do for guider cconfig mixin * refactor component spec: replace create/create_from_pretrained/create_from_config to just create and load method * refactor modular loader: 1. load only load (pretrained components only if not specific names) 2. update acceept create spec 3. move the updte _componeent_spec logic outside register_components to each method that create/update the component: __init__/update/load * update components manager * up * [WIP] Modular Diffusers support custom code/pipeline blocks (#11539) * update * update * remove the duplicated components_manager file I forgot to deletee * fix import in block mapping * add a to-do for modular loader * prepare_latents_img2img pipeline method -> function, maybe do the same for others? * update input for loop blocks, do not need to include intermediate * solve merge conflict: manually add back the remote code change to modular_pipeline * add node_utils * modular node! * add * refator based on dhruv's feedbacks * update doc format for kwargs_type * up * updatee modular_pipeline.from_pretrained, modular_repo ->pretrained_model_name_or_path * save_pretrained for serializing config. (#11603) * save_pretrained for serializing config. * remove pushtohub * diffusers-cli rough --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-06-19 15:34:17 -10:00
YiYi Xu	6a509ba862	Merge branch 'main' into modular-diffusers	2025-04-30 17:56:25 -10:00
Yao Matrix	06beecafc5	make autoencoders. controlnet_flux and wan_transformer3d_single_file pass on xpu (#11461 ) * make autoencoders. controlnet_flux and wan_transformer3d_single_file pass on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * Apply style fixes --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Aryan <aryan@huggingface.co>	2025-05-01 02:43:31 +05:30
Vaibhav Kumawat	daf0a23958	Add LANCZOS as default interplotation mode. (#11463 ) * Add LANCZOS as default interplotation mode. * LANCZOS as default interplotation * LANCZOS as default interplotation mode * Added LANCZOS as default interplotation mode	2025-04-30 14:22:38 -04:00
tongyu	38ced7ee59	[test_models_transformer_hunyuan_video] help us test torch.compile() for impactful models (#11431 ) * Update test_models_transformer_hunyuan_video.py * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-04-30 19:11:42 +08:00
Yao Matrix	23c98025b3	make safe diffusion test cases pass on XPU and A100 (#11458 ) * make safe diffusion test cases pass on XPU and A100 Signed-off-by: Yao Matrix <matrix.yao@intel.com> * calibrate A100 expected values Signed-off-by: YAO Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com> Signed-off-by: YAO Matrix <matrix.yao@intel.com>	2025-04-30 16:05:28 +05:30
captainzz	8cd7426e56	Add StableDiffusion3InstructPix2PixPipeline (#11378 ) * upload StableDiffusion3InstructPix2PixPipeline * Move to community * Add readme * Fix images * remove images * Change image url * fix * Apply style fixes	2025-04-30 06:13:12 -04:00
Daniel Socek	fbce7aeb32	Add generic support for Intel Gaudi accelerator (hpu device) (#11328 ) * Add generic support for Intel Gaudi accelerator (hpu device) Signed-off-by: Daniel Socek <daniel.socek@intel.com> Co-authored-by: Libin Tang <libin.tang@intel.com> * Add loggers for generic HPU support Signed-off-by: Daniel Socek <daniel.socek@intel.com> * Refactor hpu support with is_hpu_available() logic Signed-off-by: Daniel Socek <daniel.socek@intel.com> * Fix style for hpu support update Signed-off-by: Daniel Socek <daniel.socek@intel.com> * Decouple soft HPU check from hard device validation to support HPU migration Signed-off-by: Daniel Socek <daniel.socek@intel.com> --------- Signed-off-by: Daniel Socek <daniel.socek@intel.com> Co-authored-by: Libin Tang <libin.tang@intel.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-04-30 14:45:02 +05:30
Yao Matrix	35fada4169	enable unidiffuser test cases on xpu (#11444 ) * enable unidiffuser cases on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix a typo Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-04-30 13:58:00 +05:30
Yao Matrix	fbe2fe5578	enable consistency test cases on XPU, all passed (#11446 ) Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-04-30 12:41:29 +05:30
Aryan	c86511586f	`torch.compile` fullgraph compatibility for Hunyuan Video (#11457 ) udpate	2025-04-30 11:21:17 +05:30
Yao Matrix	60892c55a4	enable marigold_intrinsics cases on XPU (#11445 ) Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-04-30 11:07:37 +05:30
Aryan	8fe5a14d9b	Raise warning instead of error for block offloading with streams (#11425 ) raise warning instead of error	2025-04-30 08:26:16 +05:30
Youlun Peng	58431f102c	Set LANCZOS as the default interpolation for image resizing in ControlNet training (#11449 ) Set LANCZOS as the default interpolation for image resizing	2025-04-29 08:47:02 -04:00
urpetkov-amd	4a9ab650aa	Fixing missing provider options argument (#11397 ) * Fixing missing provider options argument * Adding if else for provider options * Apply suggestions from code review Co-authored-by: YiYi Xu <yixu310@gmail.com> * Apply style fixes * Update src/diffusers/pipelines/onnx_utils.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/pipelines/onnx_utils.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: Uros Petkovic <urpektov@amd.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-28 10:23:05 -10:00
Linoy Tsaban	0ac1d5b482	[Hi-Dream LoRA] fix bug in validation (#11439 ) remove unnecessary pipeline moving to cpu in validation Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-04-28 06:22:32 -10:00
Yao Matrix	7567adfc45	enable 28 GGUF test cases on XPU (#11404 ) * enable gguf test cases on XPU Signed-off-by: YAO Matrix <matrix.yao@intel.com> * make SD35LargeGGUFSingleFileTests::test_pipeline_inference pas Signed-off-by: root <root@a4bf01945cfe.jf.intel.com> * make FluxControlLoRAGGUFTests::test_lora_loading pass Signed-off-by: Yao Matrix <matrix.yao@intel.com> * polish code Signed-off-by: Yao Matrix <matrix.yao@intel.com> * Apply style fixes --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Signed-off-by: root <root@a4bf01945cfe.jf.intel.com> Signed-off-by: Yao Matrix <matrix.yao@intel.com> Co-authored-by: root <root@a4bf01945cfe.jf.intel.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-28 21:32:04 +05:30
tongyu	3da98e7ee3	[train_text_to_image_lora] Better image interpolation in training scripts follow up (#11427 ) * Update train_text_to_image_lora.py * update_train_text_to_image_lora	2025-04-28 11:23:24 -04:00
tongyu	b3b04fefde	[train_text_to_image] Better image interpolation in training scripts follow up (#11426 ) * Update train_text_to_image.py * update	2025-04-28 10:50:33 -04:00
Sayak Paul	0e3f2713c2	[tests] fix import. (#11434 ) fix import.	2025-04-28 13:32:28 +08:00
Yao Matrix	a7e9f85e21	enable test_layerwise_casting_memory cases on XPU (#11406 ) * enable test_layerwise_casting_memory cases on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-04-28 06:38:39 +05:30
Yao Matrix	9ce89e2efa	enable group_offload cases and quanto cases on XPU (#11405 ) * enable group_offload cases and quanto cases on XPU Signed-off-by: YAO Matrix <matrix.yao@intel.com> * use backend APIs Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-04-28 06:37:16 +05:30
Sayak Paul	aa5f5d41d6	[tests] add tests to check for graph breaks, recompilation, cuda syncs in pipelines during torch.compile() (#11085 ) * test for better torch.compile stuff. * fixes * recompilation and graph break. * clear compilation cache. * change to modeling level test. * allow running compilation tests during nightlies.	2025-04-28 08:36:33 +08:00
Mert Erbak	bd96a084d3	[train_dreambooth_lora.py] Set LANCZOS as default interpolation mode for resizing (#11421 ) * Set LANCZOS as default interpolation mode for resizing * [train_dreambooth_lora.py] Set LANCZOS as default interpolation mode for resizing	2025-04-26 01:58:41 -04:00
co63oc	f00a995753	Fix typos in strings and comments (#11407 )	2025-04-24 08:53:47 -10:00
Ishan Modi	e8312e7ca9	[BUG] fixed WAN docstring (#11226 ) update	2025-04-24 08:49:37 -10:00
Emiliano	7986834572	Fix Flux IP adapter argument in the pipeline example (#11402 ) Fix Flux IP adapter argument in the example IP-Adapter example had a wrong argument. Fix `true_cfg` -> `true_cfg_scale`	2025-04-24 08:41:12 -10:00
Linoy Tsaban	edd7880418	[HiDream LoRA] optimizations + small updates (#11381 ) * 1. add pre-computation of prompt embeddings when custom prompts are used as well 2. save model card even if model is not pushed to hub 3. remove scheduler initialization from code example - not necessary anymore (it's now if the base model's config) 4. add skip_final_inference - to allow to run with validation, but skip the final loading of the pipeline with the lora weights to reduce memory reqs * pre encode validation prompt as well * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * pre encode validation prompt as well * Apply style fixes * empty commit * change default trained modules * empty commit * address comments + change encoding of validation prompt (before it was only pre-encoded if custom prompts are provided, but should be pre-encoded either way) * Apply style fixes * empty commit * fix validation_embeddings definition * fix final inference condition * fix pipeline deletion in last inference * Apply style fixes * empty commit * layers * remove readme remarks on only pre-computing when instance prompt is provided and change example to 3d icons * smol fix * empty commit --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-24 07:48:19 +03:00
Teriks	b4be42282d	Kolors additional pipelines, community contrib (#11372 ) * Kolors additional pipelines, community contrib --------- Co-authored-by: Teriks <Teriks@users.noreply.github.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-04-23 11:07:27 -10:00
Ishan Modi	a4f9c3cbc3	[Feature] Added Xlab Controlnet support (#11249 ) update	2025-04-23 10:43:50 -10:00
Ishan Dutta	4b60f4b602	[train_dreambooth_flux] Add LANCZOS as the default interpolation mode for image resizing (#11395 )	2025-04-23 10:47:05 -04:00
Aryan	6cef71de3a	Fix group offloading with block_level and use_stream=True (#11375 ) * fix * add tests * add message check	2025-04-23 18:17:53 +05:30
Ameer Azam	026507c06c	Update README_hidream.md (#11386 ) Small change requirements_sana.txt to requirements_hidream.txt	2025-04-22 20:08:26 -04:00
YiYi Xu	448c72a230	[HiDream] move deprecation to 0.35.0 (#11384 ) up	2025-04-22 08:08:08 -10:00
Aryan	f108ad8888	Update modeling imports (#11129 ) update	2025-04-22 06:59:25 -10:00
Linoy Tsaban	e30d3bf544	[LoRA] add LoRA support to HiDream and fine-tuning script (#11281 ) * initial commit * initial commit * initial commit * initial commit * initial commit * initial commit * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Bagheera <59658056+bghira@users.noreply.github.com> * move prompt embeds, pooled embeds outside * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: hlky <hlky@hlky.ac> * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: hlky <hlky@hlky.ac> * fix import * fix import and tokenizer 4, text encoder 4 loading * te * prompt embeds * fix naming * shapes * initial commit to add HiDreamImageLoraLoaderMixin * fix init * add tests * loader * fix model input * add code example to readme * fix default max length of text encoders * prints * nullify training cond in unpatchify for temp fix to incompatible shaping of transformer output during training * smol fix * unpatchify * unpatchify * fix validation * flip pred and loss * fix shift!!! * revert unpatchify changes (for now) * smol fix * Apply style fixes * workaround moe training * workaround moe training * remove prints * to reduce some memory, keep vae in `weight_dtype` same as we have for flux (as it's the same vae) `bbd0c161b5/examples/dreambooth/train_dreambooth_lora_flux.py (L1207)` * refactor to align with HiDream refactor * refactor to align with HiDream refactor * refactor to align with HiDream refactor * add support for cpu offloading of text encoders * Apply style fixes * adjust lr and rank for train example * fix copies * Apply style fixes * update README * update README * update README * fix license * keep prompt2,3,4 as None in validation * remove reverse ode comment * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * vae offload change * fix text encoder offloading * Apply style fixes * cleaner to_kwargs * fix module name in copied from * add requirements * fix offloading * fix offloading * fix offloading * update transformers version in reqs * try AutoTokenizer * try AutoTokenizer * Apply style fixes * empty commit * Delete tests/lora/test_lora_layers_hidream.py * change tokenizer_4 to load with AutoTokenizer as well * make text_encoder_four and tokenizer_four configurable * save model card * save model card * revert T5 * fix test * remove non diffusers lumina2 conversion --------- Co-authored-by: Bagheera <59658056+bghira@users.noreply.github.com> Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-22 11:44:02 +03:00
apolinário	6ab62c7431	Add stochastic sampling to FlowMatchEulerDiscreteScheduler (#11369 ) * Add stochastic sampling to FlowMatchEulerDiscreteScheduler This PR adds stochastic sampling to FlowMatchEulerDiscreteScheduler based on `b1aeddd7cc` ltx_video/schedulers/rf.py * Apply style fixes * Use config value directly * Apply style fixes * Swap order * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-04-21 17:18:30 -10:00

1 2 3 4 5 ...

5509 Commits