diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
sayakpaul	7828d4eb00	Release: v0.28.0 v0.28.0	2024-05-27 17:24:18 +05:30
Anton Obukhov	b3d10d6d65	[Pipeline] Marigold depth and normals estimation (#7847 ) * implement marigold depth and normals pipelines in diffusers core * remove bibtex * remove deprecations * remove save_memory argument * remove validate_vae * remove config output * remove batch_size autodetection * remove presets logic move default denoising_steps and processing_resolution into the model config make default ensemble_size 1 * remove no_grad * add fp16 to the example usage * implement is_matplotlib_available use is_matplotlib_available, is_scipy_available for conditional imports in the marigold depth pipeline * move colormap, visualize_depth, and visualize_normals into export_utils.py * make the denoising loop more lucid fix the outputs to always be 4d tensors or lists of pil images support a 4d input_image case attempt to support model_cpu_offload_seq move check_inputs into a separate function change default batch_size to 1, remove any logic to make it bigger implicitly * style * rename denoising_steps into num_inference_steps * rename input_image into image * rename input_latent into latents * remove decode_image change decode_prediction to use the AutoencoderKL.decode method * move clean_latent outside of progress_bar * refactor marigold-reusable image processing bits into MarigoldImageProcessor class * clean up the usage example docstring * make ensemble functions members of the pipelines * add early checks in check_inputs rename E into ensemble_size in depth ensembling * fix vae_scale_factor computation * better compatibility with torch.compile better variable naming * move export_depth_to_png to export_utils * remove encode_prediction * improve visualize_depth and visualize_normals to accept multi-dimensional data and lists remove visualization functions from the pipelines move exporting depth as 16-bit PNGs functionality from the depth pipeline update example docstrings * do not shortcut vae.config variables * change all asserts to raise ValueError * rename output_prediction_type to output_type * better variable names clean up variable deletion code * better variable names * pass desc and leave kwargs into the diffusers progress_bar implement nested progress bar for images and steps loops * implement scale_invariant and shift_invariant flags in the ensemble_depth function add scale_invariant and shift_invariant flags readout from the model config further refactor ensemble_depth support ensembling without alignment add ensemble_depth docstring * fix generator device placement checks * move encode_empty_text body into the pipeline call * minor empty text encoding simplifications * adjust pipelines' class docstrings to explain the added construction arguments * improve the scipy failure condition add comments improve docstrings change the default use_full_z_range to True * make input image values range check configurable in the preprocessor refactor load_image_canonical in preprocessor to reject unknown types and return the image in the expected 4D format of tensor and on right device support a list of everything as inputs to the pipeline, change type to PipelineImageInput implement a check that all input list elements have the same dimensions improve docstrings of pipeline outputs remove check_input pipeline argument * remove forgotten print * add prediction_type model config * add uncertainty visualization into export utils fix NaN values in normals uncertainties * change default of output_uncertainty to False better handle the case of an attempt to export or visualize none * fix `output_uncertainty=False` * remove kwargs fix check_inputs according to the new inputs of the pipeline * rename prepare_latent into prepare_latents as in other pipelines annotate prepare_latents in normals pipeline with "Copied from" annotate encode_image in normals pipeline with "Copied from" * move nested-capable `progress_bar` method into the pipelines revert the original `progress_bar` method in pipeline_utils * minor message improvement * fix cpu offloading * move colormap, visualize_depth, export_depth_to_16bit_png, visualize_normals, visualize_uncertainty to marigold_image_processing.py update example docstrings * fix missing comma * change torch.FloatTensor to torch.Tensor * fix importing of MarigoldImageProcessor * fix vae offloading fix batched image encoding remove separate encode_image function and use vae.encode instead * implement marigold's intial tests relax generator checks in line with other pipelines implement return_dict __call__ argument in line with other pipelines * fix num_images computation * remove MarigoldImageProcessor and outputs from import structure update tests * update docstrings * update init * update * style * fix * fix * up * up * up * add simple test * up * update expected np input/output to be channel last * move expand_tensor_or_array into the MarigoldImageProcessor * rewrite tests to follow conventions - hardcoded slices instead of image artifacts write more smoke tests * add basic docs. * add anton's contribution statement * remove todos. * fix assertion values for marigold depth slow tests * fix assertion values for depth normals. * remove print * support AutoencoderTiny in the pipelines * update documentation page add Available Pipelines section add Available Checkpoints section add warning about num_inference_steps * fix missing import in docstring fix wrong value in visualize_depth docstring * [doc] add marigold to pipelines overview * [doc] add section "usage examples" * fix an issue with latents check in the pipelines * add "Frame-by-frame Video Processing with Consistency" section * grammarly * replace tables with images with css-styled images (blindly) * style * print * fix the assertions. * take from the github runner. * take the slices from action artifacts * style. * update with the slices from the runner. * remove unnecessary code blocks. * Revert "[doc] add marigold to pipelines overview" This reverts commit a505165150afd8dab23c474d1a054ea505a56a5f. * remove invitation for new modalities * split out marigold usage examples * doc cleanup --------- Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2024-05-27 17:21:49 +05:30
Dhruv Nair	b82f9f5666	Add zip package to doc builder image (#8284 ) update	2024-05-27 15:50:00 +05:30
Sayak Paul	6a5ba1b719	[Workflows] add a more secure way to run tests from a PR. (#7969 ) * add a more secure way to run tests from a PR. * make pytest more secure. * address dhruv's comments. * improve validation check. * Update .github/workflows/run_tests_from_a_pr.yml Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-05-27 13:47:50 +05:30
Dhaivat Bhatt	4d40c9140c	Add details about 1-stage implementation in I2VGen-XL docs (#8282 ) * Add details about 1-stage implementation * Add details about 1-stage implementation	2024-05-27 09:56:32 +05:30
Tolga Cangöz	0ab63ff647	Fix CPU Offloading Usage & Typos (#8230 ) * Fix typos * Fix `pipe.enable_model_cpu_offload()` usage * Fix cpu offloading * Update numbers	2024-05-24 11:25:29 -07:00
Tolga Cangöz	db33af065b	Fix a grammatical error in the `raise` messages (#8272 ) Fix grammatical error	2024-05-24 11:15:00 -07:00
Yue Wu	1096f88e2b	sampling bug fix in diffusers tutorial "basic_training.md" (#8223 ) sampling bug fix in basic_training.md In the diffusers basic training tutorial, setting the manual seed argument (generator=torch.manual_seed(config.seed)) in the pipeline call inside evaluate() function rewinds the dataloader shuffling, leading to overfitting due to the model seeing same sequence of training examples after every evaluation call. Using generator=torch.Generator(device='cpu').manual_seed(config.seed) avoids this.	2024-05-24 11:14:32 -07:00
Dhruv Nair	cef4a51223	Clean up `from_single_file` docs (#8268 ) * update * update	2024-05-24 17:43:51 +05:30
Lucain	edf5ba6a17	Respect `resume_download` deprecation V2 (#8267 ) * Fix resume_downoad FutureWarning * only resume download	2024-05-24 12:11:03 +02:00
Sayak Paul	9941f1f61b	[Chore] run the documentation workflow in a custom container. (#8266 ) run the documentation workflow in a custom container.	2024-05-24 15:10:02 +05:30
Yifan Zhou	46a9db0336	[Community Pipeline] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation (#8239 ) * code and doc * update paper link * remove redundant codes * add example video --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-24 14:44:20 +05:30
Dhruv Nair	370146e4e0	Use `freedesktop_os_release()` in diffusers cli for Python >=3.10 (#8235 ) * update * update	2024-05-24 13:30:40 +05:30
Dhruv Nair	5cd45c24bf	Create custom container for doc builder (#8263 ) * update * update	2024-05-24 12:53:48 +05:30
Dhruv Nair	67b3fe0aae	Fix resize issue in SVD pipeline with VideoProcessor (#8229 ) update Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-23 11:57:34 +05:30
Dhruv Nair	baab065679	Remove unnecessary single file tests for SD Cascade UNet (#7996 ) update	2024-05-22 12:29:59 +05:30
BootesVoid	509741aea7	fix: Attribute error in Logger object (logger.warning) (#8183 )	2024-05-22 12:29:11 +05:30
Lucain	e1df77ee1e	Use HF_TOKEN env var in CI (#7993 )	2024-05-21 14:58:10 +05:30
Steven Liu	fdb1baa05c	[docs] VideoProcessor (#7965 ) * fix? * fix? * fix	2024-05-21 08:18:21 +05:30
Vinh H. Pham	6529ee67ec	Make VAE compatible to torch.compile() (#7984 ) make VAE compatible to torch.compile() Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-05-20 13:43:59 -04:00
Sai-Suraj-27	df2bc5ef28	fix: Fixed few `docstrings` according to the Google Style Guide (#7717 ) Fixed few docstrings according to the Google Style Guide.	2024-05-20 10:26:05 -07:00
Aleksei Zhuravlev	a7bf77fc28	Passing `cross_attention_kwargs` to `StableDiffusionInstructPix2PixPipeline` (#7961 ) * Update pipeline_stable_diffusion_instruct_pix2pix.py Add `cross_attention_kwargs` to `__call__` method of `StableDiffusionInstructPix2PixPipeline`, which are passed to UNet. * Update documentation for pipeline_stable_diffusion_instruct_pix2pix.py * Update docstring * Update docstring * Fix typing import	2024-05-20 13:14:34 -04:00
Junsong Chen	0f0defdb65	[docs] add doc for PixArtSigmaPipeline (#7857 ) * 1. add doc for PixArtSigmaPipeline; --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Guillaume LEGENDRE <glegendre01@gmail.com> Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by: Bagheera <59658056+bghira@users.noreply.github.com> Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Hyoungwon Cho <jhw9811@korea.ac.kr> Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: Tolga Cangöz <46008593+standardAI@users.noreply.github.com> Co-authored-by: Philip Pham <phillypham@google.com>	2024-05-20 12:40:57 -04:00
Nikita	19df9f3ec0	Update pipeline_controlnet_inpaint_sd_xl.py (#7983 )	2024-05-20 12:24:49 -04:00
Jacob Marks	d6ca120987	Fix typo in "attention" (#7977 )	2024-05-20 11:54:29 -04:00
Sayak Paul	fb7ae0184f	[tests] fix Pixart Sigma tests (#7966 ) * checking tests * checking ii. * remove prints. * test_pixart_1024 * fix 1024.	2024-05-19 20:56:31 +05:30
Sayak Paul	70f8d4b488	remove unsafe workflow. (#7967 )	2024-05-17 13:46:24 +05:30
Álvaro Somoza	6c60e430ee	Consistent SDXL Controlnet callback tensor inputs (#7958 ) * make _callback_tensor_inputs consistent between sdxl pipelines * forgot this one * fix failing test * fix test_components_function * fix controlnet inpaint tests	2024-05-16 07:15:10 -10:00
Alphin Jain	1221b28eac	Fix AttributeError in train_lcm_distill_lora_sdxl_wds.py (#7923 ) Fix conditional teacher model check in train_lcm_distill_lora_sdxl_wds.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-16 15:49:54 +05:30
Liang Hou	746f603b20	Fix the text tokenizer name in logger warning of PixArt pipelines (#7912 ) Fix CLIP to T5 in logger warning	2024-05-15 18:49:29 -10:00
Sai-Suraj-27	2afea72d29	refactor: Refactored code by Merging `isinstance` calls (#7710 ) * Merged isinstance calls to make the code simpler. * Corrected formatting errors using ruff. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-05-15 18:33:19 -10:00
Sayak Paul	0f111ab794	[Workflows] add a workflow that can be manually triggered on a PR. (#7942 ) * add a workflow that can be manually triggered on a PR. * remove sudo * add command * small fixes.	2024-05-15 17:18:56 +05:30
Guillaume LEGENDRE	4dd7aaa06f	move to GH hosted M1 runner (#7949 )	2024-05-15 13:47:36 +05:30
Isamu Isozaki	d27e996ccd	Adding VQGAN Training script (#5483 ) * Init commit * Removed einops * Added default movq config for training * Update explanation of prompts * Fixed inheritance of discriminator and init_tracker * Fixed incompatible api between muse and here * Fixed output * Setup init training * Basic structure done * Removed attention for quick tests * Style fixes * Fixed vae/vqgan styles * Removed redefinition of wandb * Fixed log_validation and tqdm * Nothing commit * Added commit loss to lookup_from_codebook * Update src/diffusers/models/vq_model.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Adding perliminary README * Fixed one typo * Local changes * Fixed main issues * Merging * Update src/diffusers/models/vq_model.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Testing+Fixed bugs in training script * Some style fixes * Added wandb to docs * Fixed timm test * get testing suite ready. * remove return loss * remove return_loss * Remove diffs * Remove diffs * fix ruff format --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-05-15 08:47:12 +05:30
Sayak Paul	72780ff5b1	[tests] decorate StableDiffusion21PipelineSingleFileSlowTests with slow. (#7941 ) decorate StableDiffusion21PipelineSingleFileSlowTests with slow.	2024-05-14 14:26:21 -10:00
Jingyang Zhang	69fdb8720f	[Pipeline] Adding BoxDiff to community examples (#7947 ) add boxdiff to community examples	2024-05-14 11:18:29 -10:00
Nikita	b2140a895b	Fix `added_cond_kwargs` when using IP-Adapter in StableDiffusionXLControlNetInpaintPipeline (#7924 ) Fix `added_cond_kwargs` when using IP-Adapter Fix error when using IP-Adapter in pipeline and passing `ip_adapter_image_embeds` instead of `ip_adapter_image` Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-05-14 10:32:08 -10:00
Sayak Paul	e0e8c58f64	[Core] separate the loading utilities in modeling similar to pipelines. (#7943 ) separate the loading utilities in modeling similar to pipelines.	2024-05-14 22:33:43 +05:30
Sayak Paul	cbea5d1725	update to use hf-workflows for reporting the Docker build statuses (#7938 ) update to use hf-workflows for reporting	2024-05-14 09:25:13 +05:30
Tolga Cangöz	a1245c2c61	Expansion proposal of `diffusers-cli env` (#7403 ) * Expand `diffusers-cli env` * SafeTensors -> Safetensors Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Move `safetensors_version = "not installed"` to `else` * Update `safetensors_version` checking * Add GPU detection for Linux, Mac OS, and Windows * Add accelerator detection to environment command * Add is_peft_version to import_utils * Update env.py * Add `huggingface_hub` reference * Add `transformers` reference * Add reference for `huggingface_hub` * Fix print statement in env.py for unusual OS * Up * Fix platform information in env.py * up * Fix import order in env.py * ruff * make style * Fix platform system check in env.py * Fix run method return type in env.py * 🤗 * No need f-string * Remove location info * Remove accelerate config * Refactor env.py to remove accelerate config * feat: Add support for `bitsandbytes` library in environment command --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-05-14 08:20:24 +05:30
bssrdf	cdda94f412	fix VAE loading issue in train_dreambooth (#7632 ) * fixed vae loading issue #7619 * rerun make style && make quality * bring back model_has_vae and add change \ to / in config_file_name on windows os to make match work * add missing import platform * bring back import model_info * make config_file_name OS independent * switch to using Path.as_posix() to resolve OS dependence * improve style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: bssrdf <bssrdf@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-05-14 08:19:53 +05:30
dependabot[bot]	5b830aa356	Bump transformers from 4.36.0 to 4.38.0 in /examples/research_projects/realfill (#7635 ) Bump transformers in /examples/research_projects/realfill Bumps [transformers](https://github.com/huggingface/transformers) from 4.36.0 to 4.38.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.com/huggingface/transformers/compare/v4.36.0...v4.38.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-14 08:17:06 +05:30
Kohei	9e7bae9881	Update requirements.txt for text_to_image (#7892 ) Update requirements.txt If the datasets library is old, it will not read the metadata.jsonl and the label will default to an integer of type int. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-14 08:09:12 +05:30
rebel-kblee	b41ce1e090	fix multicontrolnet `save_pretrained` logic for compatibility (#7821 ) fix multicontrolnet save_pretrained logic for compatibility Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-13 09:32:06 -10:00
Sayak Paul	95d3748453	[LoRA] Fix LoRA tests (side effects of RGB ordering) part ii (#7932 ) * check * check 2. * update slices	2024-05-13 09:23:48 -10:00
Fabio Rigano	44aa9e566d	fix AnimateDiff creation with a unet loaded with IP Adapter (#7791 ) * Fix loading from_pipe * Fix style --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-05-13 08:15:01 -10:00
Álvaro Somoza	fdb05f54ef	Official callbacks (#7761 )	2024-05-12 17:10:29 -10:00
HelloWorldBeginner	98ba18ba55	Add Ascend NPU support for SDXL. (#7916 ) Co-authored-by: mhh001 <mahonghao1@huawei.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-12 13:34:23 +02:00
Sayak Paul	5bb38586a9	[Core] fix offload behaviour when device_map is enabled. (#7919 ) fix offload behaviour when device_map is enabled.	2024-05-12 13:29:43 +02:00
Sai-Suraj-27	ec9e88139a	fix: Fixed a wrong link to supported python versions in `contributing.md` file (#7638 ) * Fixed a wrong link to python versions in contributing.md file. * Updated the link to a permalink, so that it will permanently point to the specific line.	2024-05-12 13:21:18 +02:00

1 2 3 4 5 ...

4128 Commits