Sayak Paul
d457beed92
Update README.md to update the MaPO project ( #8470 )
...
Update README.md
2024-06-11 10:10:45 +01:00
Tolga Cangöz
98730c5dd7
Errata ( #8322 )
...
* Fix typos
* Trim trailing whitespaces
* Remove a trailing whitespace
* chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0
* Revert "chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0"
This reverts commit fd742b30b4.
* pokemon -> naruto
* `DPMSolverMultistep` -> `DPMSolverMultistepScheduler`
* Improve Markdown stylization
* Improve style
* Improve style
* Refactor pipeline variable names for consistency
* up style
2024-06-05 13:59:09 -07:00
Hzzone
d3881f35b7
Gligen training ( #7906 )
...
* add training code of gligen
* fix code quality tests.
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-06-05 16:26:42 +04:00
satani99
352d96eb82
Modularize train_text_to_image_lora_sdxl inferencing during and after training in example ( #8335 )
...
* Modularized the train_lora_sdxl file
* Modularized the train_lora_sdxl file
* Modularized the train_lora_sdxl file
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-31 04:52:22 +05:30
Genius Patrick
3511a9623f
fix(training): lr scheduler doesn't work properly in distributed scenarios ( #8312 )
2024-05-30 15:23:19 +05:30
Tolga Cangöz
a2ecce26bc
Fix Copying Mechanism typo/bug ( #8232 )
...
* Fix copying mechanism typos
* fix copying mecha
* Revert, since they are in TODO
* Fix copying mechanism
2024-05-29 09:37:18 -07:00
satani99
3bc3b48c10
Modularize train_text_to_image_lora SD inferencing during and after training in example ( #8283 )
...
* Modularized the train_lora file
* Modularized the train_lora file
* Modularized the train_lora file
* Modularized the train_lora file
* Modularized the train_lora file
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-29 10:08:02 +05:30
Sayak Paul
581d8aacf7
post release v0.28.0 ( #8286 )
...
* post release v0.28.0
* style
2024-05-29 07:13:22 +05:30
Sajad Norouzi
67bef2027c
Add Kohya fix to SD pipeline for high resolution generation ( #7633 )
...
add kohya high resolution fix.
2024-05-28 10:00:04 -10:00
Tolga Cangöz
0ab63ff647
Fix CPU Offloading Usage & Typos ( #8230 )
...
* Fix typos
* Fix `pipe.enable_model_cpu_offload()` usage
* Fix cpu offloading
* Update numbers
2024-05-24 11:25:29 -07:00
Tolga Cangöz
db33af065b
Fix a grammatical error in the raise messages ( #8272 )
...
Fix grammatical error
2024-05-24 11:15:00 -07:00
Yifan Zhou
46a9db0336
[Community Pipeline] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation ( #8239 )
...
* code and doc
* update paper link
* remove redundant codes
* add example video
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-24 14:44:20 +05:30
Sai-Suraj-27
df2bc5ef28
fix: Fixed few docstrings according to the Google Style Guide ( #7717 )
...
Fixed few docstrings according to the Google Style Guide.
2024-05-20 10:26:05 -07:00
Alphin Jain
1221b28eac
Fix AttributeError in train_lcm_distill_lora_sdxl_wds.py ( #7923 )
...
Fix conditional teacher model check in train_lcm_distill_lora_sdxl_wds.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-16 15:49:54 +05:30
Sai-Suraj-27
2afea72d29
refactor: Refactored code by Merging isinstance calls ( #7710 )
...
* Merged isinstance calls to make the code simpler.
* Corrected formatting errors using ruff.
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-05-15 18:33:19 -10:00
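The `isinstance` merge in the entry above relies on `isinstance` accepting a tuple of types. A minimal sketch of the pattern follows; both function names are hypothetical and are not taken from the diffusers code base.

```python
def is_number_verbose(value):
    # Before: two separate isinstance calls joined with `or`.
    return isinstance(value, int) or isinstance(value, float)


def is_number_merged(value):
    # After: a single call with a tuple of types; the result is the same.
    return isinstance(value, (int, float))
```

Passing a tuple keeps the check on one line and avoids repeating the value being tested.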
Isamu Isozaki
d27e996ccd
Adding VQGAN Training script ( #5483 )
...
* Init commit
* Removed einops
* Added default movq config for training
* Update explanation of prompts
* Fixed inheritance of discriminator and init_tracker
* Fixed incompatible api between muse and here
* Fixed output
* Setup init training
* Basic structure done
* Removed attention for quick tests
* Style fixes
* Fixed vae/vqgan styles
* Removed redefinition of wandb
* Fixed log_validation and tqdm
* Nothing commit
* Added commit loss to lookup_from_codebook
* Update src/diffusers/models/vq_model.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* Adding preliminary README
* Fixed one typo
* Local changes
* Fixed main issues
* Merging
* Update src/diffusers/models/vq_model.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* Testing+Fixed bugs in training script
* Some style fixes
* Added wandb to docs
* Fixed timm test
* get testing suite ready.
* remove return loss
* remove return_loss
* Remove diffs
* Remove diffs
* fix ruff format
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2024-05-15 08:47:12 +05:30
Jingyang Zhang
69fdb8720f
[Pipeline] Adding BoxDiff to community examples ( #7947 )
...
add boxdiff to community examples
2024-05-14 11:18:29 -10:00
bssrdf
cdda94f412
fix VAE loading issue in train_dreambooth ( #7632 )
...
* fixed vae loading issue #7619
* rerun make style && make quality
* bring back model_has_vae and change \ to / in config_file_name on Windows so the match works
* add missing import platform
* bring back import model_info
* make config_file_name OS independent
* switch to using Path.as_posix() to resolve OS dependence
* improve style
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: bssrdf <bssrdf@gmail.com >
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2024-05-14 08:19:53 +05:30
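The `Path.as_posix()` bullet in the entry above is about making a config file name compare equal regardless of the path separator. A hedged sketch of the idea follows; the helper name is hypothetical, and `PureWindowsPath` is used here only so that the example behaves the same on every host OS (the actual script may construct its path differently).

```python
from pathlib import PureWindowsPath


def normalize_config_name(config_file_name: str) -> str:
    # PureWindowsPath accepts both "\" and "/" as separators on any host OS,
    # and as_posix() always renders the result with forward slashes.
    return PureWindowsPath(config_file_name).as_posix()
```

With this normalization, `vae\config.json` and `vae/config.json` produce the same string, so a later string match does not depend on the platform.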
dependabot[bot]
5b830aa356
Bump transformers from 4.36.0 to 4.38.0 in /examples/research_projects/realfill ( #7635 )
...
Bump transformers in /examples/research_projects/realfill
Bumps [transformers](https://github.com/huggingface/transformers) from 4.36.0 to 4.38.0.
- [Release notes](https://github.com/huggingface/transformers/releases)
- [Commits](https://github.com/huggingface/transformers/compare/v4.36.0...v4.38.0)
---
updated-dependencies:
- dependency-name: transformers
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-14 08:17:06 +05:30
Kohei
9e7bae9881
Update requirements.txt for text_to_image ( #7892 )
...
Update requirements.txt
If the datasets library is old, it will not read the metadata.jsonl and the label will default to an integer of type int.
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-14 08:09:12 +05:30
HelloWorldBeginner
98ba18ba55
Add Ascend NPU support for SDXL. ( #7916 )
...
Co-authored-by: mhh001 <mahonghao1@huawei.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-12 13:34:23 +02:00
HelloWorldBeginner
0267c5233a
fix bugs when using deepspeed in sdxl ( #7917 )
...
fix bugs when using deepspeed
Co-authored-by: mhh001 <mahonghao1@huawei.com >
2024-05-11 20:49:09 +02:00
Mark Van Aken
be4afa0bb4
#7535 Update FloatTensor type hints to Tensor ( #7883 )
...
* find & replace all FloatTensors to Tensor
* apply formatting
* Update torch.FloatTensor to torch.Tensor in the remaining files
* formatting
* Fix the rest of the places where FloatTensor is used as well as in documentation
* formatting
* Update new file from FloatTensor to Tensor
2024-05-10 09:53:31 -10:00
Tolga Cangöz
c1c42698c9
Remove dead code and fix f-string issue ( #7720 )
...
* Remove dead code
* Pylance reportGeneralTypeIssues: Strings nested within an f-string cannot use the same quote character as the f-string prior to Python 3.12.
* Remove dead code
2024-05-08 13:15:28 -10:00
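The f-string issue in the entry above comes from a parser restriction that was lifted in Python 3.12 (PEP 701). A small illustration follows, with a made-up dictionary; it is not code from the affected files.

```python
settings = {"name": "demo"}

# Portable across Python versions: the inner string uses a different
# quote character than the enclosing f-string.
message = f"model: {settings['name']}"

# Reusing the outer quote inside the braces, as in
#     f"model: {settings["name"]}"
# is a SyntaxError before Python 3.12.
```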
Hyoungwon Cho
c2217142bd
Modification on the PAG community pipeline (re) ( #7876 )
...
* edited_pag_implementation
* update
---------
Co-authored-by: yiyixuxu <yixu310@gmail.com >
2024-05-07 16:35:15 -10:00
Bagheera
8edaf3b79c
7879 - adjust documentation to use naruto dataset, since pokemon is now gated ( #7880 )
...
* 7879 - adjust documentation to use naruto dataset, since pokemon is now gated
* replace references to pokemon in docs
* more references to pokemon replaced
* Japanese translation update
---------
Co-authored-by: bghira <bghira@users.github.com >
2024-05-07 09:36:39 -07:00
HelloWorldBeginner
58237364b1
Add Ascend NPU support for SDXL fine-tuning and fix the model saving bug when using DeepSpeed. ( #7816 )
...
* Add Ascend NPU support for SDXL fine-tuning and fix the model saving bug when using DeepSpeed.
* fix check code quality
* Decouple the NPU flash attention and make it an independent module.
* add doc and unit tests for npu flash attention.
---------
Co-authored-by: mhh001 <mahonghao1@huawei.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-03 08:14:34 -10:00
YiYi Xu
21a7ff12a7
update the logic of is_sequential_cpu_offload ( #7788 )
...
* up
* add comment to the tests + fix dit
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-01 06:25:57 -10:00
Linoy Tsaban
26a7851e1e
Add B-Lora training option to the advanced dreambooth lora script ( #7741 )
...
* add blora
* add blora
* add blora
* add blora
* little changes
* little changes
* remove redundancies
* fixes
* add B LoRA to readme
* style
* inference
* defaults + path to loras+ generation
* minor changes
* style
* minor changes
* minor changes
* blora arg
* added --lora_unet_blocks
* style
* Update examples/advanced_diffusion_training/README.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* add commit hash to B-LoRA repo cloning
* change inference, remove cloning
* change inference, remove cloning
add section about configurable unet blocks
* change inference, remove cloning
add section about configurable unet blocks
* Apply suggestions from code review
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-04-30 09:46:30 +05:30
Nilesh
235d34cf56
Check for latents, before calling prepare_latents - sdxlImg2Img ( #7582 )
...
* Check for latents, before calling prepare_latents - sdxlImg2Img
* Added latents check for all the img2img pipeline
* Fixed silly mistake while checking latents as None
2024-04-28 14:53:29 -10:00
39th president of the United States, probably
9d16daaf64
Add DREAM training ( #6381 )
...
A new function compute_dream_and_update_latents has been added to the
training utilities that allows you to do DREAM rectified training in line
with the paper https://arxiv.org/abs/2312.00210. The method can be used
with an extra argument in the train_text_to_image.py script.
Co-authored-by: Jimmy <39@🇺🇸 .com>
2024-04-27 07:19:15 +05:30
Steven Liu
7404f1e9dc
[docs] Clean up toctree ( #7715 )
...
* toctree
* optim
* feedback
* improve overview
2024-04-23 09:30:33 -07:00
Abhinav Gopal
d1e3f489e9
Animatediff Controlnet Community Pipeline IP Adapter Fix ( #7413 )
...
* fixed encode_image function signature in controlnet animatediff
* copied encode_image from stable diffusion pipeline
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-04-19 15:35:07 -10:00
Sai-Suraj-27
db969cc16d
fix: Fixed type annotations for compatibility with python 3.8 ( #7648 )
...
* Fixed type annotations for compatibility with python 3.8
* Add required imports.
2024-04-18 19:34:09 -10:00
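For context on the Python 3.8 entry above: built-in generics such as `list[str]` need Python 3.9, and `int | None` needs Python 3.10, when annotations are evaluated at definition time. The usual 3.8-safe form imports the generic aliases from `typing`, which matches the "Add required imports" bullet. The function below is a hypothetical example, not one of the annotations that the commit changed.

```python
from typing import Dict, List, Optional


def count_words(prompts: List[str], limit: Optional[int] = None) -> Dict[str, int]:
    # typing.List / Optional / Dict work on Python 3.8, where list[str]
    # or `int | None` in a signature would raise a TypeError.
    selected = prompts if limit is None else prompts[:limit]
    return {prompt: len(prompt.split()) for prompt in selected}
```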
Dhruv Nair
90250d9e48
Cast height, width to int inside prepare latents ( #7691 )
...
update
2024-04-18 19:30:39 -10:00
Fabio Rigano
b5c8b555d7
Move IP Adapter Face ID to core ( #7186 )
...
* Switch to peft and multi proj layers
* Move Face ID loading and inference to core
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-04-18 14:13:27 -10:00
UmerHA
fda1531d8a
Fixing implementation of ControlNet-XS ( #6772 )
...
* CheckIn - created DownSubBlocks
* Added extra channels, implemented subblock fwd
* Fixed connection sizes
* checkin
* Removed iter, next in forward
* Models for SD21 & SDXL run through
* Added back pipelines, cleared up connections
* Cleaned up connection creation
* added debug logs
* updated logs
* logs: added input loading
* Update umer_debug_logger.py
* log: Loading hint
* Update umer_debug_logger.py
* added logs
* Changed debug logging
* debug: added more logs
* Fixed num_norm_groups
* Debug: Logging all of SDXL input
* Update umer_debug_logger.py
* debug: updated logs
* checkin
* Readded tests
* Removed debug logs
* Fixed Slow Tests
* Added value checks | Updated model_cpu_offload_seq
* accelerate-offloading works ; fast tests work
* Made unet & addon explicit in controlnet
* Updated slow tests
* Added dtype/device to ControlNetXS
* Filled in test model paths
* Added image_encoder/feature_extractor to XL pipe
* Fixed fast tests
* Added comments and docstrings
* Fixed copies
* Added docs ; Updates slow tests
* Moved changes to UNetMidBlock2DCrossAttn
* tiny cleanups
* Removed stray prints
* Removed ip adapters + freeU
- Removed ip adapters + freeU as they don't make sense for ControlNet-XS
- Fixed imports of UNet components
* Fixed test_save_load_float16
* Make style, quality, fix-copies
* Changed loading/saving API for ControlNetXS
- Changed loading/saving API for ControlNetXS
- other small fixes
* Removed ControlNet-XS from research examples
* Make style, quality, fix-copies
* Small fixes
- deleted ControlNetXSModel.init_original
- added time_embedding_mix to StableDiffusionControlNetXSPipeline.from_pretrained / StableDiffusionXLControlNetXSPipeline.from_pretrained
- fixed copy hints
* checkin May 11 '23
* CheckIn Mar 12 '24
* Fixed tests for SD
* Added tests for UNetControlNetXSModel
* Fixed SDXL tests
* cleanup
* Delete Pipfile
* CheckIn Mar 20
Started replacing sub blocks by `ControlNetXSCrossAttnDownBlock2D` and `ControlNetXSCrossAttnUplock2D`
* check-in Mar 23
* checkin 24 Mar
* Created init for UNetCnxs and CnxsAddon
* CheckIn
* Made from_modules, from_unet and no_control work
* make style,quality,fix-copies & small changes
* Fixed freezing
* Added gradient ckpt'ing; fixed tests
* Fix slow tests(+compile) ; clear naming confusion
* Don't create UNet in init ; removed class_emb
* Incorporated review feedback
- Deleted get_base_pipeline / get_controlnet_addon for pipes
- Pipes inherit from StableDiffusionXLPipeline
- Made module dicts for cnxs-addon's down/mid/up classes
- Added support for qkv fusion and freeU
* Make style, quality, fix-copies
* Implemented review feedback
* Removed compatibility check for vae/ctrl embedding
* make style, quality, fix-copies
* Delete Pipfile
* Integrated review feedback
- Importing ControlNetConditioningEmbedding now
- get_down/mid/up_block_addon now outside class
- renamed `do_control` to `apply_control`
* Reduced size of test tensors
For this, added `norm_num_groups` as parameter everywhere
* Renamed cnxs-`Addon` to cnxs-`Adapter`
- `ControlNetXSAddon` -> `ControlNetXSAdapter`
- `ControlNetXSAddonDownBlockComponents` -> `DownBlockControlNetXSAdapter`, and similarly for mid/up
- `get_mid_block_addon` -> `get_mid_block_adapter`, and similarly for down/up
* Fixed save_pretrained/from_pretrained bug
* Removed redundant code
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2024-04-16 21:56:20 +05:30
kabachuha
2f23437618
Add (Scheduled) Pseudo-Huber Loss training scripts to research projects ( #7527 )
...
* add scheduled pseudo-huber loss training scripts
See #7488
* add reduction modes to huber loss
* [DB Lora] *2 multiplier to huber loss cause of 1/2 a^2 conv.
pairing of c6495def1f
* [DB Lora] add option for smooth l1 (huber / delta)
Pairing of dd22958caa
* [DB Lora] unify huber scheduling
Pairing of 19a834c3ab
* [DB Lora] add snr huber scheduler
Pairing of 47fb1a6854
* fixup examples link
* use snr schedule by default in DB
* update all huber scripts with snr
* code quality
* huber: make style && make quality
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-04-13 07:26:08 +05:30
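The "*2 multiplier" bullet in the entry above can be read against the textbook pseudo-Huber loss, which behaves like `r**2 / 2` for small residuals `r`. Doubling it makes the small-residual limit match plain squared error. The scalar sketch below assumes that reading; the log does not show the exact expression used in the training scripts, and the function and parameter names are illustrative.

```python
import math


def pseudo_huber(residual: float, delta: float) -> float:
    # Doubled pseudo-Huber: 2 * delta * (sqrt(r^2 + delta^2) - delta).
    # Near r = 0 this is approximately r^2; for |r| >> delta it grows
    # roughly like 2 * delta * |r|, i.e. linearly.
    return 2.0 * delta * (math.sqrt(residual * residual + delta * delta) - delta)
```

A smaller `delta` moves the switch from quadratic to linear growth closer to zero, which is the quantity a "scheduled" variant would vary over training.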
dg845
0bee4d336b
LCM Distill Scripts Fix Bug when Initializing Target U-Net ( #6848 )
...
* Initialize target_unet from unet rather than teacher_unet so that we correctly add time_embedding.cond_proj if necessary.
* Use UNet2DConditionModel.from_config to initialize target_unet from unet's config.
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-04-11 07:52:12 -10:00
satani99
37e9d695af
Modularize instruct_pix2pix SD inferencing during and after training in examples ( #7603 )
...
* Modularize instruct_pix2pix code
* quality check
* quality check
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-04-10 11:19:16 +05:30
Christopher Beckham
8e46d97cd8
Add missing restore() EMA call in train SDXL script ( #7599 )
...
* Restore unet params back to normal from EMA when validation call is finished
* empty commit
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-04-09 18:07:55 +05:30
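The missing `restore()` call fixed in the entry above belongs to a store / copy / restore pattern: the training weights are saved, the EMA weights are swapped in for validation, and the training weights are put back afterwards. The toy class below is a hypothetical stand-in that works on a plain list of floats; it is a sketch of the pattern and not the helper used by the training script.

```python
class ToyEMA:
    """Toy EMA helper over a list of floats (illustration only)."""

    def __init__(self, params, decay=0.9):
        self.decay = decay
        self.shadow = list(params)  # exponentially averaged weights
        self.backup = None          # training weights saved during validation

    def step(self, params):
        self.shadow = [self.decay * s + (1 - self.decay) * p
                       for s, p in zip(self.shadow, params)]

    def store(self, params):
        self.backup = list(params)

    def copy_to(self, params):
        params[:] = self.shadow

    def restore(self, params):
        params[:] = self.backup
        self.backup = None


def validate_with_ema(ema, params, run_validation):
    ema.store(params)        # keep the current training weights
    ema.copy_to(params)      # validate with the averaged weights
    try:
        return run_validation(params)
    finally:
        ema.restore(params)  # the step that was missing: undo the swap
```

Without the final `restore`, training would silently continue from the averaged weights after the first validation run.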
Bagheera
8e963d1c2a
7529 do not disable autocast for cuda devices ( #7530 )
...
* 7529 do not disable autocast for cuda devices
* Remove typecasting error check for non-mps platforms, as a correct autocast implementation makes it a non-issue
* add autocast fix to other training examples
* disable native_amp for dreambooth (sdxl)
* disable native_amp for pix2pix (sdxl)
* remove tests from remaining files
* disable native_amp on huggingface accelerator for every training example that uses it
* convert more usages of autocast to nullcontext, make style fixes
* make style fixes
* style.
* Empty-Commit
---------
Co-authored-by: bghira <bghira@users.github.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-04-02 20:15:06 +05:30
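The "convert more usages of autocast to nullcontext" bullet in the entry above describes choosing a no-op context manager where mixed-precision autocast should not be applied. A self-contained sketch follows; `fake_autocast` is a hypothetical stand-in for a real autocast context, and treating `"mps"` as the unsupported device is an assumption made for illustration.

```python
from contextlib import contextmanager, nullcontext

events = []


@contextmanager
def fake_autocast(device_type):
    # Hypothetical stand-in for a mixed-precision context manager.
    events.append(f"enter {device_type}")
    try:
        yield
    finally:
        events.append(f"exit {device_type}")


def pick_autocast_ctx(device_type, autocast_unsupported=("mps",)):
    # Assumed rule: devices listed as unsupported get a no-op context,
    # everything else keeps autocast enabled.
    if device_type in autocast_unsupported:
        return nullcontext()
    return fake_autocast(device_type)
```

Callers can then write `with pick_autocast_ctx(device_type):` once, without branching at every call site.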
Álvaro Somoza
73ba81090e
[Community pipeline] SDXL Differential Diffusion Img2Img Pipeline ( #7550 )
...
* initial-commit pipeline created
* updated README.md
2024-04-01 18:15:30 -10:00
YiYi Xu
7956c36aaa
add a from_pipe method to DiffusionPipeline ( #7241 )
...
* add from_pipe
---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2024-04-01 13:02:00 -10:00
haikmanukyan
5266ab7935
add HD-Painter pipeline ( #7520 )
...
* add HD-Painter pipeline
* style fixing
* refactor, change doc, fix ruff
* fix docs
* used correct ruff version
---------
Co-authored-by: Hayk Manukyan <youremail@yourdomain.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-04-01 15:10:44 +05:30
Bingxin Ke
c2e87869be
[Community pipeline] Marigold depth estimation update -- align with marigold v0.1.5 ( #7524 )
...
* add resample option; check denoise_step; update ckpt path
* Add seeding in pipeline to increase reproducibility
* fix typo
* fix typo
2024-03-30 07:09:02 -10:00
Hyoungwon Cho
9d20ed37a2
Perturbed-Attention Guidance ( #7512 )
...
* pag_initial
* pag_docs
* edit_docs
* custom
* typo
* delete_docs
* whitespace
* make style
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-03-30 10:52:51 +05:30
Linoy Tsaban
bda1d4faf8
add Instant id sdxl image2image pipeline ( #7507 )
...
* initial commit - instantid img2img
* adapting to img2img
* change add_time_ids
* change add_time_ids
* WIP changes
* add strength to timesteps
* check insightface import
* style
* check insightface import changed to warning
* check insightface import changed to warning
* style
---------
Co-authored-by: apolinário <joaopaulo.passos@gmail.com >
2024-03-30 10:25:21 +05:30
Bagheera
d78acdedc1
apple mps: training support for SDXL (ControlNet, LoRA, Dreambooth, T2I) ( #7447 )
...
* apple mps: training support for SDXL LoRA
* sdxl: support training lora, dreambooth, t2i, pix2pix, and controlnet on apple mps
---------
Co-authored-by: bghira <bghira@users.github.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-03-28 14:26:18 +05:30
Thomas Liang
0cc5630945
[Chore] Fix Colab notebook links in README.md ( #7495 )
2024-03-27 12:36:36 -10:00