diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Patrick von Platen	e7534542a2	Release: v0.15.0 v0.15.0	2023-04-12 15:15:31 +00:00
Andranik Movsisyan	b9b891621e	Text2video zero refinements (#3070 ) * fix progress bar issue in pipeline_text_to_video_zero.py. Copy scheduler after first backward * fix tensor loading in test_text_to_video_zero.py * make style && make quality	2023-04-12 14:27:09 +01:00
Ernie Chu	a43934371a	Fix a bug of pano when not doing CFG (#3030 ) * Fix a bug of pano when not doing CFG * enhance code quality * apply formatting. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-04-12 14:20:25 +01:00
Pedro Cuenca	caa5884e8a	Update Flax TPU tests (#3069 ) Update Flax TPU tests. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-12 14:17:36 +01:00
Sayak Paul	fa736e321d	[Docs] refactor text-to-video zero (#3049 ) * fix: norm group test for UNet3D. * refactor text-to-video zero docs.	2023-04-12 14:15:26 +01:00
Patrick von Platen	a4b233e5b5	Finish docs textual inversion (#3068 ) * Finish docs textual inversion * Apply suggestions from code review Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-04-12 13:35:58 +01:00
Nipun Jindal	524535b5f2	[2064]: Add Karras to DPMSolverMultistepScheduler (#3001 ) * [2737]: Add Karras DPMSolverMultistepScheduler * [2737]: Add Karras DPMSolverMultistepScheduler * Add test * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix: repo consistency. * remove Copied from statement from the set_timestep method. * fix: test * Empty commit. Co-authored-by: njindal <njindal@adobe.com> --------- Co-authored-by: njindal <njindal@adobe.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-04-12 18:04:51 +05:30
Sean Sube	7b2407f4d7	add support for pre-calculated prompt embeds to Stable Diffusion ONNX pipelines (#2597 ) * add support for prompt embeds to SD ONNX pipeline * fix up the pipeline copies * add prompt embeds param to other ONNX pipelines * fix up prompt embeds param for SD upscaling ONNX pipeline * add missing type annotations to ONNX pipes	2023-04-12 12:19:56 +01:00
Will Berman	639f6455b4	fix pipeline __setattr__ value == None (#3063 ) * fix pipeline __setattr__ * add test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-12 12:11:09 +01:00
Andy	9d7c08f95e	[WIP] implement rest of the test cases (LoRA tests) (#2824 ) * inital commit for lora test cases * help a bit with lora for 3d * fixed lora tests * replaced redundant code --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-04-12 15:32:14 +05:30
Pedro Cuenca	dc277501c7	Flax memory efficient attention (#2889 ) * add use_memory_efficient params placeholder * test * add memory efficient attention jax * add memory efficient attention jax * newline * forgot dot * Rename use_memory_efficient * Keep dtype last. * Actually use key_chunk_size * Rename symbol * Apply style * Rename use_memory_efficient * Keep dtype last * Pass `use_memory_efficient_attention` in `from_pretrained` * Move JAX memory efficient attention to attention_flax. * Simple test. * style --------- Co-authored-by: muhammad_hanif <muhammad_hanif@sofcograha.co.id> Co-authored-by: MuhHanif <48muhhanif@gmail.com>	2023-04-12 10:17:51 +01:00
Susung Hong	0df47efee2	[Docs] update Self-Attention Guidance docs (#2952 ) * Update index.mdx * Edit docs & add HF space link * Only change equation numbers in comments	2023-04-12 10:14:32 +01:00
Sayak Paul	5a7d35e29c	Fix InstructPix2Pix training in multi-GPU mode (#2978 ) * fix: norm group test for UNet3D. * fix: unet rejig. * fix: unwrapping when running validation inputs. * unwrapping the unet too. * fix: device. * better unwrapping. * unwrapping before ema. * unwrapping.	2023-04-12 10:13:53 +01:00
Patrick von Platen	0c72006e3a	fix slow tsets (#3066 ) * fix slow tsets * make style	2023-04-12 10:23:52 +02:00
Sayak Paul	a89a14fa7a	[LoRA] Enabling limited LoRA support for text encoder (#2918 ) * add: first draft for a better LoRA enabler. * make fix-copies. * feat: backward compatibility. * add: entry to the docs. * add: tests. * fix: docs. * fix: norm group test for UNet3D. * feat: add support for flat dicts. * add depcrcation message instead of warning.	2023-04-12 08:29:04 +05:30
Sayak Paul	e607a582cf	[Examples] Fix type-casting issue in the ControlNet training script (#2994 ) * fix: norm group test for UNet3D. * fix: type-casting issue in controlnet training.	2023-04-12 06:35:06 +05:30
Will Berman	ea39cd7e64	Attn added kv processor torch 2.0 block (#3023 ) add AttnAddedKVProcessor2_0 block	2023-04-11 16:54:22 -07:00
Will Berman	98c5e5da31	Attention processor cross attention norm group norm (#3021 ) add group norm type to attention processor cross attention norm This lets the cross attention norm use both a group norm block and a layer norm block. The group norm operates along the channels dimension and requires input shape (batch size, channels, ) where as the layer norm with a single `normalized_shape` dimension only operates over the least significant dimension i.e. (, channels). The channels we want to normalize are the hidden dimension of the encoder hidden states. By convention, the encoder hidden states are always passed as (batch size, sequence length, hidden states). This means the layer norm can operate on the tensor without modification, but the group norm requires flipping the last two dimensions to operate on (batch size, hidden states, sequence length). All existing attention processors will have the same logic and we can consolidate it in a helper function `prepare_encoder_hidden_states` prepare_encoder_hidden_states -> norm_encoder_hidden_states re: @patrickvonplaten move norm_cross defined check to outside norm_encoder_hidden_states add missing attn.norm_cross check	2023-04-11 15:51:40 -07:00
Will Berman	2d52e81cb9	unet time embedding activation function (#3048 ) * unet time embedding activation function * typo act_fn -> time_embedding_act_fn * flatten conditional	2023-04-11 15:51:29 -07:00
Chanchana Sornsoontorn	52c4d32d41	Fix typo and format BasicTransformerBlock attributes (#2953 ) * ⚙️chore(train_controlnet) fix typo in logger message * ⚙️chore(models) refactor modules order; make them the same as calling order When printing the BasicTransformerBlock to stdout, I think it's crucial that the attributes order are shown in proper order. And also previously the "3. Feed Forward" comment was not making sense. It should have been close to self.ff but it's instead next to self.norm3 * correct many tests * remove bogus file * make style * correct more tests * finish tests * fix one more * make style * make unclip deterministic * ⚙️chore(models/attention) reorganize comments in BasicTransformerBlock class --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-12 00:31:05 +02:00
Will Berman	c6180a311c	add only cross attention to simple attention blocks (#3011 ) * add only cross attention to simple attention blocks * add test for only_cross_attention re: @patrickvonplaten * mid_block_only_cross_attention better default allow mid_block_only_cross_attention to default to `only_cross_attention` when `only_cross_attention` is given as a single boolean	2023-04-11 14:38:50 -07:00
Pedro Cuenca	e3095c5f47	Fix invocation of some slow Flax tests (#3058 ) * Fix invocation of some slow tests. We use __call__ rather than pmapping the generation function ourselves because the number of static arguments is different now. * style	2023-04-11 23:21:25 +02:00
Pedro Cuenca	526827c3d1	Fix scheduler type mismatch (#3041 ) When doing generation manually and using guidance_scale as a static argument.	2023-04-11 23:20:35 +02:00
George Ogden	cb63febf2e	Update documentation (#2996 ) * Update documentation Based on sampling, the width and height must be powers of 2 as the samples halve in size each time * make style	2023-04-11 19:02:13 +01:00
Will Berman	8c6b47cfde	`AttentionProcessor.group_norm` num_channels should be `query_dim` (#3046 ) * `AttentionProcessor.group_norm` num_channels should be `query_dim` The group_norm on the attention processor should really norm the number of channels in the query _not_ the inner dim. This wasn't caught before because the group_norm is only used by the added kv attention processors and the added kv attention processors are only used by the karlo models which are configured such that the inner dim is the same as the query dim. * add_{k,v}_proj should be projecting to inner_dim	2023-04-11 10:32:55 -07:00
Will Berman	67ec9cf513	accelerate min version for ProjectConfiguration import (#3042 )	2023-04-11 10:12:28 -07:00
Will Berman	80bc0c0ced	config fixes (#3060 )	2023-04-11 17:54:50 +01:00
Patrick von Platen	091a058236	make style	2023-04-11 15:51:21 +00:00
J N Hearns	881a6b58c3	Fix imports for composable_stable_diffusion pipeline (#3002 ) * Update composable_stable_diffusion.py Fix imports * Formatting * Formatting * Formatting	2023-04-11 16:50:25 +01:00
Steven Liu	cb9d77af23	[docs] Reusing components (#3000 ) * reuse-components * format	2023-04-11 15:34:34 +01:00
Patrick von Platen	8b451eb63b	Fix config prints and save, load of pipelines (#2849 ) * [Config] Fix config prints and save, load * Only use potential nn.Modules for dtype and device * Correct vae image processor * make sure in_channels is not accessed directly * make sure in channels is only accessed via config * Make sure schedulers only access config attributes * Make sure to access config in SAG * Fix vae processor and make style * add tests * uP * make style * Fix more naming issues * Final fix with vae config * change more	2023-04-11 13:35:42 +02:00
Patrick von Platen	8369196703	fix report tool (#3047 )	2023-04-11 10:55:00 +02:00
Mishig	4f48476dd6	Update contribution.mdx (#3054 ) * Update contribution.mdx hotfix for doc-builder parsing quote in heading bug * quoteation replace	2023-04-11 09:23:58 +02:00
Pedro Cuenca	fbc9a736dd	mps: skip unstable test (#3037 )	2023-04-11 06:36:54 +05:30
Rogério Júnior	67c3518f68	Small typo correction in comments (#3012 )	2023-04-10 13:48:35 -07:00
Andranik Movsisyan	ba49272db8	[Pipeline] Add TextToVideoZeroPipeline (#2954 ) * add TextToVideoZeroPipeline and CrossFrameAttnProcessor * add docs for text-to-video zero * add teaser image for text-to-video zero docs * Fix review changes. Add Documentation. Add test * clean up the codes in pipeline_text_to_video.py. Add descriptive comments and docstrings * make style && make quality * make fix-copies * make requested changes to docs. use huggingface server links for resources, delete res folder * make style && make quality && make fix-copies * make style && make quality * Apply suggestions from code review --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-04-10 22:09:53 +02:00
William Berman	074d281ae0	tests and additional scheduler fixes	2023-04-10 12:59:33 -07:00
William Berman	953c9d14eb	[bug fix] dpm multistep solver duplicate timesteps	2023-04-10 12:59:33 -07:00
luanjintai	85f1c19282	find another one accelerate parameter error	2023-04-10 12:23:17 -07:00
luanjintai	b5d0a9131d	fix wrong parameter name for accelerate	2023-04-10 12:23:17 -07:00
Pedro Cuenca	983a7fbfd8	Initial draft of Core ML docs (#2987 ) * Initial draft of Core ML docs. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Fix Core ML spelling * Apply the rest of suggestions. * Attempt to fix hyperlink inside Tip. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestions from code review --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-04-10 21:09:04 +02:00
William Berman	c413353e8e	add `encoder_hid_dim` to unet `encoder_hid_dim` provides an additional projection for the input `encoder_hidden_states` from `encoder_hidden_dim` to `cross_attention_dim`	2023-04-09 23:00:16 -07:00
William Berman	8db5e5b37d	allow unet varying number of layers per block	2023-04-09 22:57:26 -07:00
William Berman	707341aebe	resnet skip time activation and output scale factor	2023-04-09 22:55:33 -07:00
William Berman	26b4319ac5	do not overwrite scheduler instance variables with type casted versions	2023-04-09 22:34:29 -07:00
William Berman	18ebd57bd8	add missing AttnProcessor2_0 to AttentionProcessor union	2023-04-09 22:02:14 -07:00
William Berman	b6cc050245	fix simple attention processor encoder hidden states ordering	2023-04-09 21:57:56 -07:00
William Berman	0cbefefac3	clamp comment @sayakpaul	2023-04-09 21:54:50 -07:00
William Berman	1875c35aeb	remove extra min arg @sayakpaul	2023-04-09 21:54:50 -07:00
William Berman	1dc856e508	ddpm scheduler variance fixes	2023-04-09 21:54:50 -07:00

1 2 3 4 5 ...

2164 Commits