Stable Diffusion: Difference between revisions
Anlumo (Talk | contribs) No edit summary |
Anlumo (Talk | contribs) examples |
Line 25: | Line 25: | ||
* Text-to-image: Enter a text prompt (positive and negative) and generate a low-res image out of that. | * Text-to-image: Enter a text prompt (positive and negative) and generate a low-res image out of that. | ||
* Image-to-image: Take an image as input and modify it based on a text prompt. This can be used for style transfer for example, or taking the composition of another image for a new creation. | * Image-to-image: Take an image as input and modify it based on a text prompt. This can be used for style transfer for example, or taking the composition of another image for a new creation. | ||
* Inpainting: Same as image-to-image, but only modify a part of the image. This can be used to add or remove details in images, for example. | * Inpainting: Same as image-to-image, but only modify a part of the image. This can be used to add or remove details in images, for example. ([https://www.reddit.com/r/StableDiffusion/comments/11gbijd example]) | ||
* ControlNet: Applicable to any of the above. Take a reference image, extract some property of it, like the pose of a person or a depth map, and nudge the AI to generate one of the above outputs with this extra information. This can also be used in text-to-image to convert a pencil sketch to a photorealistic image, for example. | * ControlNet: Applicable to any of the above. Take a reference image, extract some property of it, like the pose of a person or a depth map, and nudge the AI to generate one of the above outputs with this extra information ([https://www.reddit.com/r/StableDiffusion/comments/11fn96y example]). This can also be used in text-to-image to convert a pencil sketch to a photorealistic image, for example. | ||
* Upscaling of images: This can increase the resolution of an image by adding details that weren't in the original image (like individual strands of hair). Usually this is used to increase the low resolution output of the techniques above to usable resolutions. | * Upscaling of images: This can increase the resolution of an image by adding details that weren't in the original image (like individual strands of hair). Usually this is used to increase the low resolution output of the techniques above to usable resolutions. | ||
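As a rough illustration of how inpainting differs from image-to-image in the list above, the following sketch takes a generated image and keeps it only where a mask is set, leaving the rest of the original untouched. It is a toy model in plain Python and not how Stable Diffusion itself is implemented: images are nested lists of grayscale values, and `composite_with_mask` is a hypothetical helper name introduced here. Real inpainting tools typically apply the mask inside the generation process rather than in a single blend at the end.

```python
def composite_with_mask(original, generated, mask):
    """Keep `original` where mask is 0 and take `generated` where mask is 1.

    All three arguments are lists of rows with identical dimensions.
    This is an illustrative helper, not part of any Stable Diffusion API.
    """
    if not (len(original) == len(generated) == len(mask)):
        raise ValueError("all inputs must have the same height")
    result = []
    for orig_row, gen_row, mask_row in zip(original, generated, mask):
        if not (len(orig_row) == len(gen_row) == len(mask_row)):
            raise ValueError("all rows must have the same width")
        # Per pixel: mask value 1 selects the generated pixel, 0 the original.
        result.append([
            m * g + (1 - m) * o
            for o, g, m in zip(orig_row, gen_row, mask_row)
        ])
    return result


# A 2x2 "image": only the top-right pixel is marked for repainting.
original = [[10, 10], [10, 10]]
generated = [[99, 99], [99, 99]]
mask = [[0, 1], [0, 0]]

inpainted = composite_with_mask(original, generated, mask)

# With a mask covering everything, the whole image is replaced,
# which corresponds to plain image-to-image in this toy model.
full = composite_with_mask(original, generated, [[1, 1], [1, 1]])
```

In this simplified view, image-to-image is the special case where the mask covers the whole picture.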
There have been some recent attempts at applying these capabilities to video as well. [https://www.youtube.com/shorts/iZZdogrTBVE Here is a YouTube Short] demonstrating this ([https://www.youtube.com/watch?v=GVT3WUa-48Y full video of the end result]). | |||
So, SD is much more capable than the commercial offerings like Midjourney. However, it also has way more knobs to adjust and settings to optimize. Thus, the idea for this talk. | So, SD is much more capable than the commercial offerings like Midjourney. However, it also has way more knobs to adjust and settings to optimize. Thus, the idea for this talk.