In text-to-image modelling, Stable Diffusion has greatly increased the pace of development of generative models. However, it does not come without its problems - including slow convergence and difficulty handling high-dimensional data (S., S., n.d.). In response, some researchers have proposed a finetuned variant of the model, named DreamShaper.
In fact, DreamShaper already has 8 versions - of which the last is presumed to be the final one.
If you wish to know more about the Stable Diffusion problems it solves, read this article. Here, we'll focus on using DreamShaper instead!
In this article, we're going to build a diffusers pipeline with DreamShaper 7 - more precisely, with an LCM-LoRA-finetuned version of it, which speeds up inference. See the header image for what it's capable of generating!
In order to run the code you'll create, you need to install torch, diffusers and matplotlib. Depending on your diffusers version, you may also need transformers, accelerate and peft - recent diffusers releases rely on peft for loading LoRA weights.
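Assuming a standard pip setup, the installation could look like this (the exact set of extra packages depends on your environment):
> pip install torch diffusers transformers accelerate peft matplotlib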
Let's create a file named dreamshaper7pipeline.py
. In it, we start with the imports and define some settings:
import torch
from diffusers import DiffusionPipeline, LCMScheduler
import matplotlib.pyplot as plt
size = 512 # 512x512 pixels
num_inference_steps = 4 # number of diffusion steps
guidance_scale = 0.0 # no guidance
Torch is needed because diffusers
depends on it; we'll visualize the images with Matplotlib.
As you can see, in this example we're generating 512 x 512 pixel images. Feel free to choose a smaller or larger size, but do recognize that this may impact the hardware you'll need to run the pipeline successfully! We use 4 diffusion steps and set the guidance scale to 0.0, disabling classifier-free guidance (a technical step related to the LCM-LoRA process).
The next step involves actually creating the DreamShaper 7 pipeline. We're using HuggingFace's DiffusionPipeline
for this purpose.
The DiffusionPipeline is the quickest way to load any pretrained diffusion pipeline from the Hub for inference (HuggingFace, n.d.).
We do this by initializing the DiffusionPipeline
from the pretrained Lykon/dreamshaper-7
model. Subsequently, we check if CUDA is available - in other words, if you can run this pipeline on your GPU - and if so, enable it. This will speed up running the pipeline significantly.
Then, we configure the LCMScheduler
from the pipeline's existing scheduler configuration and load the latent-consistency/lcm-lora-sdv1-5
weights. These are LoRA weights, meaning that the model was finetuned using the LoRA technique. However, it was done in a particular way: to enable fast inference. In fact, using these weights lets the model generate good samples with only a handful of diffusion steps, which speeds up inference a lot.
Multistep and onestep scheduler (Algorithm 3) introduced alongside latent consistency models in the paper Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference by Simian Luo, Yiqin Tan, Longbo Huang, Jian Li, and Hang Zhao. This scheduler should be able to generate good samples from LatentConsistencyModelPipeline in 1-8 steps (HuggingFace, n.d.).
Finally, we return the pipeline.
Here's the code:
def load_dreamshaper_lora_pipeline():
    """
    Load the DreamShaper 7 model with LCM LoRA adapters for fast inference.
    """
    # Create a DiffusionPipeline using the pretrained DreamShaper 7 model
    pipeline = DiffusionPipeline.from_pretrained("Lykon/dreamshaper-7")

    # Use CUDA if available
    if torch.cuda.is_available():
        pipeline.to("cuda")

    # Use the LCM LoRA adapters for fast inference
    pipeline.scheduler = LCMScheduler.from_config(pipeline.scheduler.config)
    pipeline.load_lora_weights("latent-consistency/lcm-lora-sdv1-5")

    return pipeline
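As an aside: if you want to verify the documentation's claim that 1-8 steps suffice, you could compare a few step counts using the function we just defined. This is a minimal, optional sketch - the prompt and file names are just examples:
# Optional experiment: compare LCM step counts (uses the function above)
pipeline = load_dreamshaper_lora_pipeline()
for steps in [1, 2, 4, 8]:
    result = pipeline(
        prompt="An orange at a beach",
        num_inference_steps=steps,
        guidance_scale=0.0,
    )
    # Save each result so the quality/speed trade-off can be inspected
    result.images[0].save(f"orange_{steps}_steps.png")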
Then, we ask the user for the prompt - in other words, what they want to visualize:
def ask_for_prompt():
    """
    Ask the user for a prompt.
    """
    prompt = input("What do you want to visualize?\n")
    return prompt
This is followed by a definition which allows us to generate the images. It takes the pipeline
, the prompt
and some extra settings:
- The num_inference_steps
parameter, which indicates the number of diffusion steps during inference. In our case, that's 4 steps.
- The guidance_scale
parameter, which controls classifier-free guidance; in our case, we disable it.
- The size
parameter, describing the width and the height of the image.
def generate_images(pipeline, prompt, num_inference_steps, guidance_scale, size):
    """
    Generate images using the pipeline.
    """
    results = pipeline(
        prompt=prompt,
        num_inference_steps=num_inference_steps,
        guidance_scale=guidance_scale,
        height=size,
        width=size
    )
    return results
Subsequently, we show the final image - this part just visualizes the result with Matplotlib.
def show_image(results):
    """
    Show an image.
    """
    # Create a figure without any border or axis
    fig, ax = plt.subplots(figsize=(30, 30))
    ax.imshow(results.images[0])
    ax.axis('off')  # Turn off axis labels and ticks

    # Show the image without borders
    plt.subplots_adjust(left=0, right=1, top=1, bottom=0)  # Remove extra white space
    plt.show()
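If you'd rather write the result to disk than display it, note that the pipeline returns standard PIL images, which come with a save method. A minimal optional helper could look like this - the function name and output path are just examples:
def save_image(results, path="output.png"):
    """
    Save the first generated image to disk.
    Relies on diffusers pipelines returning standard PIL images.
    """
    results.images[0].save(path)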
Finally, we combine everything in a main
definition:
def main():
    """
    Main function.
    """
    # Load the pipeline
    pipeline = load_dreamshaper_lora_pipeline()

    # Ask for a prompt
    prompt = ask_for_prompt()

    # Generate images
    results = generate_images(pipeline, prompt, num_inference_steps, guidance_scale, size)

    # Show the image
    show_image(results)


if __name__ == "__main__":
    main()
Let's now run the script.
> python dreamshaper7pipeline.py
...and now take a look at what it produces for some basic prompts.
An orange at a beach:
The skyline of New York City during sunset, dreamscape:
I also let ChatGPT generate a more complex prompt.
Create an image that combines the concept of 'bioluminescent jungle' with 'steampunk cityscape.' Imagine a lush, glowing forest filled with exotic flora and fauna juxtaposed against a sprawling metropolis of intricate, Victorian-inspired machinery. The blending of natural wonder and mechanical innovation should be visually stunning and captivating.
This is what it looks like:
Here's another one:
Imagine a world where gravity is reversed, and people live on the undersides of floating islands in the sky. Create an image that showcases the everyday life of the island-dwellers, from their upside-down houses and gardens to their unique modes of transportation. Highlight the challenges and innovations of living in a world with 'reverse gravity.'
Pretty awesome!
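Note that diffusion sampling is stochastic, so your images will differ from the ones above and between runs. If you want reproducible outputs, you can pass a seeded torch.Generator to the pipeline call. A minimal sketch, assuming the variables from our script; the seed of 42 is arbitrary:
# Optional: reproducible generation with a fixed seed
device = "cuda" if torch.cuda.is_available() else "cpu"
generator = torch.Generator(device).manual_seed(42)
results = pipeline(
    prompt=prompt,
    num_inference_steps=num_inference_steps,
    guidance_scale=guidance_scale,
    height=size,
    width=size,
    generator=generator,
)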
Here's the full code if you're interested:
import torch
from diffusers import DiffusionPipeline, LCMScheduler
import matplotlib.pyplot as plt

size = 512  # 512x512 pixels
num_inference_steps = 4  # number of diffusion steps
guidance_scale = 0.0  # no guidance


def load_dreamshaper_lora_pipeline():
    """
    Load the DreamShaper 7 model with LCM LoRA adapters for fast inference.
    """
    # Create a DiffusionPipeline using the pretrained DreamShaper 7 model
    pipeline = DiffusionPipeline.from_pretrained("Lykon/dreamshaper-7")

    # Use CUDA if available
    if torch.cuda.is_available():
        pipeline.to("cuda")

    # Use the LCM LoRA adapters for fast inference
    pipeline.scheduler = LCMScheduler.from_config(pipeline.scheduler.config)
    pipeline.load_lora_weights("latent-consistency/lcm-lora-sdv1-5")

    return pipeline


def ask_for_prompt():
    """
    Ask the user for a prompt.
    """
    prompt = input("What do you want to visualize?\n")
    return prompt


def generate_images(pipeline, prompt, num_inference_steps, guidance_scale, size):
    """
    Generate images using the pipeline.
    """
    results = pipeline(
        prompt=prompt,
        num_inference_steps=num_inference_steps,
        guidance_scale=guidance_scale,
        height=size,
        width=size
    )
    return results


def show_image(results):
    """
    Show an image.
    """
    # Create a figure without any border or axis
    fig, ax = plt.subplots(figsize=(30, 30))
    ax.imshow(results.images[0])
    ax.axis('off')  # Turn off axis labels and ticks

    # Show the image without borders
    plt.subplots_adjust(left=0, right=1, top=1, bottom=0)  # Remove extra white space
    plt.show()


def main():
    """
    Main function.
    """
    # Load the pipeline
    pipeline = load_dreamshaper_lora_pipeline()

    # Ask for a prompt
    prompt = ask_for_prompt()

    # Generate images
    results = generate_images(pipeline, prompt, num_inference_steps, guidance_scale, size)

    # Show the image
    show_image(results)


if __name__ == "__main__":
    main()