For the longest time I thought "diffusion" meant shrinking a model for production πŸ˜…

I thought the idea was: Take a large pretrained model β†’ reduce it to the size needed for production β†’ deploy only what you need.

Original Source

Introduction

I thought the idea was: Take a large pretrained model β†’ reduce it to the size needed for production β†’ deploy only what you need.

Basically some combination of pruning, compression, or reducing parameters.

Original Source

Why Diffusion?

Diffusion models are an entirely different thing related to generating outputs from noise.

Now I'm curious: Can anyone explain the idea behind diffusion?

---