For the longest time I thought "diffusion" meant shrinking a model for production.
I thought the idea was: take a large pretrained model → reduce it to the size needed for production → deploy only what you need.
Basically some combination of pruning, compression, or reducing parameters.
Now I'm curious: can anyone explain the idea behind diffusion?
---