Table of contents About This is a short post to collect some references and notes about generative modeling and to try to experiment with generating images of cats using KPN denoising in pixel space. It鈥檚 not a comprehensive technical report, rather I was just curious if I get anything at all with this approach. Post Usually for image generation diffusion models operate in latent space [3] using direct prediction of noise, but I wanted to see how well it works in pixel space by using KPN bilateral filters and on top of that predict the low rank target directly, instead of predicting noised velocity, this allows to have a low rank bottleneck that can help with generalization and reduce the capacity of the network compared to those that need to be predicting full rank off-manifold targets. src [4] So what I鈥檓 trying is iterative projection to the low rank manifold using a denoising kernel operator. The advantage of KPN is that it has a really good regularization bias as well as behaving well after quantization which makes it suitable for deployment on edge devices. Also KPN filters can be efficiently implemented on GPUs. The model is trained on 64x64 images of cats from the Cats faces 64x64 (For generative models) using an architecture with a 8x8 patch transformer in the backbone and a stack of upscaling convolutions that drive KPN filtering network for denoising. The training process involves lerping the image to Gaussian noise, such that information is gradually lost and then training the model to predict the original image from that noised input using L2 and LPIPS [7]. The problem with bilateral filters is that they compute the output as a convex combination of input pixels, which makes it hard to create information that is not already present in the input. To mitigate this, I use a separate low capacity network to predict color drift(bias) that is added after the filtering step, the architecure of the drift prediction model is much simpler which guarantees that it...
First seen: 2025-11-27 22:39
Last seen: 2025-11-27 23:39