Figure317
Home
About
Portfolio
Blog
Contact
Blogs
All
Model Compression
Object detection
Segmentation
Diffusion
GAN
Autonomy-driving
Dev course
Coding
[Perception] A Gentle Introduction to Face Recognition in Deep Learning
Read More
[Paper review] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
Read More
[Paper review] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
Read More
[Paper review] In-Domain GAN Inversion for Real Image Editing
Read More
[Paper review] Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Read More
[Paper review] Semantic Image Synthesis via Diffusion Models
Read More
[Paper review] PTI: Pivotal Tuning for Latent-based Editing of Real Images
Read More
[Paper review] NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Read More
[Paper review] MFIM: Megapixel Facial Identity Manipulation
Read More
[Paper review] Generating Long Videos of Dynamic Scenes (LongVideoGAN)
Read More
[Paper review] Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
Read More
[Paper review] HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing
Read More
[Paper review] Attention to Scale: Scale-aware Semantic Image Segmentation
Read More
[Autonomy devcourse 1$_{st}$] Linux
Read More
[Paper review] Label-Efficient Semantic Segmentation with Diffusion Models
Read More
[Paper review] TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation
Read More
[Paper review] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer (DualStyle...
Read More
[Paper review] Denoising Diffusion Probabilistic Models (DDPM)
Read More
[Paper review] CoAtNet: Marrying Convolution and Attention for All Data Sizes
Read More
[Paper review] BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segme...
Read More
[Paper review] CARD: Classification and Regression Diffusion Models
Read More
[Paper review] Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
Read More
[Paper review] DiffCollage: Parallel Generation of Large Content with Diffusion Models
Read More
[Paper review] Noise2Music: Text-conditioned Music Generation with Diffusion Models
Read More
[Paper review] Robust One-Shot Singing Voice Conversion (ROSVC)
Read More
[Paper review] Imitating Human Behaviour with Diffusion Models
Read More
[Paper review] Semi-Parametric Neural Image Synthesis
Read More
[Paper review] Training language models to follow instructions with human feedback (InstructGPT)
Read More
[Paper review] Regularized Vector Quantization for Tokenized Image Synthesis (Reg-VQ)
Read More
[Paper review] Diffusion-LM Improves Controllable Text Generation
Read More
[Paper review] SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers
Read More
[Paper review] WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Read More
[Paper review] WaveGrad: Estimating Gradients for Waveform Generation
Read More
[Paper review] PARASOL: Parametric Style Control for Diffusion Image Synthesis
Read More
[Paper review] Hybrid Transformers for Music Source Separation (HT Demucs)
Read More
[Paper review] Learning to Simulate Complex Physics with Graph Networks (GNS)
Read More
[Paper review] Diffusion-based Generative Speech Source Separation (DiffSep)
Read More
[Paper review] Fast Text-Conditional Discrete Denoising on Vector-Quantized Latent Spaces (Paella)
Read More
[Paper review] eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Read More
[Paper review] Towards Practical Plug-and-Play Diffusion Models (PPAP)
Read More
[Paper review] GLIGEN: Open-Set Grounded Text-to-Image Generation
Read More
[Paper review] Scaling up GANs for Text-to-Image Synthesis (GigaGAN)
Read More
[Paper review] AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
Read More
[Paper review] Scalable Adaptive Computation for Iterative Generation (RIN)
Read More
[Paper review] SinFusion: Training Diffusion Models on a Single Image or Video
Read More
[Paper review] Restoration based Generative Models (RGM)
Read More
[Paper review] Unlimited-Size Diffusion Restoration
Read More
[Paper review] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model (DDNM)
Read More
[Paper review] Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech (Face-TTS)
Read More
[Paper review] Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for...
Read More
[Paper review] DreamFusion: Text-to-3D using 2D Diffusion
Read More
[Paper review] Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding (Im...
Read More
[Paper review] Fast Sampling of Diffusion Models via Operator Learning (DSNO)
Read More
[Paper review] Refining Generative Process with Discriminator Guidance in Score-based Diffusion M...
Read More
[Paper review] Star-Shaped Denoising Diffusion Probabilistic Models (SS-DDPM)
Read More
[Paper review] PhysDiff: Physics-Guided Human Motion Diffusion Model
Read More
[Paper review] Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via ...
Read More
[Paper review] 3D Shape Generation and Completion through Point-Voxel Diffusion (PVD)
Read More
[Paper review] Point-E: A System for Generating 3D Point Clouds from Complex Prompts
Read More
[Paper review] Symbolic Music Generation with Diffusion Models
Read More
[Paper review] Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untr...
Read More
[Paper review] Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance
Read More
[Paper review] Cross-domain Compositing with Pretrained Diffusion Models
Read More
[Paper review] simple diffusion: End-to-end diffusion for high resolution images
Read More
[Paper review] Don’t Play Favorites: Minority Guidance for Diffusion Models
Read More
[Paper review] Planning with Diffusion for Flexible Behavior Synthesis (Diffuser)
Read More
[Paper review] Improved Denoising Diffusion Probabilistic Models (Improved DDPM)
Read More
[Paper review] Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Read More
[Paper review] Make-A-Video: Text-to-Video Generation without Text-Video Data
Read More
[Paper review] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Gene...
Read More
[Paper review] Conditional Image Generation with Score-Based Diffusion Models (CMDE)
Read More
[Paper review] Video Probabilistic Diffusion Models in Projected Latent Space (PVDM)
Read More
[Paper review] DAG: Depth-Aware Guidance with Denoising Diffusion Probabilistic Models
Read More
[Paper review] D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
Read More
[Paper review] DiffusionInst: Diffusion Model for Instance Segmentation
Read More
[Paper review] Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Read More
[Paper review] ADIR: Adaptive Diffusion for Image Reconstruction
Read More
[Paper review] Improving Sample Quality of Diffusion Models Using Self-Attention Guidance
Read More
[Paper review] Conffusion: Confidence Intervals for Diffusion Models
Read More
[Paper review] SDM: Spatial Diffusion Model for Large Hole Image Inpainting
Read More
[Paper review] Latent Diffusion for Language Generation
Read More
[Paper review] HS-Diffusion: Learning a Semantic-Guided Diffusion Model for Head Swapping
Read More
[Paper review] Blended Diffusion for Text-driven Editing of Natural Images
Read More
[Paper review] Dynamic Dual-Output Diffusion Models
Read More
[Paper review] Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
Read More
[Paper review] Few-Shot Diffusion Models
Read More
[Paper review] Variational Diffusion Models
Read More
[Paper review] DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Read More
[Paper review] DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Read More
[Paper review] Emerging Properties in Self-Supervised Vision Transformers (DINO)
Read More
[Paper review] Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme
Read More
[Paper review] Perception Prioritized Training of Diffusion Models (P2 weighting)
Read More
[Paper review] Diffusion models for Handwriting Generation
Read More
[Paper review] DiffFace: Diffusion-based Face Swapping with Facial Guidance
Read More
[Paper review] Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
Read More
[Paper review] RePaint: Inpainting using Denoising Diffusion Probabilistic Models
Read More
[Paper review] Any-speaker Adaptive Text-To-Speech Synthesis with Diffusion Models (Grad-StyleSpe...
Read More
[Paper review] DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Read More
[Paper review] Cascaded Diffusion Models for High Fidelity Image Generation
Read More
[Paper review] Diffusion-GAN: Training GANs with Diffusion
Read More
[Paper review] DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion
Read More
[Paper review] Classifier-Free Diffusion Guidance
Read More
[Paper review] Score-Based Generative Modeling through Stochastic Differential Equations
Read More
[Paper review] Diffusion Models Beat GANs on Image Synthesis
Read More
[Paper review] Improved Vector Quantized Diffusion Models (Improved VQ-Diffusion)
Read More
[Paper review] VQ-Diffusion: Vector Quantized Diffusion Model for Text-to-Image Synthesis
Read More
[Paper review] High-Resolution Image Synthesis with Latent Diffusion Models (Stable Diffusion)
Read More
[Paper review] Autoregressive Image Generation using Residual Quantization (RQ-VAE-Transformer)
Read More
[Paper review] Denoising Diffusion Implicit Models (DDIM)
Read More
[Paper review] ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement
Read More
Redesigning Skip Connections to Exploit (UNet++)
Read More
[Paper review] Multi-Scale Context Aggregation by Dilated Convolutions (DilatedNet)
Read More
[Paper review] DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
Read More