Paper Review - Audio-Visual Related Research (WIP)
Created|ML/CV/NLP
|Post Views:
Author: Vines
Copyright Notice: All articles on this blog are licensed under CC BY-NC-SA 4.0 unless otherwise stated.
Related Articles

2022-02-11
Paper Review - AnimeGAN
Studying image-to-image translation. Overview of 2019 ISICA paper "AnimeGAN - A Novel Lightweight GAN for Photo Animation".

2022-01-21
Paper Review - CartoonGAN
Studying image-to-image translation. Overview of 2018 CVPR paper "CartoonGAN- Generative Adversarial Networks for Photo Cartoonizations".

2022-08-18
Paper Review - Pix2Pix, CycleGAN
Studying image-to-image translation. Overview of 2017 CVPR paper "Image-to-Image Translation with Conditional Adversarial Networks" and 2017 ICCV paper "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks".

2024-07-17
Paper Review - ColorPeel
An interesting paper from ECCV2024. It talks about the color and shape disentanglement on Text-to-Image models. The solution is simple yet effective.

2024-10-16
Paper Review - Modality Gap and Alignment in Multi-modal Contrastive Learning
Contrastive learning is a popular self-supervised learning technique that has shown remarkable success in training deep neural networks. The core idea behind contrastive learning is to learn representations that are not only discriminative but also invariant to various transformations. This is achieved by contrasting positive and negative samples in the embedding space.

2023-01-31
Paper Review - Diffusion Models Applications
Some keypoints and details jot from CVPR 2022 tutorial - Tutorial on Denoising Diffusion-based Generative Modeling - Foundations and Applications
Announcement
Breaking Change - :year/:month/:day/:title/ => :title/