Tag: deep-learning
- LongLoRA Efficient Fine-tuning of Long-Context Large Language Models (27 Sep 2023)
- Video-ChatGPT Towards Detailed Video Understanding via Large Vision and Language Models (26 Sep 2023)
- VideoChat Chat-Centric Video Understanding (25 Sep 2023)
- MaMMUT A Simple Architecture for Joint Learning for MultiModal Tasks (24 Sep 2023)
- Scaling Vision Transformers (23 Sep 2023)
- Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone (22 Sep 2023)
- An Empirical Study of Training End-to-End Vision-and-Language Transformers (21 Sep 2023)
- FreeU Free Lunch in Diffusion U-Net (20 Sep 2023)
- 360 Reconstruction From a Single Image Using Space Carved Outpainting (19 Sep 2023)
- Rerender A Video Zero-Shot Text-Guided Video-to-Video Translation (18 Sep 2023)
- OmnimatteRF Robust Omnimatte with 3D Background Modeling (17 Sep 2023)
- NExT-GPT Any-to-Any Multimodal LLM (16 Sep 2023)
- Towards Practical Capture of High-Fidelity Relightable Avatars (15 Sep 2023)
- Mobile V-MoEs Scaling Down Vision Transformers via Sparse Mixture-of-Experts (14 Sep 2023)
- PhotoVerse Tuning-Free Image Customization with Text-to-Image Diffusion Models (13 Sep 2023)
- Large Language Models as Optimizers (12 Sep 2023)
- MagiCapture High-Resolution Multi-Concept Portrait Customization (11 Sep 2023)
- InstructDiffusion A Generalist Modeling Interface for Vision Tasks (10 Sep 2023)
- InstaFlow One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation (09 Sep 2023)
- Dynamic Mesh-Aware Radiance Fields (08 Sep 2023)
- Key-Locked Rank One Editing for Text-to-Image Personalization (07 Sep 2023)
- DiffBIR Towards Blind Image Restoration with Generative Diffusion Prior (06 Sep 2023)
- Neuralangelo High-Fidelity Neural Surface Reconstruction (03 Sep 2023)
- Multimodal Learning with Transformers A Survey (02 Sep 2023)
- DreamFusion Text-to-3D using 2D Diffusion (01 Sep 2023)
- DreamBooth Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation (31 Aug 2023)
- MovieChat From Dense Token to Sparse Memory for Long Video Understanding (30 Aug 2023)
- TokenFlow Consistent Diffusion Features for Consistent Video Editing (29 Aug 2023)
- Diff-Instruct A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models (28 Aug 2023)
- Efficient Geometry-aware 3D Generative Adversarial Networks (27 Aug 2023)
- Tool Learning with Foundation Models (26 Aug 2023)
- Knowledge Distillation A Survey (25 Aug 2023)
- 3D Gaussian Splatting for Real-Time Radiance Field Rendering (24 Aug 2023)
- ProlificDreamer High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (20 Aug 2023)
- Elucidating the Design Space of Diffusion-Based Generative Models (19 Aug 2023)
- Scalable Adaptive Computation for Iterative Generation (18 Aug 2023)
- NeuralField-LDM Scene Generation with Hierarchical Latent Diffusion Models (17 Aug 2023)
- Unified Model for Image, Video, Audio and Language Tasks (16 Aug 2023)
- Teach LLMs to Personalize -An Approach inspired by Writing Education (15 Aug 2023)
- FineRecon Depth-aware Feed-forward Network for Detailed 3D Reconstruction (14 Aug 2023)
- Link-Context Learning for Multimodal LLMs (13 Aug 2023)
- AVIS Autonomous Visual Information Seeking with Large Language Models (12 Aug 2023)
- ProPainter Improving Propagation and Transformer for Video Inpainting (10 Aug 2023)
- MusicLM Generating Music From Text (09 Aug 2023)
- MVSNeRF Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo (08 Aug 2023)
- SimVLM Simple Visual Language Model Pretraining with Weak Supervision (07 Aug 2023)
- InternVideo General Video Foundation Models via Generative and Discriminative Learning (06 Aug 2023)
- Image as a Foreign Language BEiT Pretraining for All Vision and Vision-Language Tasks (05 Aug 2023)
- BLIP-2 Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models (04 Aug 2023)
- DualToken-ViT Position-aware Efficient Vision Transformer with Dual Token Fusion (03 Aug 2023)
- Visual Instruction Tuning (02 Aug 2023)
- CoCa Contrastive Captioners are Image-Text Foundation Models (31 Jul 2023)
- FLAVA A Foundational Language And Vision Alignment Model (30 Jul 2023)
- DALL-E, DALL-E2 and StoryDALL-E (30 Sep 2022)
- Pix2seq A Language Modeling Framework for Object Detection (28 Sep 2022)
- DreamBooth Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation (28 Sep 2022)
- HeadNeRF A Real-time NeRF-based Parametric Head Model (27 Sep 2022)
- CLIP Learning Transferable Visual Models From Natural Language Supervision (27 Sep 2022)
- pixelNeRF Neural Radiance Fields from One or Few Images (26 Sep 2022)
- NeuMan Neural Human Radiance Field from a Single Video (26 Sep 2022)
- Nerfies Deformable Neural Radiance Fields (26 Sep 2022)
- NeRF in the Wild (25 Sep 2022)
- GIRAFFE Representing Scenes as Compositional Generative Neural Feature Fields (25 Sep 2022)
- Encoding Method for NERF (24 Sep 2022)
- Recent Adavances of Diffusion Models (24 Sep 2022)
- unCLIP-Hierarchical Text-Conditional Image Generation with CLIP Latents (23 Sep 2022)
- Stable Diffusion (23 Sep 2022)
- Diffusion Model (22 Sep 2022)
- Neural Radiance Field (15 Apr 2022)
- Modern Convolution Neutral Network (20 Mar 2022)
- Self Supervised Learning Reading Note (02 Aug 2021)
- Unsupervised Domain Adaption (30 Jul 2021)
- Neural Lumigraph Rendering (24 Jun 2021)
- NeX Real-time View Synthesis with Neural Basis Expansion (23 Jun 2021)
- GeoSim Realistic Video Simulation via Geometry-Aware Composition for Self-Driving (23 Jun 2021)
- CVPR 2021 Best Papers Candidates (23 Jun 2021)
- Landmark Detection for Animal Face and 3D Reconstructions (01 Jun 2021)
- Residual Parameter Transfer for Deep Domain Adaptation (23 May 2021)
- Domain Adaption Paper Reading List (10 May 2021)
- MLP-Mixer An all-MLP Architecture for Vision (08 May 2021)
- GANFIT Generative Adversarial Network Fitting for High Fidelity 3D Face Reconstruction (01 May 2021)
- Avatarme Realistically renderable 3d facial reconstruction in-The-wild (28 Apr 2021)
- RingNet-Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision (23 Apr 2021)
- Learning an Animatable Detailed 3D Face Model from In-The-Wild Images (18 Apr 2021)
- AlphaPose--Multip Personal Human Pose Estimation (17 Apr 2021)
- Transformer Introduction (14 Apr 2021)
- Swin Transformer (11 Apr 2021)
- CVPR 2021 Transformer Paper (11 Apr 2021)
- My Paper Reading List For Facial Landmark Detection (01 Apr 2021)
- ViT AN IMAGE IS WORTH 16X16 WORDS TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE (28 Mar 2021)
- My Paper Reading List for 3D Face Reconstructions (20 Mar 2021)
- End-to-End Object Detection with Transformers (07 Mar 2021)
- Must-read AI Papers (16 Feb 2021)
- Transformer in Computer Vision (03 Feb 2021)
- 61 Interesting Paper from NeurIPS 2019 (10 Nov 2019)
- 7 Interesting Papers from ACM MM 2019 (10 Nov 2019)
- Low Light Enhancement (22 Sep 2019)
- Anchor Free Object Detection (15 Sep 2019)
- 3D Reconstruction (15 Sep 2019)
- GAN Roadmap (07 Sep 2019)
- Text Detection (20 Aug 2019)
- CVPR 45 Paper into Best Paper Finals (12 Aug 2019)
- Image Alignment (10 Aug 2019)
- Optical Flow (05 Aug 2019)
- Self Attention (01 Aug 2019)
- 3D Object Detection (01 Aug 2019)
- Video Object Segmentation (31 Jul 2019)
- Siamese Network (21 Jul 2019)
- Face Reconstruction in CVPR 2019 (18 Jul 2019)
- Face Detection, Landmark Detection in CVPR 2019 (17 Jul 2019)
- Face Attribute in CVPR 2019 (17 Jul 2019)
- Anti Face Spoofing (16 Jul 2019)
- Graph Convolutional Neural Network (08 Jul 2019)
- Network Compression Updates in 2019 (30 Jun 2019)
- Graph Embedding (29 Jun 2019)
- Install PyTorch for Nvidia Jetson Nano (25 Jun 2019)
- Pyramid in Neural Network (24 Jun 2019)
- Deep Learning based Semantic Segmentation Algorithm (22 Jun 2019)
- ICML 2019 Best Papers and Honourable of Mention (11 Jun 2019)
- Neural Architecture Search A Survey (30 May 2019)
- Object Detection Update (2019/1~2019/3) (22 May 2019)
- SKNet, GCNet, GloRe, Octave (16 May 2019)
- Siamese Network Based Single Object Tracking (15 May 2019)
- Setting Up Jetson Nano (10 May 2019)
- IJCAI Best Papers (10 May 2019)
- Single Object Tracking (08 May 2019)
- Human Pose Estimation with Deep Learning (05 May 2019)
- A Recipe for Training Neural Networks (29 Apr 2019)
- Neural Architecture Search (24 Apr 2019)
- Face Recognition (18 Apr 2019)
- Loss Functions (15 Apr 2019)
- Face Landmark Detection (15 Apr 2019)
- Convolution Nerual Network Backbone (15 Apr 2019)
- Pose Estimation (14 Apr 2019)
- Deep Learning Hardware for Embedded (13 Apr 2019)
- Visual Localization via Deep Learning (13 Apr 2019)
- An Overview of Normalization Methods in Deep Learning (10 Apr 2019)
- ResNet and Its Variations (08 Apr 2019)
- Effcient Deep Neural Network (07 Apr 2019)
- Generative adversarial network (05 Apr 2019)
- Transfer Learning (04 Apr 2019)
- Network Compression (03 Apr 2019)
- Different Types of Convolutions in Deep Learning (02 Apr 2019)
- Object Detections (02 Apr 2019)
- Image Segmentation (02 Apr 2019)