欢迎分享CVPR 2024 论文和代码 / Welcome to share the paper and code of CVPR 2024 #210

amusi · 2024-02-27T03:57:45Z

[The format of the issue]
Paper name/title:
Paper link:
Code link:

iamhankai · 2024-02-27T06:02:23Z

Paper name/title: ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Paper link: https://arxiv.org/abs/2306.14525
Code link: https://parameternet.github.io/

iamhankai · 2024-02-27T06:03:21Z

Paper name/title: An Empirical Study of Scaling Law for OCR
Paper link: https://arxiv.org/abs/2401.00028
Code link: https://github.com/large-ocr-model/large-ocr-model.github.io

KuanchihHuang · 2024-02-27T06:35:04Z

Paper name/title: PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection
Paper link: https://arxiv.org/abs/2312.08371
Code link: https://github.com/kuanchihhuang/PTT

ShunyuanZheng · 2024-02-27T06:42:07Z

Paper name/title: GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
Paper link: https://arxiv.org/abs/2312.02155
Code link: https://github.com/ShunyuanZheng/GPS-Gaussian
Project link: https://shunyuanzheng.github.io/GPS-Gaussian

huliangxiao · 2024-02-27T06:52:17Z

Paper name/title: GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper link: https://arxiv.org/abs/2312.02134
Code link: https://github.com/huliangxiao/GaussianAvatar

TIANLE233 · 2024-02-27T07:24:38Z

Paper name/title: Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
Paper link: https://arxiv.org/abs/2312.04265
Code link: https://github.com/w1oves/Rein

zhuangshaobin · 2024-02-27T11:18:09Z

Paper name/title: Vlogger: Make Your Dream A Vlog
Paper link: https://arxiv.org/abs/2401.09414
Code link: https://github.com/Vchitect/Vlogger

BarqueroGerman · 2024-02-27T11:21:45Z

Paper name/title: Seamless Human Motion Composition with Blended Positional Encodings
Paper link: https://arxiv.org/abs/2402.15509
Code link: https://github.com/BarqueroGerman/FlowMDM

buaacyw · 2024-02-27T11:34:49Z

Paper name/title: GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
Paper link: https://arxiv.org/abs/2311.14521
Code link: https://github.com/buaacyw/GaussianEditor

Hansxsourse · 2024-02-27T13:50:06Z

Paper name/title: UniGS: Unified Representation for Image Generation and Segmentation
Paper link: https://arxiv.org/abs/2312.01985

classification could be: Diffusion / Image Generation / Segmentation

ch3cook-fdu · 2024-02-27T15:33:56Z

Paper name/title: LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Paper link: https://arxiv.org/abs/2311.18651
Code link: https://github.com/Open3DA/LL3DA
Project link: https://ll3da.github.io/

geometry-adaptation · 2024-02-27T16:26:10Z

Paper name/title: CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update
Paper link: https://arxiv.org/pdf/2312.10908.pdf
Project link: https://clova-tool.github.io/

thaoshibe · 2024-02-27T18:29:07Z

Paper name/title: Edit One for All: Interactive Batch Image Editing
Paper link: https://arxiv.org/abs/2401.10219
Code link: https://github.com/thaoshibe/edit-one-for-all
Project page: https://thaoshibe.github.io/edit-one-for-all

Nightmare-n · 2024-02-28T01:18:17Z

Paper name/title: UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
Paper link: https://arxiv.org/abs/2310.08370
Code link: https://github.com/Nightmare-n/UniPAD

DearCaat · 2024-02-28T02:41:53Z

Paper name/title: Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
Paper link: https://arxiv.org/abs/2402.17228
Code link: https://github.com/DearCaat/RRT-MIL

Luffy03 · 2024-02-28T04:28:32Z

Paper name/title: VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis
Paper link: https://arxiv.org/abs/2402.17300
Code link: https://github.com/Luffy03/VoCo

xb534 · 2024-02-28T06:26:00Z

Paper name/title: SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Paper link: https://arxiv.org/abs/2311.15537
Code link: https://github.com/xb534/SED

WeichenFan · 2024-02-28T07:25:55Z

Paper name/title: Link-Context Learning for Multimodal LLMs
Paper link: https://arxiv.org/pdf/2308.07891.pdf
Code link: https://github.com/isekai-portal/Link-Context-Learning/tree/main

Murrol · 2024-02-28T07:49:54Z

Paper name/title: MoMask: Generative Masked Modeling of 3D Human Motions
Paper link: https://arxiv.org/abs/2312.00063
Code link: https://github.com/EricGuo5513/momask-codes

Andy1621 · 2024-02-28T09:13:28Z

Paper name/title: MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Paper link: https://arxiv.org/abs/2311.17005
Code link: https://github.com/OpenGVLab/Ask-Anything/tree/main/video_chat2

ethancohen123 · 2024-02-28T09:51:06Z

Paper name/title: ChAda-ViT : Channel Adaptive Attention for Joint Representation Learning of Heterogeneous Microscopy Images
Paper link: https://arxiv.org/abs/2311.15264
Code link: https://github.com/nicoboou/chada_vit

ingra14m · 2024-02-28T09:56:16Z

Paper name/title: Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction
Paper link: https://arxiv.org/abs/2309.13101
Code link: https://github.com/ingra14m/Deformable-3D-Gaussians
Project page: https://ingra14m.github.io/Deformable-Gaussians/

ingra14m · 2024-02-28T09:57:17Z

Paper name/title: SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes
Paper link: https://arxiv.org/abs/2312.14937
Code link: https://github.com/yihua7/SC-GS
Project page: https://yihua7.github.io/SC-GS-web/

yyvhang · 2024-02-28T11:13:47Z

Paper name/title: LEMON: Learning 3D Human-Object Interaction Relation from 2D Images (Embodied AI)
Paper link: https://arxiv.org/abs/2312.08963
Code link: https://github.com/yyvhang/lemon_3d

horseee · 2024-02-28T11:26:00Z

Paper name/title: DeepCache: Accelerating Diffusion Models for Free
Paper link: https://arxiv.org/abs/2312.00858
Code link: https://github.com/horseee/DeepCache

SunzeY · 2024-02-29T17:35:44Z

Paper name/title: Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Paper link: https://arxiv.org/abs/2312.03818
Code link: https://github.com/SunzeY/AlphaCLIP

yinanhe · 2024-03-01T04:55:51Z

Paper name/title: VBench: Comprehensive Benchmark Suite for Video Generative Models
Paper link: https://arxiv.org/abs/2311.17982
Code link: https://github.com/Vchitect/VBench
Project Page: https://vchitect.github.io/VBench-project/

shikiw · 2024-03-01T05:52:08Z

Paper name/title: OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
Paper link: https://arxiv.org/abs/2311.17911
Code link: https://github.com/shikiw/OPERA

jameslahm · 2024-03-01T06:23:49Z

Paper name/title: RepViT: Revisiting Mobile CNN From ViT Perspective
Paper link: https://arxiv.org/abs/2307.09283
Code link: https://github.com/THU-MIG/RepViT

lixinustc · 2024-03-02T05:46:55Z

Paper name/title: SeD: Semantic-Aware Discriminator for Image Super-Resolution
Paper link: https://arxiv.org/abs/2402.19387
Code link: https://github.com/lbc12345/SeD

zhengli97 · 2024-03-14T08:50:52Z

Paper name/title: PromptKD: Unsupervised Prompt Distillation for Vision-Language Models.
Paper link: https://arxiv.org/abs/2403.02781
Code link: https://github.com/zhengli97/PromptKD

FYTalon · 2024-03-14T21:44:06Z

Paper name/title: PIE-NeRF🍕: Physics-based Interactive Elastodynamics with NeRF
Paper link: https://arxiv.org/abs/2311.13099
Code link: https://github.com/FYTalon/pienerf/

jiuntian · 2024-03-15T08:30:40Z

Paper name/title: InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model
Paper link: https://arxiv.org/abs/2312.05849
Code link: https://github.com/jiuntian/interactdiffusion

924973292 · 2024-03-18T13:53:45Z

Paper name/title: Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification
Paper link: https://arxiv.org/abs/2403.10254
Code link: https://github.com/924973292/EDITOR

YixunLiang · 2024-03-19T06:03:47Z

Paper name/title: LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Paper link: https://arxiv.org/abs/2311.11284
Code link: https://github.com/EnVision-Research/LucidDreamer

aeolusguan · 2024-03-19T14:55:55Z

Paper name/title: Neural Markov Random Field for Stereo Matching
Paper link: https://arxiv.org/abs/2403.11193
Code link: https://github.com/aeolusguan/NMRF

Kiteretsu77 · 2024-03-21T02:26:10Z

Paper name/title: APISR: Anime Production Inspired Real-World Anime Super-Resolution
Paper link: https://arxiv.org/abs/2403.01598
Code link: https://github.com/Kiteretsu77/APISR

huangb23 · 2024-03-21T05:46:18Z

Paper name/title: VTimeLLM: Empower LLM to Grasp Video Moments
Paper link: https://arxiv.org/abs/2311.18445
Code link: https://github.com/huangb23/VTimeLLM

yangyijune · 2024-03-25T02:58:03Z

Paper name/title: MMA-Diffusion: MultiModal Attack on Diffusion Models
Paper link: https://arxiv.org/abs/2311.17516
Code link: https://github.com/yangyijune/MMA-Diffusion

HyeonHo99 · 2024-03-25T17:19:03Z

Paper name/title: VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Paper link: https://arxiv.org/abs/2312.00845
Code link: https://github.com/HyeonHo99/Video-Motion-Customization
Project Page: https://video-motion-customization.github.io/

xiuqhou · 2024-03-26T01:58:16Z

Paper name/title: Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement
Paper link: https://arxiv.org/abs/2403.16131
Code link: https://github.com/xiuqhou/Salience-DETR

zhangce01 · 2024-03-28T07:37:05Z

Paper name/title: HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
Paper link: https://arxiv.org/abs/2403.12033
Code link: https://github.com/zhangce01/HiKER-SGG
Project page: https://zhangce01.github.io/HiKER-SGG/

cjerry1243 · 2024-03-30T01:48:11Z

Paper name/title: Learning from Synthetic Human Group Activities
Paper link: https://arxiv.org/abs/2306.16772
Code link: https://github.com/cjerry1243/M3Act
Project page: https://cjerry1243.github.io/M3Act/

chen-si-jia · 2024-04-05T08:32:38Z

Paper name/title: Delving into the Trajectory Long-tail Distribution for Muti-object Tracking
Paper link: https://arxiv.org/abs/2403.04700
Code link: https://github.com/chen-si-jia/Trajectory-Long-tail-Distribution-for-MOT

Vegetebird · 2024-04-07T14:01:36Z

Paper name/title: Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
Paper link: https://arxiv.org/pdf/2311.12028.pdf
Code link: https://github.com/NationalGAILab/HoT

cherishleon · 2024-04-08T05:37:13Z

Paper name/title: FairCLIP: Harnessing Fairness in Vision-Language Learning
Paper link: https://arxiv.org/abs/2403.19949
Code link: https://github.com/Harvard-Ophthalmology-AI-Lab/FairCLIP
Project Page: https://ophai.hms.harvard.edu/datasets/harvard-fairvlmed10k/

QinYang79 · 2024-04-08T08:09:23Z

Paper name/title: Noisy-Correspondence Learning for Text-to-Image Person Re-identification
Paper link: https://arxiv.org/pdf/2308.09911.pdf
Code link: https://github.com/QinYang79/RDE

littlepure2333 · 2024-04-12T09:46:26Z

Paper name/title: A Cross-Subject Brain Decoding Framework
Project Page: https://littlepure2333.github.io/MindBridge/
Paper link: https://arxiv.org/abs/2404.07850
Code link: https://github.com/littlepure2333/MindBridge

Osilly · 2024-04-16T18:51:55Z

Paper name/title: A General and Efficient Training for Transformer via Token Expansion
Paper link: https://arxiv.org/abs/2404.00672
Code link: https://github.com/Osilly/TokenExpansion

YuqiYang213 · 2024-04-17T11:03:54Z

Paper name/title: Multi-Task Dense Prediction via Mixture of Low-Rank Experts
Paper link: https://arxiv.org/abs/2403.17749
Code link: https://github.com/YuqiYang213/MLoRE

YuqiYang213 · 2024-04-18T13:26:10Z

Paper name/title: Traffic Scene Parsing through the TSP6K Dataset
Paper link: https://arxiv.org/pdf/2303.02835.pdf
Code link: https://github.com/PengtaoJiang/TSP6K

dahyun-kang · 2024-04-20T09:53:17Z

Paper name/title: Contrastive Mean-Shift Learning for Generalized Category Discovery
Paper link: https://arxiv.org/abs/2404.09451
Code link: https://github.com/sua-choi/CMS
Project page: https://postech-cvlab.github.io/cms/

littlepure2333 · 2024-04-23T11:44:28Z

Paper name/title: A Cross-Subject Brain Decoding Framework Project Page: https://littlepure2333.github.io/MindBridge/ Paper link: https://arxiv.org/abs/2404.07850 Code link: https://github.com/littlepure2333/MindBridge

Sorry, the title should be:
MindBridge: A Cross-Subject Brain Decoding Framework

pablomm · 2024-04-24T17:24:54Z

Paper name/title: Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Paper link: https://arxiv.org/abs/2403.14291
Code link: https://github.com/vpulab/ovam

Dayan-Guan · 2024-04-29T11:11:29Z

Paper name/title: Efficient Test-Time Adaptation of Vision-Language Models
Paper link: https://arxiv.org/abs/2403.18293
Code link: https://github.com/kdiAAA/TDA

TQTQliu · 2024-04-29T13:42:18Z

Paper name/title: Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields
Paper link: https://arxiv.org/abs/2404.17528
Code link: https://github.com/TQTQliu/GeFu
Project page: https://gefucvpr24.github.io/

2y7c3 · 2024-05-09T03:34:39Z

Paper name/title: Adversarial Score Distillation: When score distillation meets GAN
Arxiv link: https://arxiv.org/abs/2312.00739 (updating)
Paper link: https://2y7c3.github.io/pdfs/asd.pdf
Code link: https://github.com/2y7c3/ASD

ZhaoChuyang · 2024-05-11T10:02:24Z

Paper name/title: MS-DETR: Efficient DETR Training with Mixed Supervision
Paer link: https://arxiv.org/pdf/2401.03989
Code link: https://github.com/Atten4Vis/MS-DETR

youngLBW · 2024-05-13T03:41:32Z

Paper name/title: DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaptation by Combining 3D GANs and Diffusion Priors
Paper link: https://arxiv.org/abs/2312.16837
Project page: https://younglbw.github.io/DiffusionGAN3D-homepage
Code link: https://github.com/youngLBW/DiffusionGAN3D

demo4ai · 2024-05-19T16:44:19Z

Paper name/title: BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition
Paper link: https://www.researchgate.net/publication/379411619_BlockGCN_Redefining_Topology_Awareness_for_Skeleton-Based_Action_Recognition
Code link: https://github.com/ZhouYuxuanYX/BlockGCN

欢迎分享CVPR 2024 论文和代码 / Welcome to share the paper and code of CVPR 2024 #210

欢迎分享CVPR 2024 论文和代码 / Welcome to share the paper and code of CVPR 2024 #210

Comments

amusi commented Feb 27, 2024

iamhankai commented Feb 27, 2024

iamhankai commented Feb 27, 2024

KuanchihHuang commented Feb 27, 2024

ShunyuanZheng commented Feb 27, 2024 • edited

huliangxiao commented Feb 27, 2024

TIANLE233 commented Feb 27, 2024

zhuangshaobin commented Feb 27, 2024

BarqueroGerman commented Feb 27, 2024

buaacyw commented Feb 27, 2024

Hansxsourse commented Feb 27, 2024

ch3cook-fdu commented Feb 27, 2024

geometry-adaptation commented Feb 27, 2024 • edited

thaoshibe commented Feb 27, 2024

Nightmare-n commented Feb 28, 2024

DearCaat commented Feb 28, 2024

Luffy03 commented Feb 28, 2024

xb534 commented Feb 28, 2024

WeichenFan commented Feb 28, 2024

Murrol commented Feb 28, 2024

Andy1621 commented Feb 28, 2024

ethancohen123 commented Feb 28, 2024

ingra14m commented Feb 28, 2024

ingra14m commented Feb 28, 2024

yyvhang commented Feb 28, 2024

horseee commented Feb 28, 2024

SunzeY commented Feb 29, 2024

yinanhe commented Mar 1, 2024

shikiw commented Mar 1, 2024

jameslahm commented Mar 1, 2024

lixinustc commented Mar 2, 2024 • edited

zhengli97 commented Mar 14, 2024

FYTalon commented Mar 14, 2024

jiuntian commented Mar 15, 2024

924973292 commented Mar 18, 2024

YixunLiang commented Mar 19, 2024

aeolusguan commented Mar 19, 2024

Kiteretsu77 commented Mar 21, 2024

huangb23 commented Mar 21, 2024

yangyijune commented Mar 25, 2024

HyeonHo99 commented Mar 25, 2024

xiuqhou commented Mar 26, 2024 • edited

zhangce01 commented Mar 28, 2024

cjerry1243 commented Mar 30, 2024

chen-si-jia commented Apr 5, 2024

Vegetebird commented Apr 7, 2024

cherishleon commented Apr 8, 2024

QinYang79 commented Apr 8, 2024

littlepure2333 commented Apr 12, 2024

Osilly commented Apr 16, 2024

YuqiYang213 commented Apr 17, 2024

YuqiYang213 commented Apr 18, 2024

dahyun-kang commented Apr 20, 2024

littlepure2333 commented Apr 23, 2024

pablomm commented Apr 24, 2024

Dayan-Guan commented Apr 29, 2024

TQTQliu commented Apr 29, 2024

2y7c3 commented May 9, 2024

ZhaoChuyang commented May 11, 2024

youngLBW commented May 13, 2024

demo4ai commented May 19, 2024

ShunyuanZheng commented Feb 27, 2024 •

edited

geometry-adaptation commented Feb 27, 2024 •

edited

lixinustc commented Mar 2, 2024 •

edited

xiuqhou commented Mar 26, 2024 •

edited