Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

欢迎分享CVPR 2024 论文和代码 / Welcome to share the paper and code of CVPR 2024 #210

Open
amusi opened this issue Feb 27, 2024 · 75 comments

Comments

@amusi
Copy link
Owner

amusi commented Feb 27, 2024

[The format of the issue]
Paper name/title:
Paper link:
Code link:

@iamhankai
Copy link

Paper name/title: ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Paper link: https://arxiv.org/abs/2306.14525
Code link: https://parameternet.github.io/

@iamhankai
Copy link

Paper name/title: An Empirical Study of Scaling Law for OCR
Paper link: https://arxiv.org/abs/2401.00028
Code link: https://github.com/large-ocr-model/large-ocr-model.github.io

@KuanchihHuang
Copy link

Paper name/title: PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection
Paper link: https://arxiv.org/abs/2312.08371
Code link: https://github.com/kuanchihhuang/PTT

@ShunyuanZheng
Copy link

ShunyuanZheng commented Feb 27, 2024

Paper name/title: GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
Paper link: https://arxiv.org/abs/2312.02155
Code link: https://github.com/ShunyuanZheng/GPS-Gaussian
Project link: https://shunyuanzheng.github.io/GPS-Gaussian

@huliangxiao
Copy link

Paper name/title: GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper link: https://arxiv.org/abs/2312.02134
Code link: https://github.com/huliangxiao/GaussianAvatar

@TIANLE233
Copy link

Paper name/title: Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
Paper link: https://arxiv.org/abs/2312.04265
Code link: https://github.com/w1oves/Rein

@zhuangshaobin
Copy link

Paper name/title: Vlogger: Make Your Dream A Vlog
Paper link: https://arxiv.org/abs/2401.09414
Code link: https://github.com/Vchitect/Vlogger

@BarqueroGerman
Copy link

Paper name/title: Seamless Human Motion Composition with Blended Positional Encodings
Paper link: https://arxiv.org/abs/2402.15509
Code link: https://github.com/BarqueroGerman/FlowMDM

@buaacyw
Copy link

buaacyw commented Feb 27, 2024

Paper name/title: GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
Paper link: https://arxiv.org/abs/2311.14521
Code link: https://github.com/buaacyw/GaussianEditor

@Hansxsourse
Copy link

Paper name/title: UniGS: Unified Representation for Image Generation and Segmentation
Paper link: https://arxiv.org/abs/2312.01985

classification could be: Diffusion / Image Generation / Segmentation

@ch3cook-fdu
Copy link

Paper name/title: LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Paper link: https://arxiv.org/abs/2311.18651
Code link: https://github.com/Open3DA/LL3DA
Project link: https://ll3da.github.io/

@geometry-adaptation
Copy link

geometry-adaptation commented Feb 27, 2024

Paper name/title: CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update
Paper link: https://arxiv.org/pdf/2312.10908.pdf
Project link: https://clova-tool.github.io/

@thaoshibe
Copy link

Paper name/title: Edit One for All: Interactive Batch Image Editing
Paper link: https://arxiv.org/abs/2401.10219
Code link: https://github.com/thaoshibe/edit-one-for-all
Project page: https://thaoshibe.github.io/edit-one-for-all

@Nightmare-n
Copy link

Paper name/title: UniPAD: A Universal Pre-training Paradigm for Autonomous Driving
Paper link: https://arxiv.org/abs/2310.08370
Code link: https://github.com/Nightmare-n/UniPAD

@DearCaat
Copy link

Paper name/title: Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
Paper link: https://arxiv.org/abs/2402.17228
Code link: https://github.com/DearCaat/RRT-MIL

@Luffy03
Copy link

Luffy03 commented Feb 28, 2024

Paper name/title: VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis
Paper link: https://arxiv.org/abs/2402.17300
Code link: https://github.com/Luffy03/VoCo

@xb534
Copy link

xb534 commented Feb 28, 2024

Paper name/title: SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Paper link: https://arxiv.org/abs/2311.15537
Code link: https://github.com/xb534/SED

@WeichenFan
Copy link

Paper name/title: Link-Context Learning for Multimodal LLMs
Paper link: https://arxiv.org/pdf/2308.07891.pdf
Code link: https://github.com/isekai-portal/Link-Context-Learning/tree/main

@Murrol
Copy link

Murrol commented Feb 28, 2024

Paper name/title: MoMask: Generative Masked Modeling of 3D Human Motions
Paper link: https://arxiv.org/abs/2312.00063
Code link: https://github.com/EricGuo5513/momask-codes

@Andy1621
Copy link

Paper name/title: MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
Paper link: https://arxiv.org/abs/2311.17005
Code link: https://github.com/OpenGVLab/Ask-Anything/tree/main/video_chat2

@ethancohen123
Copy link

Paper name/title: ChAda-ViT : Channel Adaptive Attention for Joint Representation Learning of Heterogeneous Microscopy Images
Paper link: https://arxiv.org/abs/2311.15264
Code link: https://github.com/nicoboou/chada_vit

@ingra14m
Copy link

Paper name/title: Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction
Paper link: https://arxiv.org/abs/2309.13101
Code link: https://github.com/ingra14m/Deformable-3D-Gaussians
Project page: https://ingra14m.github.io/Deformable-Gaussians/

@ingra14m
Copy link

Paper name/title: SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes
Paper link: https://arxiv.org/abs/2312.14937
Code link: https://github.com/yihua7/SC-GS
Project page: https://yihua7.github.io/SC-GS-web/

@yyvhang
Copy link

yyvhang commented Feb 28, 2024

Paper name/title: LEMON: Learning 3D Human-Object Interaction Relation from 2D Images (Embodied AI)
Paper link: https://arxiv.org/abs/2312.08963
Code link: https://github.com/yyvhang/lemon_3d

@horseee
Copy link

horseee commented Feb 28, 2024

Paper name/title: DeepCache: Accelerating Diffusion Models for Free
Paper link: https://arxiv.org/abs/2312.00858
Code link: https://github.com/horseee/DeepCache

@SunzeY
Copy link

SunzeY commented Feb 29, 2024

Paper name/title: Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Paper link: https://arxiv.org/abs/2312.03818
Code link: https://github.com/SunzeY/AlphaCLIP

@yinanhe
Copy link

yinanhe commented Mar 1, 2024

Paper name/title: VBench: Comprehensive Benchmark Suite for Video Generative Models
Paper link: https://arxiv.org/abs/2311.17982
Code link: https://github.com/Vchitect/VBench
Project Page: https://vchitect.github.io/VBench-project/

@shikiw
Copy link

shikiw commented Mar 1, 2024

Paper name/title: OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
Paper link: https://arxiv.org/abs/2311.17911
Code link: https://github.com/shikiw/OPERA

@jameslahm
Copy link

Paper name/title: RepViT: Revisiting Mobile CNN From ViT Perspective
Paper link: https://arxiv.org/abs/2307.09283
Code link: https://github.com/THU-MIG/RepViT

@lixinustc
Copy link

lixinustc commented Mar 2, 2024

Paper name/title: SeD: Semantic-Aware Discriminator for Image Super-Resolution
Paper link: https://arxiv.org/abs/2402.19387
Code link: https://github.com/lbc12345/SeD

@zhengli97
Copy link

Paper name/title: PromptKD: Unsupervised Prompt Distillation for Vision-Language Models.
Paper link: https://arxiv.org/abs/2403.02781
Code link: https://github.com/zhengli97/PromptKD

@FYTalon
Copy link

FYTalon commented Mar 14, 2024

Paper name/title: PIE-NeRF🍕: Physics-based Interactive Elastodynamics with NeRF
Paper link: https://arxiv.org/abs/2311.13099
Code link: https://github.com/FYTalon/pienerf/

@jiuntian
Copy link

Paper name/title: InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model
Paper link: https://arxiv.org/abs/2312.05849
Code link: https://github.com/jiuntian/interactdiffusion

@924973292
Copy link

Paper name/title: Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification
Paper link: https://arxiv.org/abs/2403.10254
Code link: https://github.com/924973292/EDITOR

@YixunLiang
Copy link

Paper name/title: LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Paper link: https://arxiv.org/abs/2311.11284
Code link: https://github.com/EnVision-Research/LucidDreamer

@aeolusguan
Copy link

Paper name/title: Neural Markov Random Field for Stereo Matching
Paper link: https://arxiv.org/abs/2403.11193
Code link: https://github.com/aeolusguan/NMRF

@Kiteretsu77
Copy link

Paper name/title: APISR: Anime Production Inspired Real-World Anime Super-Resolution
Paper link: https://arxiv.org/abs/2403.01598
Code link: https://github.com/Kiteretsu77/APISR

@huangb23
Copy link

Paper name/title: VTimeLLM: Empower LLM to Grasp Video Moments
Paper link: https://arxiv.org/abs/2311.18445
Code link: https://github.com/huangb23/VTimeLLM

@yangyijune
Copy link

Paper name/title: MMA-Diffusion: MultiModal Attack on Diffusion Models
Paper link: https://arxiv.org/abs/2311.17516
Code link: https://github.com/yangyijune/MMA-Diffusion

@HyeonHo99
Copy link

Paper name/title: VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Paper link: https://arxiv.org/abs/2312.00845
Code link: https://github.com/HyeonHo99/Video-Motion-Customization
Project Page: https://video-motion-customization.github.io/

@xiuqhou
Copy link

xiuqhou commented Mar 26, 2024

Paper name/title: Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement
Paper link: https://arxiv.org/abs/2403.16131
Code link: https://github.com/xiuqhou/Salience-DETR

@zhangce01
Copy link

Paper name/title: HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
Paper link: https://arxiv.org/abs/2403.12033
Code link: https://github.com/zhangce01/HiKER-SGG
Project page: https://zhangce01.github.io/HiKER-SGG/

@cjerry1243
Copy link

Paper name/title: Learning from Synthetic Human Group Activities
Paper link: https://arxiv.org/abs/2306.16772
Code link: https://github.com/cjerry1243/M3Act
Project page: https://cjerry1243.github.io/M3Act/

@chen-si-jia
Copy link

Paper name/title: Delving into the Trajectory Long-tail Distribution for Muti-object Tracking
Paper link: https://arxiv.org/abs/2403.04700
Code link: https://github.com/chen-si-jia/Trajectory-Long-tail-Distribution-for-MOT

@Vegetebird
Copy link

Paper name/title: Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
Paper link: https://arxiv.org/pdf/2311.12028.pdf
Code link: https://github.com/NationalGAILab/HoT

@cherishleon
Copy link

Paper name/title: FairCLIP: Harnessing Fairness in Vision-Language Learning
Paper link: https://arxiv.org/abs/2403.19949
Code link: https://github.com/Harvard-Ophthalmology-AI-Lab/FairCLIP
Project Page: https://ophai.hms.harvard.edu/datasets/harvard-fairvlmed10k/

@QinYang79
Copy link

Paper name/title: Noisy-Correspondence Learning for Text-to-Image Person Re-identification
Paper link: https://arxiv.org/pdf/2308.09911.pdf
Code link: https://github.com/QinYang79/RDE

@littlepure2333
Copy link

Paper name/title: A Cross-Subject Brain Decoding Framework
Project Page: https://littlepure2333.github.io/MindBridge/
Paper link: https://arxiv.org/abs/2404.07850
Code link: https://github.com/littlepure2333/MindBridge

@Osilly
Copy link

Osilly commented Apr 16, 2024

Paper name/title: A General and Efficient Training for Transformer via Token Expansion
Paper link: https://arxiv.org/abs/2404.00672
Code link: https://github.com/Osilly/TokenExpansion

@YuqiYang213
Copy link

Paper name/title: Multi-Task Dense Prediction via Mixture of Low-Rank Experts
Paper link: https://arxiv.org/abs/2403.17749
Code link: https://github.com/YuqiYang213/MLoRE

@YuqiYang213
Copy link

Paper name/title: Traffic Scene Parsing through the TSP6K Dataset
Paper link: https://arxiv.org/pdf/2303.02835.pdf
Code link: https://github.com/PengtaoJiang/TSP6K

@dahyun-kang
Copy link

Paper name/title: Contrastive Mean-Shift Learning for Generalized Category Discovery
Paper link: https://arxiv.org/abs/2404.09451
Code link: https://github.com/sua-choi/CMS
Project page: https://postech-cvlab.github.io/cms/

@littlepure2333
Copy link

Paper name/title: A Cross-Subject Brain Decoding Framework Project Page: https://littlepure2333.github.io/MindBridge/ Paper link: https://arxiv.org/abs/2404.07850 Code link: https://github.com/littlepure2333/MindBridge

Sorry, the title should be:
MindBridge: A Cross-Subject Brain Decoding Framework

@pablomm
Copy link

pablomm commented Apr 24, 2024

Paper name/title: Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Paper link: https://arxiv.org/abs/2403.14291
Code link: https://github.com/vpulab/ovam

@Dayan-Guan
Copy link

Paper name/title: Efficient Test-Time Adaptation of Vision-Language Models
Paper link: https://arxiv.org/abs/2403.18293
Code link: https://github.com/kdiAAA/TDA

@TQTQliu
Copy link

TQTQliu commented Apr 29, 2024

Paper name/title: Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields
Paper link: https://arxiv.org/abs/2404.17528
Code link: https://github.com/TQTQliu/GeFu
Project page: https://gefucvpr24.github.io/

@2y7c3
Copy link

2y7c3 commented May 9, 2024

Paper name/title: Adversarial Score Distillation: When score distillation meets GAN
Arxiv link: https://arxiv.org/abs/2312.00739 (updating)
Paper link: https://2y7c3.github.io/pdfs/asd.pdf
Code link: https://github.com/2y7c3/ASD

@ZhaoChuyang
Copy link

Paper name/title: MS-DETR: Efficient DETR Training with Mixed Supervision
Paer link: https://arxiv.org/pdf/2401.03989
Code link: https://github.com/Atten4Vis/MS-DETR

@youngLBW
Copy link

Paper name/title: DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaptation by Combining 3D GANs and Diffusion Priors
Paper link: https://arxiv.org/abs/2312.16837
Project page: https://younglbw.github.io/DiffusionGAN3D-homepage
Code link: https://github.com/youngLBW/DiffusionGAN3D

@demo4ai
Copy link

demo4ai commented May 19, 2024

Paper name/title: BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition
Paper link: https://www.researchgate.net/publication/379411619_BlockGCN_Redefining_Topology_Awareness_for_Skeleton-Based_Action_Recognition
Code link: https://github.com/ZhouYuxuanYX/BlockGCN

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests