
OpenGVLab

General Vision Team of Shanghai AI Laboratory


Welcome to OpenGVLab! 👋

We are a research group from Shanghai AI Lab focused on vision-centric AI research. The GV in our name, OpenGVLab, stands for general vision: a general understanding of vision, so that little effort is needed to adapt to new vision-based tasks.

We develop model architectures and release pre-trained foundation models to the community to motivate further research in this area. We have made promising progress in general vision AI, with 109 SOTA results 🚀. In 2022, our open-sourced foundation models achieved 65.5 mAP on the COCO object detection benchmark and 91.1% Top-1 accuracy on Kinetics-400, landmark results for AI vision 👀 tasks in image 🖼️ and video 📹 understanding.

Building on these solid vision foundations, we have expanded into multi-modality models and generative AI (in partnership with Vchitect). We aim to empower individuals and businesses by offering a higher starting point for developing vision-based AI products and lessening the burden of building an AI model from scratch.
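As a rough sketch of how the released checkpoints can be picked up, assuming the Hugging Face Transformers library and an InternVL checkpoint published under the OpenGVLab organization (the model ID below is illustrative; check the OpenGVLab Hugging Face page for the exact names and usage instructions):

```python
# Minimal sketch: loading an OpenGVLab pre-trained checkpoint from Hugging Face.
# The model ID below is an assumption for illustration only; see the OpenGVLab
# Hugging Face page for the actual checkpoint names and recommended usage.
from transformers import AutoModel, AutoTokenizer

model_id = "OpenGVLab/InternVL-Chat-V1-5"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModel.from_pretrained(model_id, trust_remote_code=True).eval()
```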

Branches: Alpha (exploring the latest advances in vision+language research) and uni-medical (focusing on medical AI)

Follow us: Twitter · 🤗 Hugging Face · Medium · WeChat · Zhihu

Pinned

  1. InternVL

    [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source model approaching GPT-4V performance.

    Jupyter Notebook · 1.7k stars · 92 forks

  2. InternVideo

    Video Foundation Models & Data for Multimodal Understanding

    Python · 967 stars · 63 forks

  3. DCNv4

    [CVPR 2024] Deformable Convolution v4

    Python · 339 stars · 20 forks

  4. Ask-Anything

    [CVPR 2024 Highlight] [VideoChatGPT] ChatGPT with video understanding! Also supports many more LLMs, such as MiniGPT-4, StableLM, and MOSS.

    Python · 2.7k stars · 214 forks

  5. LLaMA-Adapter

    [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

    Python · 5.5k stars · 360 forks

  6. OmniQuant

    [ICLR 2024 Spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

    Python · 575 stars · 45 forks

Repositories

Showing 10 of 57 repositories
  • InternVL

    [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source model approaching GPT-4V performance.

    Jupyter Notebook · 1,682 stars · MIT license · 92 forks · 53 issues · 0 PRs · Updated May 9, 2024
  • Python · 69 stars · MIT license · 6 forks · 1 issue · 0 PRs · Updated May 9, 2024
  • video-mamba-suite

    A suite for video modeling with Mamba

    Python · 143 stars · MIT license · 14 forks · 5 issues · 1 PR · Updated May 3, 2024
  • InternImage

    [CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

    Python · 2,327 stars · MIT license · 222 forks · 166 issues · 5 PRs · Updated Apr 30, 2024
  • InternVideo

    Video Foundation Models & Data for Multimodal Understanding

    Python · 967 stars · Apache-2.0 license · 63 forks · 45 issues · 3 PRs · Updated Apr 30, 2024
  • MMT-Bench

    [ICML 2024] MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

    Python · 27 stars · 0 forks · 1 issue · 0 PRs · Updated Apr 28, 2024
  • PonderV2

    PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

    Python · 301 stars · MIT license · 5 forks · 3 issues · 0 PRs · Updated Apr 25, 2024
  • .github

    0 stars · 0 forks · 0 issues · 0 PRs · Updated Apr 24, 2024
  • EgoExoLearn

    Data and benchmark code for the EgoExoLearn dataset

    Python · 27 stars · MIT license · 0 forks · 1 issue · 0 PRs · Updated Apr 24, 2024
  • Multi-Modality-Arena

    Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

    Python · 373 stars · 26 forks · 13 issues · 0 PRs · Updated Apr 21, 2024