#

mscoco

Here are 56 public repositories matching this topic...

microsoft / Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

imagenet image-classification object-detection semantic-segmentation mscoco mask-rcnn ade20k swin-transformer

Updated Apr 12, 2024
Python

shunk031 / huggingface-datasets_MSCOCO

Microsoft COCO: Common Objects in Context for huggingface datasets

object-detection semantic-segmentation instance-segmentation mscoco mscoco-dataset microsoft-coco caption-generation keypoint-detection huggingface-datasets

Updated Mar 24, 2024
Python

CedricPicron / FQDet

FQDet: Fast-converging Query-based Detector

pytorch transformer object-detection mscoco

Updated Feb 20, 2024
Python

waikato-ufdl / wai-annotations

Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).

image-annotation conversion python3 vgg tfrecords mscoco deepspeech common-voice festvox

Updated Feb 2, 2024
Dockerfile

k2-gc / object-detection-format-converter

Object Detection Dataset Format Converter

python deep-learning python3 yolo object-detection kitti-dataset pascal-voc kitti mscoco mscoco-dataset pascal-voc-dataset dataset-converter yolo-dataset object-detection-datasets

Updated Jan 14, 2024
Python

Weed-AI / Weed-AI

A repository to support the development of a repository and interchange format for weed identification annotation

computer-vision datasets mscoco data-formats weed-recognition

Updated Dec 6, 2023
Python

labelformat

lightly-ai / labelformat

A tool for converting computer vision label formats.

annotation labels yolo object-detection bounding-boxes pascal-voc kitti mscoco yolov8

Updated Nov 24, 2023
Python

apple / ml-cvnets

CVNets: A library for training computer vision networks

machine-learning computer-vision deep-learning detection pytorch classification imagenet segmentation pascal-voc mscoco ade20k

Updated Oct 30, 2023
Python

pnkvalavala / image-captioning

Image Caption Generator using a Pretrained ResNet-50 and an LSTM architecture. Trained on COCO 2017 dataset, it's accessible via a Streamlit app.

python computer-vision deep-learning pytorch lstm image-captioning resnet mscoco streamlit

Updated Oct 5, 2023
Python

shunk031 / huggingface-datasets_COCOA

COCOA: Semantic Amodal Segmentation for huggingface datasets

cocoa semantic-segmentation mscoco huggingface huggingface-datasets bsds

Updated Sep 16, 2023
Python

CedricPicron / TPN

Trident Pyramid Networks for Object Detection (BMVC 2022)

pytorch object-detection mscoco

Updated Jul 5, 2023

YehLi / ImageNetModel

Official ImageNet Model repository

imagenet image-classification object-detection semantic-segmentation instance-segmentation mscoco cotnet vision-transformer contextual-transformer wave-vit dual-vit

Updated May 5, 2023
Jupyter Notebook

SwinTransformer / Swin-Transformer-Object-Detection

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

object-detection cascade mscoco mask-rcnn swin swin-transformer reppoints

Updated Apr 9, 2023
Python

ViTAE-Transformer / ViTAE-Transformer

The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"

deep-learning imagenet object-detection semantic-segmentation mscoco ade20k imagenet-classification vision-transformer vitae-transformer

Updated Apr 5, 2023
Python

HRNet / HRNet-Object-Detection

Object detection with multi-level representations generated from deep high-resolution representation learning (HRNetV2h). This is an official implementation for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919

faster-rcnn object-detection mscoco cascade-rcnn hrnets mmdetection

Updated Mar 8, 2023
Python

peteanderson80 / bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

caffe vqa faster-rcnn image-captioning captioning-images mscoco mscoco-dataset visual-question-answering

Updated Feb 3, 2023
Jupyter Notebook

peteanderson80 / coco-caption

Adds SPICE metric to coco-caption evaluation server codes

spice image-captioning mscoco-image-dataset captioning-images mscoco mscoco-dataset

Updated Feb 2, 2023
Jupyter Notebook

peteanderson80 / SPICE

Semantic Propositional Image Caption Evaluation

image-captioning captioning-images mscoco

Updated Feb 2, 2023
Java

gautamchitnis / cocoapi

Clone of COCO API - Dataset @ http://cocodataset.org/ - with changes to support Windows build and python3

mscoco mscoco-dataset cocodataset pycocotools

Updated Jan 14, 2023
Jupyter Notebook

deepplants / ViT-PCM

Official implementation of "Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation"

pascal-voc mscoco weakly-supervised-segmentation vision-transformer

Updated Jan 13, 2023
Python

Improve this page

Add a description, image, and links to the mscoco topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mscoco topic, visit your repo's landing page and select "manage topics."