VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Updated May 23, 2024 · Python
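The VideoMAE entry above refers to masked video pre-training, whose key step is "tube masking": one spatial mask is sampled and repeated across all frames, so masked patches form tubes along the time axis and cannot be trivially recovered from neighboring frames. A minimal sketch of that masking step (function name and patch-grid sizes are illustrative, not from the repo):

```python
import numpy as np

def tube_mask(num_frames, h_patches, w_patches, mask_ratio=0.9, seed=0):
    """Sample one spatial mask and broadcast it across all frames,
    so masked patches form 'tubes' along the time axis (as in VideoMAE)."""
    rng = np.random.default_rng(seed)
    n_spatial = h_patches * w_patches
    n_masked = int(round(n_spatial * mask_ratio))
    flat = np.zeros(n_spatial, dtype=bool)
    # Mask a high fraction (e.g. 90%) of spatial patch positions.
    flat[rng.choice(n_spatial, size=n_masked, replace=False)] = True
    spatial = flat.reshape(h_patches, w_patches)
    # Same spatial pattern in every frame -> temporal "tubes".
    return np.broadcast_to(spatial, (num_frames, h_patches, w_patches))

# Example: 8 frames of a 14x14 patch grid (ViT-style 224px input, 16px patches).
mask = tube_mask(8, 14, 14, mask_ratio=0.9)
```

The high masking ratio is what makes the pre-training data-efficient: the encoder only processes the small visible subset of patches.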
Self-configuring and adapting vision transformer for segmentation of 3D images
A repo for practicing DL/genAI
Pre-training a VisionTransformer with Masked Image Modelling for semantic segmentation
A curated list of foundation models for vision and language tasks
MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.
Scenic: A Jax Library for Computer Vision Research and Beyond
The official repo for [IJCAI'24] "LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation"
Investigate possibilities for Vision Transformers with multiscale grids
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
A series of foundational computer vision projects that anyone diving into the field must tackle.
[ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection
The Brain Tumor MRI Dataset from Kaggle is employed for automated brain tumor detection and classification research. Methods investigated include pre-trained models (VGG16, ResNet50, and ViT). 🧠🔍
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
Video Foundation Models & Data for Multimodal Understanding
EfficientViT is a new family of vision models for efficient high-resolution vision.
OpenMMLab Detection Toolbox and Benchmark
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
A tool for classifying an image into a disaster type, utilizing Python