vision

Here are 1,499 public repositories matching this topic...

Skyvern-AI / skyvern

Automate browser-based workflows with LLMs and Computer Vision

python api workflow automation browser computer vision gpt browser-automation rpa playwright llm

Updated May 20, 2024
Python

TIGER-AI-Lab / Mantis

Official code for Paper "Mantis: Multi-Image Instruction Tuning"

language video vision mantis vlm multimodal lmm fuyu mllm llava-llama3 multi-image-understanding

Updated May 20, 2024
Python

Rahuletto / yolo

My YOLOv8 learning pathway. Just for fun! You only look (live) once

vision computervision ultralytics yolov8

Updated May 20, 2024
Jupyter Notebook

danny-avila / LibreChat

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development.

Updated May 20, 2024
TypeScript

mrousavy / react-native-vision-camera

📸 A powerful, high-performance React Native Camera library.

Updated May 20, 2024
Swift

frc4533-lincoln / chalkydri

A blazingly fast FRC vision system

rust frc vision blazingly-fast

Updated May 20, 2024
Rust

michaelbach / FrACT10

The Freiburg Vision Test (FrACT) assesses visual acuities and contrast thresholds. It runs in any modern browser, or as a web app.

contrast vision psychophysics cappuccino objective-j visual-acuity

Updated May 20, 2024
Objective-J

yankailab / OpenKAI

OpenKAI: A modern framework for unmanned vehicle and robot control

framework robot drone pixhawk vision jetson unmanned

Updated May 20, 2024
C

DaniilShmoylove / CounterML-iOS

iOS app that implements state-of-the-art machine learning and computer vision integration. The application is written in Swift and built on the CoreML and Vision frameworks.

swift machine-learning avfoundation vision ios-swift macos-swift firebase-authentication coreml mvvm-ios firebase-firestore combine-framework

Updated May 20, 2024
C++

paperClub-hub / paperClub_daily

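PaperClub resource site: continuously shares small and medium-sized projects, mainly practical engineering projects covering vision algorithms, text algorithms, and front-end and back-end work; the main development languages are Python, Vue, and others.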

nlp machine-learning vue algorithms vision paperclub

Updated May 20, 2024
Jupyter Notebook

alexdredmon / crayeye

Multimodal LLM visual analysis multitool

app mobile ai vision llm

Updated May 20, 2024
Dart

PhotonVision / photonvision

PhotonVision is the free, fast, and easy-to-use computer vision solution for the FIRST Robotics Competition.

java opencv computer-vision frc vision wpilib vision-processing

Updated May 20, 2024
Java

0015 / ChatGPT_Client_For_Arduino

Library for communication with ChatGPT. It now supports vision questions.

vision arduino-library thatproject chatgpt chatgpt-api gpt-4o-api

Updated May 19, 2024
C++

Sanj-bot / codingINCV

This repo contains projects and learning material related to computer vision.

opencv computer vision cv2

Updated May 19, 2024
Python

GoogleCloudPlatform / java-docs-samples

Java and Kotlin code samples used on cloud.google.com

kotlin java appengine video cdn auth samples vision translate automl

Updated May 20, 2024
Java

prajolshrestha / prajolshrestha.github.io

Blog and Portfolio page.

machine-learning deep-learning signal-processing artificial-intelligence computer vision

Updated May 18, 2024

youkpan / gemini-assistant

Google Gemini Voice/Vision Assistant with gemini-1.5-pro / gemini-1.5-flash models!

computer-vision assistant gemini webapp vision llm google-gemini gemini-pro gemini-15-pro gpt-4o gemini-flash

Updated May 18, 2024
TypeScript

eliranwong / freegenius

FreeGenius AI, an advanced AI assistant that can talk and take multi-step actions. Supports numerous open-source LLMs via Llama.cpp, Ollama, or the Groq Cloud API, with optional integration with AutoGen agents, the OpenAI API, Google Gemini Pro, and unlimited plugins.

google ai gemini vision openai mistral autogen groq stable-diffusion chatgpt llava llamacpp ollama llama3

Updated May 18, 2024
Python

tii-racing / drone-racing-dataset

A fully-annotated, open-design dataset of autonomous and piloted high-speed flight

control computer-vision robotics path-planning dataset vision motion-capture quadrotor visual-inertial-odometry motion-capture-data ros2 drone-racing autonomous-robots scene-understanding inertial-data

Updated May 17, 2024
Python

Amr-Abdellatif / Building-a-Vision-Transformer-from-scratch-using-PyTorch

In this repo I've built a Vision Transformer using PyTorch

pytorch vision pytorch-implementation vision-transformer

Updated May 17, 2024
Jupyter Notebook
