Grounded Multimodal Large Language Model with Localized Visual Tokenization
We perform functional grounding of LLMs' knowledge in BabyAI-Text
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
A personalized assistant that generates new issues by using earlier issue descriptions of Issue Tracking Systems like JIRA
A Python library for the design of earthing networks in electrical substations.
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle enables agents to tackle any computer task through strong reasoning abilities, self-improvement, and skill curation, in a standardized general environment with minimal requirements.
A biological entity grounding search service
Extracting character conversations in Arknights
Hierarchical Universal Language Conditioned Policies
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
[CVPR 2024] Code for "Improved Visual Grounding through Self-Consistent Explanations".
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
CLIPort: What and Where Pathways for Robotic Manipulation
[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data
Adapting the original Azure OpenAI sample from https://github.com/Azure-Samples/azure-search-openai-demo for the newer GPT-4-compatible "Chat Completion" syntax.
awesome grounding: A curated list of research papers in visual grounding
This is the official implementation for our paper, "LAR: Look Around and Refer".