Skip to content

instill-ai/model-gemini-2b-dvc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Task Tags
TextGeneration
TextGeneration
Gemma-2b

Model-Gemini-2b-dvc

🔥🔥🔥 Deploy Gemma-2b model on VDP.

This repository contains the Gemma-2b Text Completion Generation Model in the Transformers format, managed using DVC.

Notes:

  • Disk Space Requirements: 1.7G
  • GPU Memory Requirements: 4G

Following is an example of query parameters:

Create Model

{
    "id": "gemma-2b-gpu",
    "description": "test containerized gemma 2b gpu model.",
    "model_definition": "model-definitions/container",
    "visibility": "VISIBILITY_PUBLIC",
    "region": "REGION_GCP_EUROPE_WEST_4",
    "hardware": "GPU",
    "configuration": {
        "task": "TEXT_GENERATION"
    }
}

Inference model

{
    "task_inputs": [
        {
            "text_generation": {
                "prompt": "The capital city of Franch is ",
                "max_new_tokens": "300",
                "temperature": "0.8",
                "top_k": "50",
                "random_seed": "42",
                "extra_params": "{\"top_p\": 0.8, \"repetition_penalty\": 2.0}"
            }
        }
    ]
}```

About

⚗️ Gemini 2b model repository trained by Google managed by DVC

Topics

Resources

Code of conduct

Security policy

Stars

Watchers

Forks

Packages

No packages published

Languages