
Support download huggingface models in storage initializer #3545

Open · lizzzcai opened this issue Mar 24, 2024 · 2 comments · May be fixed by #3584

Comments

@lizzzcai (Member) commented Mar 24, 2024

/kind feature

Describe the solution you'd like

I would like to use the storage initializer to pull Hugging Face models, which fits the existing KServe experience. The storageUri would use this format: hf://<repo>/<model>:<hash(optional)>

apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: bloom
spec:
  predictor:
    model:
      modelFormat:
        name: huggingface
      #runtime: kserve-huggingface
      storageUri: hf://bigscience/bloom-560m:<hash>
      resources:
        limits:
          cpu: "2"
          memory: 4Gi
        requests:
          cpu: "1"
          memory: 2Gi
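
For illustration, a minimal sketch of what the download step inside the storage initializer could look like, assuming a new hf:// provider backed by huggingface_hub.snapshot_download. The parse_hf_uri and download_model helpers are hypothetical names, and mapping the optional <hash> to a Hugging Face revision is an assumption, not settled design:

from huggingface_hub import snapshot_download  # pip install huggingface_hub

def parse_hf_uri(storage_uri: str) -> tuple[str, str | None]:
    """Split "hf://<repo>/<model>:<hash>" into (repo_id, revision).

    The trailing ":<hash>" is optional; without it the repo's default
    revision (usually "main") would be used.
    """
    path = storage_uri.removeprefix("hf://")
    repo_id, _, revision = path.partition(":")
    return repo_id, revision or None

def download_model(storage_uri: str, out_dir: str = "/mnt/models") -> str:
    """Hypothetical storage-initializer hook: materialize the snapshot locally.

    /mnt/models is KServe's standard model mount path.
    """
    repo_id, revision = parse_hf_uri(storage_uri)
    return snapshot_download(repo_id=repo_id, revision=revision, local_dir=out_dir)

if __name__ == "__main__":
    print(download_model("hf://bigscience/bloom-560m"))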

Anything else you would like to add:

Another option is to support downloading from any Git LFS repository, as sketched below.
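
A hedged sketch of that alternative, assuming the initializer shells out to the git and git-lfs CLIs (the download_git_lfs helper is illustrative only):

import os
import subprocess

def download_git_lfs(repo_url: str, out_dir: str, revision: str | None = None) -> None:
    """Illustrative only: clone a Git LFS repo and fetch its large files."""
    # Skip LFS smudging during clone so large files are fetched once, below.
    env = {**os.environ, "GIT_LFS_SKIP_SMUDGE": "1"}
    subprocess.run(["git", "clone", repo_url, out_dir], check=True, env=env)
    if revision:
        subprocess.run(["git", "-C", out_dir, "checkout", revision], check=True)
    subprocess.run(["git", "-C", out_dir, "lfs", "pull"], check=True)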


@andyi2it (Contributor) commented:

/assign @andyi2it

@jiangxiaobin96 commented:

Hello, I have some questions about KServe model storage.
If I have a model downloaded from Hugging Face plus a Python file that loads it, how do I use the InferenceService CRD?
In a YAML like the following, where does the Python file that loads the model go?

apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: bloom
spec:
  predictor:
    model:
      modelFormat:
        name: huggingface
      #runtime: kserve-huggingface
      storageUri: hf://bigscience/bloom-560m:<hash>
      resources:
        limits:
          cpu: "2"
          memory: 4Gi
        requests:
          cpu: "1"
          memory: 2Gi
