Skip to content

autodistill/autodistill-base-model-template

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Autodistill Base Model Template

⚠️ Note: Before you start building a Base Model, check out our Available Models directory to see if a model is already being implemented. If your desired model is being implemented, check the Autodistill GitHub Issues for progress. We encourage you to offer support to models you want to see in Autodistill if work is already being done on them.

This repository contains a template for use in creating a Base Model for Autodistill.

A Base Model is a large model that you can use for automatically labeling data. Autodistill enables you to connect Base Models to a smaller Target Model. A new model is trained using the Target Model architecture and your labeled data. This model will be smaller and thus more cost effective to run.

Autodistill is an ecosystem of Base and Target Models, with the main Autodistill repository acting as the bridge between the two.

This repository contains a starter template from which you can create a Base Model extension.

Read the full Autodistill documentation.

Steps to Build a Base Model

To build a base model, first rename the src directory to the name of the model you want to implement:

mkdir autodistill_model_name

Use underscores to separate words in the folder name.

Next, open the model.py file. This is the file where your model loading and inference code will be stored. If you need to write helper functions for use with your model -- for example, long methods for loading data, processing extensions -- you may opt to create new files to store the helper scripts.

In model.py, replace the Model class name with the name of your model.

Next, implement the following functions:

  1. __init__: Code for loading the model.
  2. predict: A function that takes in an image name, runs inference, and returns a supervision Detections object (object detection) or a supervision Classifications object (classification).

Replace the import statement in the __init__.py file in your model directory to point to your model. You only need to import the model, such as:

from autodistill_clip.clip_model import CLIP

Your version should be set in the __init__.py file as 0.1.0 before submitting your model for review.

Update the setup.py file to use the name of your model where appropriate. Add all of the requisite dependencies to the install_requires section.

Your Base Model should feature a README that shows a minimal example of how to use the base model. This should only be a few lines of code. Refer to README_EXAMPLE.md for an example of an Autodistill Base Model README. Feel free to copy this example and replace all parts as required.

Your package must be licensed under the same license as the model you are using (i.e. if your model uses an Apache 2.0 license, your Autodistill extension must use the same license). Your license should be in a file called LICENSE, stored in the root directory of your Autodistill extension GitHub repository.

Update your README to note the license applied to your package.

When your Autodistill extension is ready for testing, open an Issue in the main Autodistill repository with a link to a public GitHub repository that contains your code.

An Autodistill maintainer will review your code. If accepted, we will:

  1. Add your package to the Autodistill documentation.
  2. Package your project up to PyPi and publish it as an official autodistill extension.
  3. Announce your project on social media.