2.0.6

HuggingFaceModelInfo

Package: flyte.prefetch

Information about a HuggingFace model to store.

class HuggingFaceModelInfo(
    repo: str,
    artifact_name: str | None,
    architecture: str | None,
    task: str,
    modality: tuple[str, ...],
    serial_format: str | None,
    model_type: str | None,
    short_description: str | None,
    shard_config: flyte.prefetch._hf_model.ShardConfig | None,
)

Create a new model by parsing and validating input data from keyword arguments.

Raises ValidationError if the input data cannot be validated to form a valid model.

self is explicitly positional-only to allow self as a field name.

Parameter Type Description
repo str The HuggingFace repository ID (e.g., ‘meta-llama/Llama-2-7b-hf’).
artifact_name str | None Optional name for the stored artifact. If not provided, the repo name will be used (with ‘.’ replaced by ‘-’).
architecture str | None Model architecture from HuggingFace config.json.
task str Model task (e.g., ‘generate’, ‘classify’, ’embed’).
modality tuple[str, ...] Modalities supported by the model (e.g., ’text’, ‘image’).
serial_format str | None Model serialization format (e.g., ‘safetensors’, ‘onnx’).
model_type str | None Model type (e.g., ’transformer’, ‘custom’).
short_description str | None Short description of the model.
shard_config flyte.prefetch._hf_model.ShardConfig | None Optional configuration for model sharding.