Customizing tasks and workflows#
- class flytekit.TaskMetadata(cache=False, cache_serialize=False, cache_version='', cache_ignore_input_vars=(), interruptible=None, deprecated='', retries=0, timeout=None, pod_template_name=None)#
Metadata for a Task. Things like retries and whether or not caching is turned on, and cache version are specified here.
See the IDL for the protobuf definition.
- Args:
cache (bool): Indicates if caching should be enabled. See Caching cache_serialize (bool): Indicates if identical (ie. same inputs) instances of this task should be executed in serial when caching is enabled. See Caching cache_version (str): Version to be used for the cached value cache_ignore_input_vars (Tuple[str, …]): Input variables that should not be included when calculating hash for cache interruptible (Optional[bool]): Indicates that this task can be interrupted and/or scheduled on nodes with
lower QoS guarantees that can include pre-emption. This can reduce the monetary cost executions incur at the cost of performance penalties due to potential interruptions
- deprecated (str): Can be used to provide a warning message for deprecated task. Absence or empty str indicates
that the task is active and not deprecated
retries (int): for retries=n; n > 0, on failures of this task, the task will be retried at-least n number of times. timeout (Optional[Union[datetime.timedelta, int]]): the max amount of time for which one execution of this task
should be executed for. The execution will be terminated if the runtime exceeds the given timeout (approximately)
pod_template_name (Optional[str]): the name of existing PodTemplate resource in the cluster which will be used in this task.
- Parameters:
cache (bool)
cache_serialize (bool)
cache_version (str)
cache_ignore_input_vars (Tuple[str, ...])
interruptible (bool | None)
deprecated (str)
retries (int)
timeout (timedelta | int | None)
pod_template_name (str | None)
- to_taskmetadata_model()#
Converts to _task_model.TaskMetadata
- Return type:
TaskMetadata
- class flytekit.Resources(cpu=None, mem=None, gpu=None, ephemeral_storage=None)#
This class is used to specify both resource requests and resource limits.
Resources(cpu="1", mem="2048") # This is 1 CPU and 2 KB of memory Resources(cpu="100m", mem="2Gi") # This is 1/10th of a CPU and 2 gigabytes of memory Resources(cpu=0.5, mem=1024) # This is 500m CPU and 1 KB of memory # For Kubernetes-based tasks, pods use ephemeral local storage for scratch space, caching, and for logs. # This allocates 1Gi of such local storage. Resources(ephemeral_storage="1Gi")
Note
Persistent storage is not currently supported on the Union backend.
Please see the User Guide for detailed examples. Also refer to the K8s conventions.
- Parameters:
cpu (str | int | float | None)
mem (int | str | None)
gpu (int | str | None)
ephemeral_storage (int | str | None)
- class flytekit.WorkflowFailurePolicy(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)#
Defines the behavior for a workflow execution in the case of an observed node execution failure. By default, a workflow execution will immediately enter a failed state if a component node fails.
- FAIL_AFTER_EXECUTABLE_NODES_COMPLETE = 1#
Will proceed to run any remaining runnable nodes once a component node fails.
- FAIL_IMMEDIATELY = 0#
Causes the entire workflow execution to fail once a component node fails.
- class flytekit.PodTemplate(pod_spec=None, primary_container_name='primary', labels=None, annotations=None)#
Custom PodTemplate specification for a Task.
- Parameters:
pod_spec (V1PodSpec | None)
primary_container_name (str)
labels (Dict[str, str] | None)
annotations (Dict[str, str] | None)