flyte.storage
Directory
Classes
| Class | Description |
|---|---|
| ABFS | Any Azure Blob Storage specific configuration. |
| GCS | Any GCS specific configuration. |
| S3 | S3 specific configuration. |
| Storage | Data storage configuration that applies across any provider. |
Methods
| Method | Description |
|---|---|
| exists() | Check if a path exists. |
| exists_sync() | |
| get() | |
| get_configured_fsspec_kwargs() | |
| get_random_local_directory() | |
| get_random_local_path() | Use file_path_or_file_name when you want a random directory but want to preserve the leaf file name. |
| get_stream() | Get a stream of data from a remote location. |
| get_underlying_filesystem() | |
| is_remote() | Let’s find a replacement. |
| join() | Join multiple paths together. |
| open() | Asynchronously open a file and return an async context manager. |
| put() | |
| put_stream() | Put a stream of data to a remote location. |
Methods
exists()
def exists(
    path: str,
    **kwargs,
) -> bool

Check if a path exists.

| Parameter | Type | Description |
|---|---|---|
| path | str | Path to be checked. |
| kwargs | **kwargs | Additional arguments to be passed to the underlying filesystem. |
Returns: True if the path exists, False otherwise.
exists_sync()
def exists_sync(
    path: str,
    **kwargs,
) -> bool

| Parameter | Type | Description |
|---|---|---|
| path | str | |
| kwargs | **kwargs | |
get()
def get(
    from_path: str,
    to_path: Optional[str | pathlib.Path],
    recursive: bool,
    **kwargs,
) -> str

| Parameter | Type | Description |
|---|---|---|
| from_path | str | |
| to_path | Optional[str \| pathlib.Path] | |
| recursive | bool | |
| kwargs | **kwargs | |
get_configured_fsspec_kwargs()
def get_configured_fsspec_kwargs(
    protocol: typing.Optional[str],
    anonymous: bool,
) -> typing.Dict[str, typing.Any]

| Parameter | Type | Description |
|---|---|---|
| protocol | typing.Optional[str] | |
| anonymous | bool | |
get_random_local_directory()
def get_random_local_directory()

Returns: pathlib.Path
get_random_local_path()
def get_random_local_path(
    file_path_or_file_name: pathlib.Path | str | None,
) -> pathlib.Path

Use file_path_or_file_name when you want a random directory but want to preserve the leaf file name.

| Parameter | Type | Description |
|---|---|---|
| file_path_or_file_name | pathlib.Path \| str \| None | |
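The behavior described above (a fresh random directory, with the leaf file name preserved) can be sketched with the standard library alone. The helper `random_local_path` below is a hypothetical stand-in written for illustration; it is not the flyte.storage implementation, and the base directory and naming scheme are assumptions.

```python
import pathlib
import tempfile
import uuid


def random_local_path(file_path_or_file_name=None) -> pathlib.Path:
    """Hypothetical stand-in: build a path inside a uniquely named directory.

    If a path or file name is given, only its leaf name is kept.
    The directory itself is not created here.
    """
    # Assumption: random directories live under the system temp directory.
    random_dir = pathlib.Path(tempfile.gettempdir()) / uuid.uuid4().hex
    if file_path_or_file_name is None:
        # Nothing to preserve, so the leaf name is random as well.
        return random_dir / uuid.uuid4().hex
    return random_dir / pathlib.Path(file_path_or_file_name).name


first = random_local_path("s3://my_bucket/reports/summary.csv")
second = random_local_path("summary.csv")
print(first.name, second.name)        # both leaf names are preserved
print(first.parent != second.parent)  # each call uses a different directory
```

Keeping the leaf name is useful when downstream code infers the file type from the extension, while the random parent directory avoids collisions between downloads.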
get_stream()
def get_stream(
    path: str,
    chunk_size,
    **kwargs,
) -> AsyncGenerator[bytes, None]

Get a stream of data from a remote location. This is useful for downloading streaming data from a remote location.

Example usage:
import flyte.storage as storage
async for chunk in storage.get_stream(path="s3://my_bucket/my_file.txt"):
    process(chunk)

| Parameter | Type | Description |
|---|---|---|
| path | str | Path to the remote location from which the data will be downloaded. |
| chunk_size | | Size of each chunk to be read from the file. |
| kwargs | **kwargs | Additional arguments to be passed to the underlying filesystem. |
Returns: An async iterator that yields chunks of bytes.
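Because the return value is an async iterator of byte chunks, a caller typically consumes it with `async for`. The sketch below shows that consumption pattern without touching a remote store: `fake_get_stream` is a hypothetical stand-in that yields chunks from an in-memory buffer, and `read_all` is an illustrative helper, neither of which is part of flyte.storage.

```python
import asyncio
from typing import AsyncGenerator, AsyncIterable


async def fake_get_stream(data: bytes, chunk_size: int = 4) -> AsyncGenerator[bytes, None]:
    """Hypothetical stand-in for a remote stream: yields `data` in chunks."""
    for start in range(0, len(data), chunk_size):
        await asyncio.sleep(0)  # hand control back to the event loop between chunks
        yield data[start:start + chunk_size]


async def read_all(stream: AsyncIterable[bytes]) -> bytes:
    """Collect every chunk from an async byte stream into one bytes object."""
    buffer = bytearray()
    async for chunk in stream:
        buffer.extend(chunk)
    return bytes(buffer)


payload = asyncio.run(read_all(fake_get_stream(b"hello, flyte", chunk_size=5)))
print(payload)
```

For large objects, processing each chunk inside the loop instead of buffering everything keeps memory use bounded by the chunk size.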
get_underlying_filesystem()
def get_underlying_filesystem(
    protocol: typing.Optional[str],
    anonymous: bool,
    path: typing.Optional[str],
    **kwargs,
) -> fsspec.AbstractFileSystem

| Parameter | Type | Description |
|---|---|---|
| protocol | typing.Optional[str] | |
| anonymous | bool | |
| path | typing.Optional[str] | |
| kwargs | **kwargs | |
is_remote()
def is_remote(
    path: typing.Union[pathlib.Path | str],
) -> bool

Let’s find a replacement.

| Parameter | Type | Description |
|---|---|---|
| path | typing.Union[pathlib.Path \| str] | |
join()
def join(
    *paths: str,
) -> str

Join multiple paths together. This is a wrapper around os.path.join.

TODO: replace with a proper join that accounts for the fsspec root, etc.

| Parameter | Type | Description |
|---|---|---|
| paths | str | Paths to be joined. |
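Since join is described as a wrapper around os.path.join, the standard library function's behavior on URI-like strings is worth keeping in mind. The sketch below calls `posixpath.join` directly (the function that `os.path.join` resolves to on POSIX systems) so the result does not depend on the platform; it does not call flyte.storage, and the bucket names are placeholders.

```python
import posixpath

# Relative components are appended with "/" separators.
joined = posixpath.join("s3://my_bucket/prefix", "data", "file.txt")
print(joined)  # s3://my_bucket/prefix/data/file.txt

# An absolute later component discards everything before it,
# including the scheme and bucket.
reset = posixpath.join("s3://my_bucket/prefix", "/data/file.txt")
print(reset)  # /data/file.txt
```

On Windows, os.path.join is ntpath.join and inserts backslashes, so joining remote URIs through it is platform-sensitive.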
open()
def open(
    path: str,
    mode: str,
    **kwargs,
) -> AsyncReadableFile | AsyncWritableFile

Asynchronously open a file and return an async context manager. This function checks whether the underlying filesystem supports obstore bypass. If it does, it uses obstore to open the file. Otherwise, it falls back to the standard _open function, which uses AsyncFileSystem.

It raises NotImplementedError if neither obstore nor AsyncFileSystem is supported.

| Parameter | Type | Description |
|---|---|---|
| path | str | |
| mode | str | |
| kwargs | **kwargs | |
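The selection order described for open() (fast path first, async filesystem second, otherwise an error) can be illustrated with a small dispatch sketch. Everything below is hypothetical: the three filesystem classes, the `supports_bypass` and `async_capable` attributes, the default mode, and the string return values are stand-ins chosen for illustration, not flyte.storage or obstore APIs.

```python
import asyncio


class BypassFS:
    """Hypothetical filesystem that supports the fast (obstore-style) path."""
    supports_bypass = True


class AsyncOnlyFS:
    """Hypothetical filesystem that only offers an async open."""
    supports_bypass = False
    async_capable = True


class SyncOnlyFS:
    """Hypothetical filesystem with neither capability."""
    supports_bypass = False
    async_capable = False


async def open_with_fallback(fs, path: str, mode: str = "rb") -> str:
    """Pick an open strategy in the order the description gives.

    Returns a label naming the chosen strategy instead of a real file object.
    """
    if getattr(fs, "supports_bypass", False):
        return f"bypass:{path}:{mode}"
    if getattr(fs, "async_capable", False):
        return f"async:{path}:{mode}"
    raise NotImplementedError(f"No async open strategy for {type(fs).__name__}")


fast = asyncio.run(open_with_fallback(BypassFS(), "s3://my_bucket/a.txt"))
slow = asyncio.run(open_with_fallback(AsyncOnlyFS(), "s3://my_bucket/a.txt"))
print(fast, slow)

try:
    asyncio.run(open_with_fallback(SyncOnlyFS(), "/tmp/a.txt"))
except NotImplementedError as err:
    print("unsupported:", err)
```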
put()
def put(
    from_path: str,
    to_path: Optional[str],
    recursive: bool,
    batch_size: Optional[int],
    **kwargs,
) -> str

| Parameter | Type | Description |
|---|---|---|
| from_path | str | |
| to_path | Optional[str] | |
| recursive | bool | |
| batch_size | Optional[int] | |
| kwargs | **kwargs | |
put_stream()
def put_stream(
    data_iterable: typing.AsyncIterable[bytes] | bytes,
    name: str | None,
    to_path: str | None,
    **kwargs,
) -> str

Put a stream of data to a remote location. This is useful for streaming data to a remote location.

Example usage:
import flyte.storage as storage
await storage.put_stream(b"hello", name="my_file.txt")
# or
await storage.put_stream(b"hello", to_path="s3://my_bucket/my_file.txt")

| Parameter | Type | Description |
|---|---|---|
| data_iterable | typing.AsyncIterable[bytes] \| bytes | Iterable of bytes to be streamed. |
| name | str \| None | Name of the file to be created. If not provided, a random name will be generated. |
| to_path | str \| None | Path to the remote location where the data will be stored. |
| kwargs | **kwargs | Additional arguments to be passed to the underlying filesystem. |
Returns: The path to the remote location where the data was stored.
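The `data_iterable` parameter is typed as either raw bytes or an async iterable of bytes. The sketch below shows one way to produce such an async iterable and how a consumer might treat both input shapes. `chunked` and `fake_put_stream` are hypothetical helpers written for illustration, and the in-memory `io.BytesIO` sink stands in for remote storage; none of this is the flyte.storage implementation.

```python
import asyncio
import io
from typing import AsyncIterable, AsyncIterator, Union


async def chunked(data: bytes, chunk_size: int) -> AsyncIterator[bytes]:
    """Yield `data` as an async stream of chunks of at most `chunk_size` bytes."""
    for start in range(0, len(data), chunk_size):
        yield data[start:start + chunk_size]


async def fake_put_stream(
    data_iterable: Union[AsyncIterable[bytes], bytes], sink: io.BytesIO
) -> int:
    """Hypothetical stand-in for an upload: write to `sink`, return bytes written."""
    if isinstance(data_iterable, bytes):
        # Raw bytes are written in a single call.
        return sink.write(data_iterable)
    written = 0
    async for chunk in data_iterable:
        written += sink.write(chunk)
    return written


sink_a, sink_b = io.BytesIO(), io.BytesIO()
count_a = asyncio.run(fake_put_stream(b"hello", sink_a))
count_b = asyncio.run(fake_put_stream(chunked(b"hello world", 4), sink_b))
print(count_a, sink_a.getvalue())
print(count_b, sink_b.getvalue())
```

Note that a plain synchronous iterator such as `iter([b"hello"])` matches neither declared type; wrapping it in an async generator like `chunked` is one way to adapt it.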