diff --git a/docs/hub/spaces-sdks-docker.md b/docs/hub/spaces-sdks-docker.md
index 6facf66d2..f936f8e54 100644
--- a/docs/hub/spaces-sdks-docker.md
+++ b/docs/hub/spaces-sdks-docker.md
@@ -136,10 +136,12 @@ The data written on disk is lost whenever your Docker Space restarts, unless you
 If you opt-in for a persistent storage upgrade, you can use the `/data` directory to store data. This directory is mounted on a persistent volume, which means that the data written in this directory will be persisted across restarts.
+At the moment, `/data` volume is only available at runtime, i.e. you cannot use `/data` during the build step of your Dockerfile.
+
-You can also use our Datasets Hub for specific cases, where you can store state and data in a git LFS repository. You can find an example of persistence [here](https://huggingface.co/spaces/julien-c/persistent-data), which uses the [`huggingface_hub` library](https://huggingface.co/docs/huggingface_hub/index) for programmatically uploading files to a dataset repository.
+You can also use our Datasets Hub for specific cases, where you can store state and data in a git LFS repository. You can find an example of persistence [here](https://huggingface.co/spaces/Wauplin/space_to_dataset_saver), which uses the [`huggingface_hub` library](https://huggingface.co/docs/huggingface_hub/index) for programmatically uploading files to a dataset repository. This Space example along with [this guide](https://huggingface.co/docs/huggingface_hub/main/en/guides/upload#scheduled-uploads) will help you define which solution fits best your data type.
 Finally, in some cases, you might want to use an external storage solution from your Space's code like an external hosted DB, S3, etc.
diff --git a/docs/hub/spaces-storage.md b/docs/hub/spaces-storage.md
index d7db1d7f3..57a8da9f5 100644
--- a/docs/hub/spaces-storage.md
+++ b/docs/hub/spaces-storage.md
@@ -53,7 +53,7 @@ Additional information about billing can be found in the [dedicated Hub-wide sec
 If you need to persist data that lives longer than your Space, you could use a [dataset repo](./datasets).
-You can find an example of persistence [here](https://huggingface.co/spaces/julien-c/persistent-data), which uses the [`huggingface_hub` library](https://huggingface.co/docs/huggingface_hub/index) for programmatically uploading files to a dataset repository.
+You can find an example of persistence [here](https://huggingface.co/spaces/Wauplin/space_to_dataset_saver), which uses the [`huggingface_hub` library](https://huggingface.co/docs/huggingface_hub/index) for programmatically uploading files to a dataset repository. This Space example along with [this guide](https://huggingface.co/docs/huggingface_hub/main/en/guides/upload#scheduled-uploads) will help you define which solution fits best your data type.
 Visit the [`datasets` library](https://huggingface.co/docs/datasets/index) documentation and the [`huggingface_hub` client library](https://huggingface.co/docs/huggingface_hub/index) documentation for more information on how to programmatically interact with dataset repos.
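
Reviewer note: the two options described in the changed docs (writing to `/data` at runtime, and pushing files to a dataset repository on a schedule) could be combined roughly as in the sketch below. This is only an illustration under stated assumptions, not the code of the linked Space. The helper names (`resolve_storage_dir`, `append_record`, `start_dataset_sync`), the `app_state` subdirectory, the `records.jsonl` filename, and the `data` path inside the repo are all hypothetical. The only library call used is `huggingface_hub.CommitScheduler`, which the linked scheduled-uploads guide covers; it needs the `huggingface_hub` package and a token with write access, so it is imported lazily and not invoked here.

```python
import json
import os
import tempfile
from datetime import datetime, timezone
from pathlib import Path


def resolve_storage_dir(persistent_root="/data", subdir="app_state"):
    """Pick a writable directory at runtime (hypothetical helper).

    Prefers the persistent volume when it is mounted and writable, and
    falls back to the system temp directory otherwise. Because this runs
    at application start, not in the Dockerfile, it respects the
    "runtime only" limitation of `/data`.
    """
    root = Path(persistent_root)
    if root.is_dir() and os.access(root, os.W_OK):
        base = root
    else:
        base = Path(tempfile.gettempdir())
    target = base / subdir
    target.mkdir(parents=True, exist_ok=True)
    return target


def append_record(storage_dir, record, filename="records.jsonl"):
    """Append one JSON line to a file in the storage directory."""
    path = Path(storage_dir) / filename
    entry = {"saved_at": datetime.now(timezone.utc).isoformat(), **record}
    with path.open("a", encoding="utf-8") as f:
        f.write(json.dumps(entry) + "\n")
    return path


def start_dataset_sync(storage_dir, repo_id, every_minutes=10):
    """Periodically commit the storage directory to a dataset repo.

    Assumes `huggingface_hub` is installed and a write token is
    available in the environment; `repo_id` is supplied by the caller.
    """
    from huggingface_hub import CommitScheduler  # lazy: optional dependency

    return CommitScheduler(
        repo_id=repo_id,
        repo_type="dataset",
        folder_path=storage_dir,
        path_in_repo="data",  # hypothetical location inside the repo
        every=every_minutes,
    )
```

Appending to a JSON Lines file keeps each write small and avoids rewriting earlier records, which tends to suit background uploads; whether that fits a given data type is the question the linked guide is meant to help answer.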