This tutorial will guide you through the process of implementing a federated learning setup using the [FEDn] framework in combination with [Ultralytics] YOLOv8 models. Federated learning allows multiple clients to collaboratively train a global model without sharing their local datasets, ensuring data privacy and security. This tutorial is designed for users familiar with machine learning and federated learning concepts and will provide step-by-step instructions on configuring and running a complete federated learning workflow.
By the end of this tutorial, you will have built a distributed training environment where clients independently train local models, and a global model is aggregated on the server. You will also learn how to use Ultralytics YOLOv8 models for object detection tasks in this federated setting.
- Starting the server in FEDn – Learn how to initiate the central server that coordinates federated learning activities.
- Cloning the repository – Set up the project by cloning the necessary repository for configuration and deployment.
- Installing prerequisites – Install all required dependencies for the client environments.
- Setting up the dataset – Properly structure and configure your dataset to be used with Ultralytics models.
- Setting up configurations – Configure the model, such as defining the number of classes for the YOLOv8 model.
- Building the compute package – Build the compute package to prepare for the training process.
- Initializing the seed model – Generate the initial model to start the training process.
- Initializing the server-side – Set up the server-side of the federated learning system.
- Connecting and starting the clients – Connect the clients that will participate in the federated training.
- Training the global model – Observe how the global model improves through the aggregation of client updates and monitor training progress.
By following these steps, you will not only gain hands-on experience with the FEDn platform but also learn how to integrate object detection tasks with YOLOv8 in a federated learning environment.
Python >=3.9, <=3.12
https://www.python.org/downloads
Firstly, create an account on the FEDn studio platform https://fedn.scaleoutsystems.com/signup.
Once you are logged in, you need to start a new project by clicking on the New project
button.
This initializes the server which later will be used to run the federated learning process.
Next, you need to clone the repository:
git clone https://github.com/scaleoutsystems/fedn-ultralytics-tutorial
Then navigate into the repository:
cd fedn-ultralytics-tutorial
This repository contains all the necessary files and configurations for the federated learning setup.
Next, you need to install the prerequisites. You can install everything using pip:
pip3 install -r requirements.txt
This is recommended to be done in a virtual environment to avoid conflicts with other packages.
Start setting up the dataset by creating a directory named 'datasets' inside the repository:
mkdir datasets
Then copy the path to the 'datasets' folder by running the following command:
echo "$(pwd)/datasets" | pbcopy
Ultralytics uses a specific directory to store datasets, which you can configure using the datasets_dir option in YOLO settings. To set this up to the 'datasets' directory you previously created, run the following command, replacing <path_to_dataset> with the path you just copied:
yolo settings datasets_dir=<path_to_dataset>
After setting the dataset directory, you’ll need to organize your data into a folder named 'fed_dataset' inside the datasets directory. Your final folder structure should look like this:
datasets/
fed_dataset/
train/
images/
image1.jpg
image2.jpg
...
labels/
image1.txt
image2.txt
...
valid/
images/
image1.jpg
image2.jpg
...
labels/
image1.txt
image2.txt
...
Each label file should correspond to an image file, and the format of each label should be:
<class> <x_center> <y_center> <width> <height>
<class> <x_center> <y_center> <width> <height>
...
Each line corresponds to one bounding box in the image.
For further details on how to prepare your dataset, you can visit https://docs.ultralytics.com/datasets/.
For getting started quickly, you can navigate into the examples
repository to download and partition a sample dataset.
To set up your YOLOv8 model, you need to configure the global_config.yaml inside the 'client' folder. Here you choose the number of classes for the YOLOv8 model by setting the num_classes
parameter, and respective class names. You also choose which YOLOv8 model to use by setting the model_size
parameter.
Each client can set different training configurations in the client_config.yaml
file. This file contains the configurations for the client environments, such as the number of local epochs, and batch size. You can adjust these configurations to suit each client's hardware and training requirements.
Once you’ve completed all the configurations, you can build the compute package by running the following command:
python3 client/setup.py
fedn package create -p client
This creates he compute package package.tgz
which contains all the necessary files and configurations for the client environments.
If you make any changes to the global_config.yaml, you’ll need to rebuild (Step 6) and reupload (Step 8) the compute package to apply the updates. For changes in the client_config.yaml
, you don't need to rebuild the compute package.
To initialize the seed model, run the following command:
fedn run build -p client
This command will generate the seed model that will be used as the starting point for the federated learning process.
The next step is to initialize the server side. This is done by uploading the compute package and the seed model to the FEDn studio platform.
This is done by pressing the Sessions button and then the "New session" button. Here you can upload the compute package package.tgz
and the seed model seed.npz
.
Once the compute package and seed model are uploaded, you can create the session by pressing the "Create session" button. Here you also configure the total number of rounds and aggregator function for the federated learning process.
Before starting the training process, you need to connect the clients to the server which is done in the next step.
To connect a client to the server, you need to hand each client a client.yaml file. This file contains the necessary configurations for the client to connect to the server and gain access to the compute package. Connect clients by pressing the "Clients" button on the left hand side. Here you can download a client.yaml file for each client. Place the client.yaml file in the repository and start the client by running following command:
fedn client start -in client.yaml
This starts the client and connects it to the server. Repeat this process for each client you want to connect to the server.
Once the clients are running, you can start the global training by pressing the Start session button in FEDn studio. This will initiate the federated learning process, where the global model is trained by aggregating the updates from the clients. Now you will see things happening on both the server and client side. You can monitor the training progress on the FEDn studio platform where metrics such as recall, precision, and mAP scores are shown.
Once training is completed, you can download any model from a certain round in the session. The model can be used for inference on new data.
In this tutorial, you have learned how to implement Ultralytics YOLOv8 models in a federated learning setting using the FEDn platform. By following the steps outlined in this tutorial, you have successfully set up a distributed training environment where clients independently train local models, and a global model is aggregated on the server. You have also learned how to configure the dataset, set up the model configurations, build the compute package, and start the federated learning process. By completing this tutorial, you have gained hands-on experience with federated learning and object detection tasks using Ultralytics models.
Steps 1, 2, 5, 6, 7 and 8 only need to be done once to set up the federated learning environment in FEDn.
To connect a new client, the only steps that needs to be followed are step 2, 3, 4 and 9. Each client can have different configurations in the client_config.yaml file to account for different hardware and training requirements.