This project adds LoRA, a Flash Attention patch, and DeepSpeed to Scaled RoPE, and supports multi-node training.
Flash Attention 2 and LLaMA 2 ready 🚀
- Install the required Python dependencies with pip:
  `pip install -r requirements.txt`
- Update the `configs/config.yaml` file as per your requirements, then run:
  `python finetune.py --configs defaults <your_override_config>`
  Replace `<your_override_config>` with the name of your override section in `configs/config.yaml` (a sketch of such a file follows this item). Individual configuration values can also be overridden via command-line arguments.
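For orientation, the launch commands in this README select named sections from `configs/config.yaml` (e.g. `defaults` and `lora-7b-llama2`). The following is only an illustrative sketch built from parameters mentioned in this README; the actual keys and values in the repository's config file may differ.

```yaml
# Illustrative sketch of configs/config.yaml -- actual keys and values may differ.
defaults:
  output_dir: saved_ckpts_32k
  deepspeed: true
  pretokenized: true
  dataset_names:
    - conceptofmind/rp-packed-32k-no-filter
  interpolation_factor: null      # null -> scale by max_position_embeddings / 4096

lora-7b-llama2:
  # Override section selected with `--configs defaults lora-7b-llama2`.
  interpolation_factor: 8         # e.g. 32k target context / 4096 pretrained context
```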
Please note: this uses the recent Hugging Face PR, so the models are HF compatible. The linear scaling argument is `interpolation_factor`, i.e. how much you want to scale the model's context. If set to `None`, it defaults to `config.max_position_embeddings / 4096`, since 4096 is the default context length for LLaMA 2 (see the sketch below).
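To make the scaling concrete: linear interpolation divides the position indices fed into the rotary embedding by the interpolation factor, squeezing a long sequence back into the position range the model was pretrained on. The sketch below is a generic illustration of that idea with hypothetical names; it is not the patch used in this repository.

```python
# Generic sketch of linear RoPE position interpolation (illustrative, not this repo's code).
from typing import Optional

import torch


def interpolated_positions(seq_len: int,
                           interpolation_factor: Optional[float],
                           max_position_embeddings: int,
                           pretrained_context: int = 4096) -> torch.Tensor:
    """Return scaled position ids to feed into the rotary embedding."""
    if interpolation_factor is None:
        # README default: config.max_position_embeddings / 4096.
        interpolation_factor = max_position_embeddings / pretrained_context
    positions = torch.arange(seq_len, dtype=torch.float32)
    # Dividing by the factor maps [0, seq_len) back into [0, pretrained_context).
    return positions / interpolation_factor


# Example: with a 16k context, positions are compressed by a factor of 4.
print(interpolated_positions(16384, None, max_position_embeddings=16384)[-1])  # tensor(4095.7500)
```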
- Specify packed, untokenized datasets from the Hub under `dataset_names`, e.g. `Multi-Domain-Expert-Layers/the_pile_books3_packed_128k`.
- If `pretokenized=True`, specify a single pre-tokenized dataset from the Hub under `dataset_names`, e.g. `conceptofmind/rp-packed-32k-no-filter` for OpenLLaMA (see the snippet after this list).
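As a quick way to see what those `dataset_names` entries point to, the snippet below streams one of the packed datasets from the Hub with the `datasets` library. It is purely illustrative and not part of the training code; the split name is an assumption.

```python
# Illustrative only: inspect a packed dataset referenced above with the `datasets` library.
from datasets import load_dataset

ds = load_dataset(
    "Multi-Domain-Expert-Layers/the_pile_books3_packed_128k",
    split="train",     # split name assumed; adjust to the dataset's actual splits
    streaming=True,    # stream instead of downloading the full pack
)
example = next(iter(ds))
print(list(example.keys()))  # column layout depends on how the dataset was packed
```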
Use the following commands for running on a single node:
- Export the necessary paths and environment variables:
  export PYTHONPATH="/mnt/data/jordiclive/scaled-rope:$PYTHONPATH"
  export TRANSFORMERS_CACHE="/mnt/data/jordiclive/transformers_cache"
  export HF_DATASETS_CACHE="/mnt/data/jordiclive/transformers_cache"
  export HF_HOME="/mnt/data/jordiclive/transformers_cache"
  export WANDB_API_KEY=""
- Run the script:
deepspeed --include=localhost:0,1,2,3,4,5,6,7 --master_port 61500 finetune.py --output_dir saved_ckpts_32k --configs defaults lora-7b-llama2 --deepspeed
An example script using the Slurm launcher with DeepSpeed is provided at scripts/juwels_booster.sh.