Update README.md
Konthee authored Feb 15, 2024
1 parent c9a9839 commit 50ea9ae
Showing 1 changed file with 13 additions and 0 deletions.
13 changes: 13 additions & 0 deletions experiment_effect_of_data_preprocessing/README.md
@@ -23,3 +23,16 @@ sh preprocess_data/blind_pdpa/run_blind.sh
```

## Pre-training steps

1. Convert the dataset from Hugging Face (hf) format to openthai format
```
sh train_tiny_llama/step1_1submit_data_hf_openthai.sh
```
2. Convert the dataset to TinyLlama format
```
sh train_tiny_llama/step1_2submit_data_openthai.sh
```
3. Train the TinyLlama models
```
sh train_tiny_llama/step2_submit_train.sh
```
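
The three steps above can also be chained from one place. The following is a minimal sketch, not part of the repository: the function name `run_pretrain_steps` is hypothetical, and it assumes it is called from the directory that contains `train_tiny_llama/`. The script names suggest that each step submits a job, so a step returning successfully may only mean the job was submitted, not that it finished. In that case, wait for each job to complete before starting the next step instead of relying on this wrapper alone.

```sh
#!/bin/sh
# Hypothetical helper (not part of the repository): runs the three
# pre-training scripts in order and stops at the first failure.
# Assumption: the current directory contains train_tiny_llama/.
run_pretrain_steps() {
    for step in \
        train_tiny_llama/step1_1submit_data_hf_openthai.sh \
        train_tiny_llama/step1_2submit_data_openthai.sh \
        train_tiny_llama/step2_submit_train.sh
    do
        # Refuse to continue if a script is missing.
        if [ ! -f "$step" ]; then
            echo "missing script: $step" >&2
            return 1
        fi
        echo "running: $step"
        # Stop the chain as soon as one step exits with a non-zero status.
        sh "$step" || { echo "failed: $step" >&2; return 1; }
    done
}
```

After defining the function in a shell session (or sourcing a file that contains it), call `run_pretrain_steps` with no arguments.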
