LargeScale

Learning how to pre-train big transformers.

Running

First unpack the dataset:

zstd -d dataset.tar.zstd && tar -xf dataset.tar && rm dataset.tar

Make sure that transformers, datasets, evaluate, and the other required libraries are up to date.
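As a quick way to see what is currently installed, the following sketch prints the version of each library named above. It uses only the standard library. The `PACKAGES` tuple and the `installed_versions` helper are illustrative names, not part of this repository, and the script only reports installed versions; it does not check whether they are the newest releases.

```python
from importlib import metadata

# Packages named in this README; extend the tuple with anything else hf.py imports.
PACKAGES = ("transformers", "datasets", "evaluate")


def installed_versions(packages=PACKAGES):
    """Map each package name to its installed version, or None if it is missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

If anything is missing or old, `pip install --upgrade transformers datasets evaluate` should bring those three up to date.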

Then run the training script with settings that suit your hardware. For example, a laptop-friendly configuration:

python3 hf.py --model_type gpt2 --train_file train.txt --validation_file eval.txt --output_dir output --tokenizer gpt2 --do_train True --config_overrides="n_layer=2,n_embd=120"
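To get a feel for why these overrides are laptop-friendly, the sketch below estimates the parameter count of a GPT-2 style model before and after applying them. It assumes that hf.py builds a standard Hugging Face GPT-2 configuration (vocabulary of 50257 tokens, 1024 positions, output layer tied to the token embedding), which this README does not state. `parse_overrides` is a simplified, integer-only stand-in written for this example, not the parser the library uses.

```python
def parse_overrides(overrides: str) -> dict:
    """Turn 'n_layer=2,n_embd=120' into {'n_layer': 2, 'n_embd': 120} (integers only)."""
    result = {}
    for pair in overrides.split(","):
        key, value = pair.split("=", 1)
        result[key.strip()] = int(value)
    return result


# Default GPT-2 (small) hyperparameters; assumed to be the starting point here.
GPT2_DEFAULTS = {"vocab_size": 50257, "n_positions": 1024, "n_embd": 768, "n_layer": 12}


def gpt2_param_count(cfg: dict) -> int:
    """Count parameters of a GPT-2 style model with tied input/output embeddings."""
    d = cfg["n_embd"]
    # Token embedding plus learned position embedding.
    embeddings = cfg["vocab_size"] * d + cfg["n_positions"] * d
    # Per block: attention (4*d*d weights) + MLP (8*d*d weights),
    # 9*d biases, and two LayerNorms (4*d).
    per_block = 12 * d * d + 13 * d
    final_layer_norm = 2 * d
    return embeddings + cfg["n_layer"] * per_block + final_layer_norm


if __name__ == "__main__":
    small = {**GPT2_DEFAULTS, **parse_overrides("n_layer=2,n_embd=120")}
    print("default GPT-2:", gpt2_param_count(GPT2_DEFAULTS))
    print("with overrides:", gpt2_param_count(small))
```

Under these assumptions the overridden model has about 6.5 million parameters, compared with roughly 124 million for the default configuration, and most of what remains sits in the token embedding table.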

