Small Language Model

A deep learning project that generates Shakespearean-like text using a transformer-based, character-level language model (GPT-like).

Generating $1000$ characters using `O God, O God!` as the prompt:

O God, O God! she is full of death.
Patience, I will take thy hire with thy death.

LADY GREY:
Why, then I will do what your grace commands.

GLOUCESTER:

CLARENCE:

LADY GREY:
Why stops my lord; and so much before
The mother of our gates to die.

KING EDWARD IV:
Thanks, good Montgomery; and my soul
Did triumph with all duteous pleasing well
Will not procuous here, stults, and as ours,
Stand all your worthy answer the law,
And afterward here with her! Were I a tyrant,
Where were her life? she durst not call me so,
If she did know me one. Away with him to prison!
He would not prepare to fine his country
And by his power increaseth himself as well as
In all proof, as it appears, cracking ten times,
Not am body's parent of pardon.

MISTRESS OVERDONE:
Nay, but I know 'tis so: I saw him arrested, saw
spectatord and as it were far the people,
To lose his hum with his fower a fool.

SICINIUS:
He would not like the dews of a week fellow;
hath bid he has march him well; which was it born.
To see how the pow

Installation

  • Install PyTorch:
pip3 install torch --index-url https://download.pytorch.org/whl/cu118
  • Install the other dependencies:
pip3 install -r requirements.txt
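
You can optionally check that PyTorch was installed with CUDA support (the cu118 wheel above targets CUDA 11.8; this prints False on a CPU-only machine):

python3 -c "import torch; print(torch.cuda.is_available())"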

Configuration

By default, our Python scripts look for the configuration file ./config/config.yaml.

You can also use a different .yaml file located under the ./config directory. To do so, append --config-name <config-name> right after the Python filename when launching a script from the CLI, where <config-name> is the config filename without the .yaml extension.
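
For example, assuming a hypothetical config file ./config/small.yaml:

python src/trainer.py --config-name small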

Training

If you want to train a model from scratch, make sure the checkpoint path doesn't point to an existing checkpoint; otherwise, training will resume from it.

python src/trainer.py
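
The resume behaviour roughly corresponds to the following sketch (the checkpoint key names here are hypothetical; the actual logic lives in src/trainer.py and may differ):

```python
import os
import torch

def load_or_init(model, optimizer, checkpoint_path):
    """Resume from checkpoint_path if it exists, otherwise train from scratch."""
    start_epoch = 0
    if os.path.exists(checkpoint_path):
        ckpt = torch.load(checkpoint_path)
        model.load_state_dict(ckpt["model"])          # hypothetical key names
        optimizer.load_state_dict(ckpt["optimizer"])
        start_epoch = ckpt["epoch"] + 1               # continue after the last saved epoch
    return start_epoch
```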

Sampling

python src/sampler.py common.sampling.context=<context> common.sampling.nb_tokens=<nb-chars-to-gen>

where <context> is the seed string and <nb-chars-to-gen> is the number of characters to generate.
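
For example, to generate $1000$ characters from the prompt shown at the top of this README (quoting may vary with your shell):

python src/sampler.py common.sampling.context='O God, O God!' common.sampling.nb_tokens=1000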

More arguments

  • Training: You can disable Weights & Biases logging by adding wandb.mode=disabled.

  • Sampling:

    • You can change the sampling mode by adding common.sampling.sampling_mode=<sampling-mode>, where <sampling-mode> can be argmax, prob, or top5. If not specified, the default is the mode from the configuration file (e.g. argmax). Note that our top5 mode also uses a [softmax] temperature parameter, which may differ from the usual formulation (see the sketch after this list).

    • You can change the [softmax] temperature in prob and top5 by adding common.sampling.temperature=<temperature>, where <temperature> must be positive. If not specified, the default is $1$. A high temperature yields more diverse generated tokens/characters (the distribution gets closer to uniform) but can make the generated text less coherent.

    Example:

    python src/sampler.py common.sampling.nb_tokens=1000 common.sampling.sampling_mode=top5 common.sampling.temperature=1
    
  • Refer to the Hydra documentation for more information.
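
For reference, the temperature-scaled softmax over logits $z$ with temperature $T > 0$ is

$$p_i = \frac{\exp(z_i / T)}{\sum_j \exp(z_j / T)}.$$

Below is a minimal sketch of what top5 sampling with temperature could look like; it illustrates the idea and is not the exact code in src/sampler.py:

```python
import torch

def sample_top5(logits: torch.Tensor, temperature: float = 1.0) -> int:
    """Sample a character index from the 5 most likely logits.

    logits: 1D tensor of unnormalized scores over the character vocabulary.
    """
    values, indices = torch.topk(logits, k=5)            # keep the 5 highest logits
    probs = torch.softmax(values / temperature, dim=-1)  # temperature-scaled softmax over the top 5
    choice = torch.multinomial(probs, num_samples=1)     # draw one of the 5 candidates
    return indices[choice].item()
```

Under the same naming, argmax would take torch.argmax(logits) directly, and prob would sample from the temperature-scaled softmax over the full vocabulary.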

TODO list

  • Clean up the code
  • Implement top-p (nucleus) sampling
  • Fix the train, validation, and test datasets
  • Add K/V caching
  • Evaluate on the validation and test sets
  • Monitor perplexity
  • Add gradient clipping and learning rate decay
  • Monitor overfitting

Credits
