These examples replicate the experiments in the original paper (https://arxiv.org/abs/1805.08574). The training code base is derived from https://github.com/salesforce/awd-lstm-lm, in turn derived from the official PyTorch language model example.
Install PyTorch and alstm, then download the data you want to use (p for Penn Treebank and w for Wikitext-2):
getdata.sh -pw
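getdata.sh puts each corpus in its own directory under data/ with train.txt, valid.txt and test.txt splits (the Wikitext-2 command below points at data/wikitext-2). main.py handles all loading, but if you want to inspect the data yourself, here is a minimal sketch of the column-wise batching the AWD-LSTM code base uses; the data/penn path and the toy whitespace vocabulary are assumptions for illustration only:

```python
# Minimal sketch of the (n_steps, batch_size) batching used by the AWD-LSTM
# code base this repo derives from. The data/penn path and whitespace
# vocabulary below are illustrative assumptions, not the repo's own loader.
import torch

def batchify(ids, batch_size):
    """Reshape a 1-D tensor of token ids into (n_steps, batch_size) columns."""
    n_batches = ids.size(0) // batch_size
    ids = ids[:n_batches * batch_size]          # drop the ragged tail
    return ids.view(batch_size, -1).t().contiguous()

with open('data/penn/valid.txt') as f:          # assumed layout from getdata.sh
    tokens = f.read().split()
vocab = {w: i for i, w in enumerate(sorted(set(tokens)))}
ids = torch.tensor([vocab[w] for w in tokens], dtype=torch.long)

val_data = batchify(ids, 10)
print(val_data.shape)                           # (sequence positions, batch size)
```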
To train the aLSTM on Penn Treebank, run
python main.py --model ALSTM --epochs 190 --emsize 400 --nhid 1150 --nlayers 2 --npar 100 --dropouth 0.25 --dropoute 0.16 --dropouti 0.6 --dropouto 0.6 --dropouta 0.1 --wdecay 1e-6 --var-seq --seq-len 70 --batch_size 20 --cut-steps 100 160 --cut-rate 10 --save
This will give you validation / test perplexities of 58.7 / 56.5.
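The --var-seq --seq-len 70 flags correspond to the variable-length backpropagation-through-time scheme used by the AWD-LSTM code base this repo derives from. The sketch below shows the idea; the exact constants (the 0.95 split, the standard deviation of 5, the minimum length, and the learning-rate rescaling) are the AWD-LSTM defaults and are an assumption about what --var-seq does here:

```python
# Sketch of variable-length BPTT as in the AWD-LSTM code base. The constants
# are the AWD-LSTM defaults and are assumed, not read from this repo.
import numpy as np

def sample_seq_len(base_len=70, rng=np.random):
    # Use the full window most of the time, half of it occasionally,
    # then jitter around that value so batches start at varying offsets.
    window = base_len if rng.random() < 0.95 else base_len // 2
    return max(5, int(rng.normal(window, 5)))

def lr_for_step(base_lr, seq_len, base_len=70):
    # Rescale the learning rate so short windows take proportionally
    # smaller steps than full-length ones.
    return base_lr * seq_len / base_len

seq_len = sample_seq_len()
print(seq_len, lr_for_step(30.0, seq_len))
```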
To train the aLSTM on Wikitext-2, run
python main.py --model ALSTM --epochs 187 --emsize 400 --nhid 1500 --nlayers 2 --npar 100 --dropouth 0.25 --dropoute 0.16 --dropouti 0.6 --dropouto 0.6 --dropouta 0.1 --wdecay 1e-6 --var-seq --seq-len 70 --batch_size 20 --cut-steps 80 160 200 --cut-rate 10 --save --data data/wikitext-2
This will give you validation / test perplexities of 67.5 / 64.5.
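The reported numbers are word-level perplexities, i.e. the exponential of the average per-token cross-entropy (in nats) that the training script reports as its loss:

```python
# Perplexity is exp(average per-token negative log-likelihood in nats),
# so a validation loss of about 4.07 nats corresponds to a perplexity near 58.7.
import math

def perplexity(avg_nll_nats):
    return math.exp(avg_nll_nats)

print(perplexity(4.072))   # ~58.7
```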
The API for the language model is the same as that of the AWD-LSTM, so you can use any of their post-processing scripts, such as fine-tuning, adding a neural cache, or generating samples.
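For instance, a generation script in the spirit of the PyTorch word-language-model example might look like the sketch below. The checkpoint and corpus file names, the corpus.dictionary vocabulary object, and the model(input, hidden) -> (logits, hidden) interface are assumptions; adapt them to whatever main.py --save actually writes:

```python
# Hedged sketch of sampling from a trained checkpoint, modelled on the PyTorch
# word-language-model generate.py. File names and the model interface are
# assumptions, not this repo's documented API.
import torch

model = torch.load('model.pt', map_location='cpu')   # hypothetical checkpoint name
model.eval()

corpus = torch.load('corpus.pt')                      # hypothetical cached vocabulary
ntokens = len(corpus.dictionary)

hidden = model.init_hidden(1)                         # batch size of 1
inp = torch.randint(ntokens, (1, 1), dtype=torch.long)

words = []
with torch.no_grad():
    for _ in range(50):
        logits, hidden = model(inp, hidden)
        probs = torch.softmax(logits.view(-1), dim=0)
        word_idx = torch.multinomial(probs, 1)
        inp.fill_(word_idx.item())
        words.append(corpus.dictionary.idx2word[word_idx.item()])

print(' '.join(words))
```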