Commit 41814349 authored by Deyan Ginev's avatar Deyan Ginev

fix typos in embedding contents

parent eee0ecec
Pipeline #553 passed with stage
in 24 seconds
......@@ -19,11 +19,11 @@ articles, the right of distribution was only given (or assumed) to arXiv itself.
### Contents
- A 5 billion token model for the arXMLiv 08.2017 dataset
- `glove.arxmliv.5B.300d.zip` and `vocab.arxmliv.zip`
- 300 dimensional GloVe word embeddings for the arXMLiv 08.2017 dataset
- `token_model.zip`
- subset word embeddings
- `glove.subset.zip`
- 300 dimensional GloVe word embeddings for the arXMLiv 08.2017 dataset
- `glove.arxmliv.5B.300d.zip` and `vocab.arxmliv.zip`
- 300d GloVe word embeddings for individual subsets
- `glove.subsets.zip`
- the main arXMLiv dataset is available separately [here](/resources/arxmliv-dataset-082017/)
#### Token Model Statistics
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment