Skip to content
Snippets Groups Projects

Update embeddings description with "nomath" controls

Merged Deyan Ginev requested to merge update-arxmliv-embeddings into master
@@ -25,7 +25,8 @@ articles, the right of distribution was only given (or assumed) to arXiv itself.
- `glove.arxmliv.5B.300d.zip` and `vocab.arxmliv.zip`
- 300d GloVe word embeddings for individual subsets
- `glove.subsets.zip`
- Embeddings and vocabulary with math lexemes omitted: `glove.arxmliv.nomath.11B.300d.zip` and `vocab.arxmliv.nomath.zip`
- Embeddings and vocabulary with math lexemes omitted
- `glove.arxmliv.nomath.11B.300d.zip` and `vocab.arxmliv.nomath.zip`
- added on July 20, 2019
- used as a control when evaluating the contribution of formula lexemes
- the main arXMLiv dataset is available separately [here](/resources/arxmliv-dataset-082018/)
Loading