From 41814349e683503a09b4a2ba0a763a74c82cfc7c Mon Sep 17 00:00:00 2001 From: Deyan Ginev <d.ginev@jacobs-university.de> Date: Wed, 24 Jan 2018 17:00:33 -0500 Subject: [PATCH] fix typos in embedding contents --- resources/arxmliv-embeddings-082017.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/resources/arxmliv-embeddings-082017.md b/resources/arxmliv-embeddings-082017.md index d40197c..d769253 100644 --- a/resources/arxmliv-embeddings-082017.md +++ b/resources/arxmliv-embeddings-082017.md @@ -19,11 +19,11 @@ articles, the right of distribution was only given (or assumed) to arXiv itself. ### Contents - A 5 billion token model for the arXMLiv 08.2017 dataset - - `glove.arxmliv.5B.300d.zip` and `vocab.arxmliv.zip` - - 300 dimensional GloVe word embeddings for the arXMLiv 08.2017 dataset - `token_model.zip` - - subset word embeddings - - `glove.subset.zip` + - 300 dimensional GloVe word embeddings for the arXMLiv 08.2017 dataset + - `glove.arxmliv.5B.300d.zip` and `vocab.arxmliv.zip` + - 300d GloVe word embeddings for individual subsets + - `glove.subsets.zip` - the main arXMLiv dataset is available separately [here](/resources/arxmliv-dataset-082017/) #### Token Model Statistics -- GitLab