Commit b8f59c85 authored by Michael Kohlhase

Merge branch 'master' of gl.kwarc.info:SIGMathLing/website

parents 8cd57fa2 12dcfb80
Pipeline #557 passed with stage in 24 seconds
@@ -69,15 +69,24 @@ The failure to exercise any right provided in this Agreement shall not be a waiv
This Agreement and each party's obligations shall be binding on the representatives, assigns and successors of such party.
Each party has signed this Agreement through its authorized representative.
____________________________________________________________________________________________________________________________________________________________________________________________________________________
(Member Signature)
________________________________________________________________________________________________________________________________________________________________ (Typed or Printed Name)
______________________________________________________________________________________________________________________________________ (Member Signature)
____________________________________________________________________________________________________________ (Typed or Printed Name)
Date: ____________________________________________________________ Place: ____________________________________________________________
____________________________________________________________________________________________________________________________________________________________________________________________________________________ (Signature)
______________________________________________________________________________________________________________________________________ (Signature)
Prof. Dr. Michael Kohlhase (for SIGMathLing)
Date: ____________________________________________________________ Erlangen, Germany
@@ -5,7 +5,8 @@ title: arXMLiv 08.2017 - An HTML5 dataset for arXiv.org
Part of the [arXMLiv](https://kwarc.info/systems/arXMLiv/) project at the [KWARC](https://kwarc.info/) research group
### Author
Deyan Ginev,
- Deyan Ginev
### Current release
- 08.2017
@@ -56,8 +57,8 @@ The dataset should be referenced in all academic publications that present results
obtained with its help. The reference should contain the identifier `arXMLiv:08.2017` in
the title, the author, year, a reference to SIGMathLing, and the URL of the resource
description page. For convenience, we supply some records for bibTeX and EndNote below. To
cite a particular part of the dataset use the subset identifiers in the citation; e.g. `
\cite[no_problem subset]{arXMLiv:08.2017}` or just explain it in the text using the
cite a particular part of the dataset use the subset identifiers in the citation;
e.g. `\cite[no_problem subset]{arXMLiv:08.2017}` or just explain it in the text using the
concrete identifier.
#### pure bibTeX
......
@@ -5,7 +5,7 @@ title: arXMLiv 08.2017 - Word Embeddings; Token Model
Part of the [arXMLiv](https://kwarc.info/systems/arXMLiv/) project at the [KWARC](https://kwarc.info/) research group
### Author
Deyan Ginev,
- Deyan Ginev
### Current release
- 08.2017
@@ -19,11 +19,11 @@ articles, the right of distribution was only given (or assumed) to arXiv itself.
### Contents
- A 5 billion token model for the arXMLiv 08.2017 dataset
- `glove.arxmliv.5B.300d.zip` and `vocab.arxmliv.zip`
- 300 dimensional GloVe word embeddings for the arXMLiv 08.2017 dataset
- `token_model.zip`
- subset word embeddings
- `glove.subset.zip`
- 300 dimensional GloVe word embeddings for the arXMLiv 08.2017 dataset
- `glove.arxmliv.5B.300d.zip` and `vocab.arxmliv.zip`
- 300d GloVe word embeddings for individual subsets
- `glove.subsets.zip`
- the main arXMLiv dataset is available separately [here](/resources/arxmliv-dataset-082017/)
#### Token Model Statistics
......