Skip to content
Snippets Groups Projects
Commit 2df560be authored by Frederik Schaefer's avatar Frederik Schaefer
Browse files

download links

parent caef6e79
No related branches found
No related tags found
1 merge request!15download links
......@@ -16,9 +16,10 @@ title: ArGoT 2021 - arXiv Glossary of Terms
- The XML sources total `521 MB` packaged as `.tar.gz` archives.
### Download
- [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arxiv-argot-2021)
- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
### Description
This is the first public release of the ArGoT dataset generated by the [Formal Abstracts](https://formalabstracts.github.io/) research group.
ArGoT is a dataset of term-definition pairs automatically extracted from the arXiv mathematical papers.
......
......@@ -94,8 +94,8 @@ concrete identifier.
```
### Download
[Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2017)
([SIGMathLing members](/member/) only)
- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
### Generated via
- [LaTeXML 0.8.2](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.2),
......
......@@ -93,8 +93,8 @@ concrete identifier.
```
### Download
[Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2018)
([SIGMathLing members](/member/) only)
- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
### Generated via
- [LaTeXML 0.8.3](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.3),
......
......@@ -95,8 +95,8 @@ concrete identifier.
```
### Download
[Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2019)
([SIGMathLing members](/member/) only)
- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
### Generated via
- [LaTeXML 0.8.4](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.4),
......
......@@ -14,7 +14,7 @@ title: arXMLiv 2020 - An HTML5 dataset for arXiv.org
- you also need 1.6 million free inodes to unpack the full data (check via `df -ih .`)
### Download
- [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arxmliv-2020)
- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
### Description
......
......@@ -58,8 +58,9 @@ complete | 5,382,805,349 | 2,573,974 | 746,673
Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082017/#citing-this-resource)
### Download
[Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2017)
([SIGMathLing members](/member/) only)
- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
### Generated via
- [llamapun 0.1](https://github.com/KWARC/llamapun/releases/tag/0.1),
......@@ -264,4 +265,4 @@ python eval/python/distance.py --vocab_file vocab.arxmliv.txt --vectors_file glo
riemmanian 0.621626
riemanian 0.618022
```
\ No newline at end of file
```
......@@ -62,8 +62,9 @@ articles, the right of distribution was only given (or assumed) to arXiv itself.
Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082018/#citing-this-resource)
### Download
[Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2018)
([SIGMathLing members](/member/) only)
- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
### Generated via
- [llamapun 0.2.0](https://github.com/KWARC/llamapun/releases/tag/0.2.0),
......@@ -298,4 +299,4 @@ python2 eval/python/distance.py --vocab_file vocab.arxmliv.txt --vectors_file gl
submanifolds 0.612716
geodesic 0.604488
```
\ No newline at end of file
```
......@@ -55,8 +55,9 @@ articles, the right of distribution was only given (or assumed) to arXiv itself.
Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082019/#citing-this-resource)
### Download
[Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2019)
([SIGMathLing members](/member/) only)
- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
### Generated via
- [llamapun 0.3.4](https://github.com/KWARC/llamapun/releases/tag/0.3.4),
......@@ -317,4 +318,4 @@ python2 eval/python/distance.py --vocab_file vocab.arxmliv.txt --vectors_file gl
submersion 0.600120
```
\ No newline at end of file
```
......@@ -100,8 +100,9 @@ nomath source: `definition/35b170bae4259a5c430846116142d4e4a45097e52daf818b78ea3
```
### Download
[Download link](https://gl.kwarc.info/SIGMathLing/statements-arXMLiv-08-2018)
([SIGMathLing members](/member/) only)
- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
### Generated via
- [llamapun 0.3.2](https://github.com/KWARC/llamapun/releases/tag/0.3.2)
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment