Commit 02f95899 authored by Frederik Schaefer

Merge branch 'downloadlinks' into 'master'

download links

See merge request !15
parents caef6e79 2df560be
1 merge request: !15 download links
Pipeline #4567 passed
@@ -16,9 +16,10 @@ title: ArGoT 2021 - arXiv Glossary of Terms
 - The XML sources total `521 MB` packaged as `.tar.gz` archives.
 ### Download
-- [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arxiv-argot-2021)
+- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
 - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
 ### Description
 This is the first public release of the ArGoT dataset generated by the [Formal Abstracts](https://formalabstracts.github.io/) research group.
 ArGoT is a dataset of term-definition pairs automatically extracted from the arXiv mathematical papers.
......
@@ -94,8 +94,8 @@ concrete identifier.
 ```
 ### Download
-[Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2017)
-([SIGMathLing members](/member/) only)
+- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
+- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
 ### Generated via
 - [LaTeXML 0.8.2](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.2),
......
@@ -93,8 +93,8 @@ concrete identifier.
 ```
 ### Download
-[Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2018)
-([SIGMathLing members](/member/) only)
+- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
+- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
 ### Generated via
 - [LaTeXML 0.8.3](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.3),
......
@@ -95,8 +95,8 @@ concrete identifier.
 ```
 ### Download
-[Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2019)
-([SIGMathLing members](/member/) only)
+- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
+- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
 ### Generated via
 - [LaTeXML 0.8.4](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.4),
......
@@ -14,7 +14,7 @@ title: arXMLiv 2020 - An HTML5 dataset for arXiv.org
 - you also need 1.6 million free inodes to unpack the full data (check via `df -ih .`)
 ### Download
-- [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arxmliv-2020)
+- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
 - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
 ### Description
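The inode requirement quoted in the hunk above (1.6 million free inodes, checked via `df -ih .`) can also be verified from a script before unpacking. The following is only a minimal sketch, assuming a POSIX system; the names `REQUIRED_INODES`, `free_inodes` and `has_enough_inodes` are hypothetical and not part of the dataset tooling.

```python
import os

# Figure taken from the dataset notes; the constant name is an assumption.
REQUIRED_INODES = 1_600_000


def free_inodes(path: str = ".") -> int:
    """Return the inodes available to unprivileged users on path's filesystem."""
    # os.statvfs is available on Unix-like systems only.
    return os.statvfs(path).f_favail


def has_enough_inodes(path: str = ".", required: int = REQUIRED_INODES) -> bool:
    """Return True if the filesystem holding path reports at least `required` free inodes."""
    return free_inodes(path) >= required


if __name__ == "__main__":
    available = free_inodes(".")
    status = "enough" if has_enough_inodes(".") else "NOT enough"
    print(f"{available} free inodes: {status} for {REQUIRED_INODES} files")
```

Some filesystems allocate inodes dynamically and may report zero here, so a negative result is a prompt to inspect `df -ih .` by hand rather than a definitive failure.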
......
@@ -58,8 +58,9 @@ complete | 5,382,805,349 | 2,573,974 | 746,673
 Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082017/#citing-this-resource)
 ### Download
-[Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2017)
-([SIGMathLing members](/member/) only)
+- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
+- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
 ### Generated via
 - [llamapun 0.1](https://github.com/KWARC/llamapun/releases/tag/0.1),
......
@@ -62,8 +62,9 @@ articles, the right of distribution was only given (or assumed) to arXiv itself.
 Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082018/#citing-this-resource)
 ### Download
-[Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2018)
-([SIGMathLing members](/member/) only)
+- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
+- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
 ### Generated via
 - [llamapun 0.2.0](https://github.com/KWARC/llamapun/releases/tag/0.2.0),
......
@@ -55,8 +55,9 @@ articles, the right of distribution was only given (or assumed) to arXiv itself.
 Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082019/#citing-this-resource)
 ### Download
-[Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2019)
-([SIGMathLing members](/member/) only)
+- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
+- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
 ### Generated via
 - [llamapun 0.3.4](https://github.com/KWARC/llamapun/releases/tag/0.3.4),
......
@@ -100,8 +100,9 @@ nomath source: `definition/35b170bae4259a5c430846116142d4e4a45097e52daf818b78ea3
 ```
 ### Download
-[Download link](https://gl.kwarc.info/SIGMathLing/statements-arXMLiv-08-2018)
-([SIGMathLing members](/member/) only)
+- [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data.
+- [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome!
 ### Generated via
 - [llamapun 0.3.2](https://github.com/KWARC/llamapun/releases/tag/0.3.2)
......