From 2df560be484abb48fe51d1a761b821fcb4e1126a Mon Sep 17 00:00:00 2001 From: jfschaefer <jan.frederik.schaefer@fau.de> Date: Thu, 30 Jun 2022 10:05:57 +0200 Subject: [PATCH] download links --- resources/argot-dataset-2021.md | 3 ++- resources/arxmliv-dataset-082017.md | 4 ++-- resources/arxmliv-dataset-082018.md | 4 ++-- resources/arxmliv-dataset-082019.md | 4 ++-- resources/arxmliv-dataset-2020.md | 2 +- resources/arxmliv-embeddings-082017.md | 7 ++++--- resources/arxmliv-embeddings-082018.md | 7 ++++--- resources/arxmliv-embeddings-082019.md | 7 ++++--- resources/arxmliv-statements-082018.md | 5 +++-- 9 files changed, 24 insertions(+), 19 deletions(-) diff --git a/resources/argot-dataset-2021.md b/resources/argot-dataset-2021.md index e920372..30e4f19 100644 --- a/resources/argot-dataset-2021.md +++ b/resources/argot-dataset-2021.md @@ -16,9 +16,10 @@ title: ArGoT 2021 - arXiv Glossary of Terms - The XML sources total `521 MB` packaged as `.tar.gz` archives. ### Download - - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arxiv-argot-2021) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Description This is the first public release of the ArGoT dataset generated by the [Formal Abstracts](https://formalabstracts.github.io/) research group. ArGoT is a dataset of term-definition pairs automatically extracted from the arXiv mathematical papers. diff --git a/resources/arxmliv-dataset-082017.md b/resources/arxmliv-dataset-082017.md index dcda75d..7a4da03 100644 --- a/resources/arxmliv-dataset-082017.md +++ b/resources/arxmliv-dataset-082017.md @@ -94,8 +94,8 @@ concrete identifier. ``` ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2017) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! ### Generated via - [LaTeXML 0.8.2](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.2), diff --git a/resources/arxmliv-dataset-082018.md b/resources/arxmliv-dataset-082018.md index ca0da73..43d133a 100644 --- a/resources/arxmliv-dataset-082018.md +++ b/resources/arxmliv-dataset-082018.md @@ -93,8 +93,8 @@ concrete identifier. ``` ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2018) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! ### Generated via - [LaTeXML 0.8.3](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.3), diff --git a/resources/arxmliv-dataset-082019.md b/resources/arxmliv-dataset-082019.md index 2fd8f56..22e3329 100644 --- a/resources/arxmliv-dataset-082019.md +++ b/resources/arxmliv-dataset-082019.md @@ -95,8 +95,8 @@ concrete identifier. ``` ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2019) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! ### Generated via - [LaTeXML 0.8.4](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.4), diff --git a/resources/arxmliv-dataset-2020.md b/resources/arxmliv-dataset-2020.md index 51ab89f..ef3eb33 100644 --- a/resources/arxmliv-dataset-2020.md +++ b/resources/arxmliv-dataset-2020.md @@ -14,7 +14,7 @@ title: arXMLiv 2020 - An HTML5 dataset for arXiv.org - you also need 1.6 million free inodes to unpack the full data (check via `df -ih .`) ### Download - - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arxmliv-2020) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! ### Description diff --git a/resources/arxmliv-embeddings-082017.md b/resources/arxmliv-embeddings-082017.md index 5494a53..4e37d5d 100644 --- a/resources/arxmliv-embeddings-082017.md +++ b/resources/arxmliv-embeddings-082017.md @@ -58,8 +58,9 @@ complete | 5,382,805,349 | 2,573,974 | 746,673 Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082017/#citing-this-resource) ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2017) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Generated via - [llamapun 0.1](https://github.com/KWARC/llamapun/releases/tag/0.1), @@ -264,4 +265,4 @@ python eval/python/distance.py --vocab_file vocab.arxmliv.txt --vectors_file glo riemmanian 0.621626 riemanian 0.618022 - ``` \ No newline at end of file + ``` diff --git a/resources/arxmliv-embeddings-082018.md b/resources/arxmliv-embeddings-082018.md index 85ea3ec..66a33d0 100644 --- a/resources/arxmliv-embeddings-082018.md +++ b/resources/arxmliv-embeddings-082018.md @@ -62,8 +62,9 @@ articles, the right of distribution was only given (or assumed) to arXiv itself. Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082018/#citing-this-resource) ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2018) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Generated via - [llamapun 0.2.0](https://github.com/KWARC/llamapun/releases/tag/0.2.0), @@ -298,4 +299,4 @@ python2 eval/python/distance.py --vocab_file vocab.arxmliv.txt --vectors_file gl submanifolds 0.612716 geodesic 0.604488 - ``` \ No newline at end of file + ``` diff --git a/resources/arxmliv-embeddings-082019.md b/resources/arxmliv-embeddings-082019.md index 11a21b5..d361955 100644 --- a/resources/arxmliv-embeddings-082019.md +++ b/resources/arxmliv-embeddings-082019.md @@ -55,8 +55,9 @@ articles, the right of distribution was only given (or assumed) to arXiv itself. Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082019/#citing-this-resource) ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2019) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Generated via - [llamapun 0.3.4](https://github.com/KWARC/llamapun/releases/tag/0.3.4), @@ -317,4 +318,4 @@ python2 eval/python/distance.py --vocab_file vocab.arxmliv.txt --vectors_file gl submersion 0.600120 - ``` \ No newline at end of file + ``` diff --git a/resources/arxmliv-statements-082018.md b/resources/arxmliv-statements-082018.md index 2a4d684..d5c9d4d 100644 --- a/resources/arxmliv-statements-082018.md +++ b/resources/arxmliv-statements-082018.md @@ -100,8 +100,9 @@ nomath source: `definition/35b170bae4259a5c430846116142d4e4a45097e52daf818b78ea3 ``` ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/statements-arXMLiv-08-2018) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Generated via - [llamapun 0.3.2](https://github.com/KWARC/llamapun/releases/tag/0.3.2) -- GitLab