diff --git a/resources/argot-dataset-2021.md b/resources/argot-dataset-2021.md index e920372ad6664fe174255bed9f9b53162962dedb..30e4f196bba18741c8a6bdaa168ac7c04225964d 100644 --- a/resources/argot-dataset-2021.md +++ b/resources/argot-dataset-2021.md @@ -16,9 +16,10 @@ title: ArGoT 2021 - arXiv Glossary of Terms - The XML sources total `521 MB` packaged as `.tar.gz` archives. ### Download - - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arxiv-argot-2021) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Description This is the first public release of the ArGoT dataset generated by the [Formal Abstracts](https://formalabstracts.github.io/) research group. ArGoT is a dataset of term-definition pairs automatically extracted from the arXiv mathematical papers. diff --git a/resources/arxmliv-dataset-082017.md b/resources/arxmliv-dataset-082017.md index dcda75dc7a0c80016f857ad4a44c8cd294c76bca..7a4da03009702e0007f649327b74a9ccdb187b4b 100644 --- a/resources/arxmliv-dataset-082017.md +++ b/resources/arxmliv-dataset-082017.md @@ -94,8 +94,8 @@ concrete identifier. ``` ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2017) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! ### Generated via - [LaTeXML 0.8.2](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.2), diff --git a/resources/arxmliv-dataset-082018.md b/resources/arxmliv-dataset-082018.md index ca0da737ef763c18f65e2b7b0a5de829e04e5ffc..43d133ad62a0cfb4a483d3f6bdf9773152f59456 100644 --- a/resources/arxmliv-dataset-082018.md +++ b/resources/arxmliv-dataset-082018.md @@ -93,8 +93,8 @@ concrete identifier. ``` ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2018) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! ### Generated via - [LaTeXML 0.8.3](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.3), diff --git a/resources/arxmliv-dataset-082019.md b/resources/arxmliv-dataset-082019.md index 2fd8f56dd36b583a96c7d9fa74bbbb2d996cd04b..22e33297bfc93e42324e8e832cc1274d49db4561 100644 --- a/resources/arxmliv-dataset-082019.md +++ b/resources/arxmliv-dataset-082019.md @@ -95,8 +95,8 @@ concrete identifier. ``` ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arXMLiv-08-2019) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! ### Generated via - [LaTeXML 0.8.4](https://github.com/brucemiller/LaTeXML/releases/tag/v0.8.4), diff --git a/resources/arxmliv-dataset-2020.md b/resources/arxmliv-dataset-2020.md index 51ab89fbd06c3f30ea344e839c31602b56db8969..ef3eb33c7033d67e3b937cc371cbb7865b220013 100644 --- a/resources/arxmliv-dataset-2020.md +++ b/resources/arxmliv-dataset-2020.md @@ -14,7 +14,7 @@ title: arXMLiv 2020 - An HTML5 dataset for arXiv.org - you also need 1.6 million free inodes to unpack the full data (check via `df -ih .`) ### Download - - [Download link](https://gl.kwarc.info/SIGMathLing/dataset-arxmliv-2020) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! ### Description diff --git a/resources/arxmliv-embeddings-082017.md b/resources/arxmliv-embeddings-082017.md index 5494a5343ad4766ad187f4a95a217e35dbdc4b35..4e37d5d9ea8e8987a36ff9342caf86804005a071 100644 --- a/resources/arxmliv-embeddings-082017.md +++ b/resources/arxmliv-embeddings-082017.md @@ -58,8 +58,9 @@ complete | 5,382,805,349 | 2,573,974 | 746,673 Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082017/#citing-this-resource) ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2017) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Generated via - [llamapun 0.1](https://github.com/KWARC/llamapun/releases/tag/0.1), @@ -264,4 +265,4 @@ python eval/python/distance.py --vocab_file vocab.arxmliv.txt --vectors_file glo riemmanian 0.621626 riemanian 0.618022 - ``` \ No newline at end of file + ``` diff --git a/resources/arxmliv-embeddings-082018.md b/resources/arxmliv-embeddings-082018.md index 85ea3eccebc5b25870d5c8c295422a30cbc30c18..66a33d02d21265116063ea68396252f5d27fa51a 100644 --- a/resources/arxmliv-embeddings-082018.md +++ b/resources/arxmliv-embeddings-082018.md @@ -62,8 +62,9 @@ articles, the right of distribution was only given (or assumed) to arXiv itself. Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082018/#citing-this-resource) ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2018) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Generated via - [llamapun 0.2.0](https://github.com/KWARC/llamapun/releases/tag/0.2.0), @@ -298,4 +299,4 @@ python2 eval/python/distance.py --vocab_file vocab.arxmliv.txt --vectors_file gl submanifolds 0.612716 geodesic 0.604488 - ``` \ No newline at end of file + ``` diff --git a/resources/arxmliv-embeddings-082019.md b/resources/arxmliv-embeddings-082019.md index 11a21b554832f9ec399e14b615b54d7d27757cc9..d361955c52f5c1200ad8c3e96d4b6dc0b596ef48 100644 --- a/resources/arxmliv-embeddings-082019.md +++ b/resources/arxmliv-embeddings-082019.md @@ -55,8 +55,9 @@ articles, the right of distribution was only given (or assumed) to arXiv itself. Please cite the main dataset when using the word embeddings, as they are generated and distributed jointly. [Instructions here](/resources/arxmliv-dataset-082019/#citing-this-resource) ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/embeddings-arXMLiv-08-2019) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Generated via - [llamapun 0.3.4](https://github.com/KWARC/llamapun/releases/tag/0.3.4), @@ -317,4 +318,4 @@ python2 eval/python/distance.py --vocab_file vocab.arxmliv.txt --vectors_file gl submersion 0.600120 - ``` \ No newline at end of file + ``` diff --git a/resources/arxmliv-statements-082018.md b/resources/arxmliv-statements-082018.md index 2a4d68468a3d534b1c219b374b2cf220361ccc0c..d5c9d4d3ec782f9692637692e9f34e16db752817 100644 --- a/resources/arxmliv-statements-082018.md +++ b/resources/arxmliv-statements-082018.md @@ -100,8 +100,9 @@ nomath source: `definition/35b170bae4259a5c430846116142d4e4a45097e52daf818b78ea3 ``` ### Download - [Download link](https://gl.kwarc.info/SIGMathLing/statements-arXMLiv-08-2018) - ([SIGMathLing members](/member/) only) + - [Download links](https://gl.kwarc.info/SIGMathLing/download-links/-/blob/main/README.md). This is a temporary solution as we are in the process of migrating the data. + - [SIGMathLing members](/member/) only. Joining is free and mostly a legal checkmark on our end - all researchers welcome! + ### Generated via - [llamapun 0.3.2](https://github.com/KWARC/llamapun/releases/tag/0.3.2)