diff --git a/resources/grounding-dataset-v1.md b/resources/grounding-dataset-v1.md new file mode 100644 index 0000000000000000000000000000000000000000..91abf814f1458cb468a4df412eec8f5e516f59f5 --- /dev/null +++ b/resources/grounding-dataset-v1.md @@ -0,0 +1,40 @@ +--- +layout: page +title: Dataset for Grounding of Formulae, Version 1 +--- + +### Basic Information + +* Author: Takuto Asakura, AndreĢ Greiner-Petter, Akiko Aizawa, and Yusuke Miyao +* Release date: 2020-03-18 + +### Accessibility and License + +The content of this dataset is licensed to [SIGMathLing members](/member/) for +research and tool development purposes. + +Access is restricted to [SIGMathLing members](/member/) under the [SIGMathLing +Non-Disclosure-Agreement](/nda/) as for most [arXiv](http://arxiv.org) +articles, the right of distribution was only given (or assumed) to arXiv +itself. + +### Description + +This is the first public release of the dataset for grounding of formulae. + +As a trial work, this dataset consists of an annotated long paper (20 pages in +PDF): + +* Simeone, O.: A Very Brief Introduction to Machine Learning with Applications +to Communication Systems. IEEE Transactions on Cognitive Communications and +Networking 4(4) (2018) + +The original XHTML file of the paper was taken from the [arXMLiv:08.2018 +dataset](/resources/arxmliv-dataset-082018/), and we manually annotated all +937 identifiers (i.e., `<mi>` tags) in the document to the corresponding +mathematical objects (meanings). + +### Download + +[Download link](https://gl.kwarc.info/SIGMathLing/dataset-grounding-v1) +([SIGMathLing members](/member/) only) diff --git a/resources/index.md b/resources/index.md index 7b6baf97526600efbac474aba80febfd80c0b5a2..eac938f9f0cd656787250479cb3ab15984dd4acc 100644 --- a/resources/index.md +++ b/resources/index.md @@ -11,6 +11,7 @@ title: SIGMathLing - Datasets and Resources 1. [quantity expressions](/resources/quantity-expressions) 1. [arXMLiv word embeddings, 08.2017 release](/resources/arxmliv-embeddings-082017) 1. [arXMLiv corpus, 08.2017 release](/resources/arxmliv-dataset-082017/) + 1. [Dataset for Grounding of Formulae, v1](/resources/grounding-dataset-v1) ## Resources hosted externally 1. [ACL-math-annotation](http://www-al.nii.ac.jp/acl-math-annotation/)