diff --git a/resources/grounding-dataset.md b/resources/grounding-dataset.md new file mode 100644 index 0000000000000000000000000000000000000000..0aca663f6f1194d989601e30ee2d79f8a83ee455 --- /dev/null +++ b/resources/grounding-dataset.md @@ -0,0 +1,40 @@ +--- +layout: page +title: Dataset for Grounding of Formulae +--- + +### Basic Information + +* Author: Takuto Asakura, AndreĢ Greiner-Petter, Akiko Aizawa, and Yusuke Miyao +* Updated: 2020-03-26 + +### Accessibility and License + +The content of this dataset is licensed to [SIGMathLing members](/member/) for +research and tool development purposes. + +Access is restricted to [SIGMathLing members](/member/) under the [SIGMathLing +Non-Disclosure-Agreement](/nda/) as for most [arXiv](http://arxiv.org) +articles, the right of distribution was only given (or assumed) to arXiv +itself. + +### Description + +This is the project to create a dataset for grounding of formulae. + +As a trial work, this dataset consists of an annotated long paper (20 pages in +PDF): + +* Simeone, O.: A Very Brief Introduction to Machine Learning with Applications +to Communication Systems. IEEE Transactions on Cognitive Communications and +Networking 4(4) (2018) + +The original XHTML file of the paper was taken from the [arXMLiv:08.2018 +dataset](/resources/arxmliv-dataset-082018/), and we manually annotated all +937 identifiers (i.e., `<mi>` tags) in the document to the corresponding +mathematical objects (meanings). + +### Download + +[Download link](https://gl.kwarc.info/SIGMathLing/grounding-dataset-v1) +([SIGMathLing members](/member/) only) diff --git a/resources/index.md b/resources/index.md index 7b6baf97526600efbac474aba80febfd80c0b5a2..98b15279cec43d7a903c7a2009f831fee220cfdc 100644 --- a/resources/index.md +++ b/resources/index.md @@ -12,6 +12,9 @@ title: SIGMathLing - Datasets and Resources 1. [arXMLiv word embeddings, 08.2017 release](/resources/arxmliv-embeddings-082017) 1. [arXMLiv corpus, 08.2017 release](/resources/arxmliv-dataset-082017/) +## Work-In-Progress Resources hosted on the SIGMathLing Repository + 1. [Dataset for Grounding of Formulae](/resources/grounding-dataset) + ## Resources hosted externally 1. [ACL-math-annotation](http://www-al.nii.ac.jp/acl-math-annotation/)