Skip to content
Snippets Groups Projects

Grounding dataset v1

Merged Takuto Asakura requested to merge grounding-dataset-v1 into master
2 files
+ 43
0
Compare changes
  • Side-by-side
  • Inline

Files

+ 40
0
 
---
 
layout: page
 
title: Dataset for Grounding of Formulae
 
---
 
 
### Basic Information
 
 
* Author: Takuto Asakura, André Greiner-Petter, Akiko Aizawa, and Yusuke Miyao
 
* Updated: 2020-03-26
 
 
### Accessibility and License
 
 
The content of this dataset is licensed to [SIGMathLing members](/member/) for
 
research and tool development purposes.
 
 
Access is restricted to [SIGMathLing members](/member/) under the [SIGMathLing
 
Non-Disclosure-Agreement](/nda/) as for most [arXiv](http://arxiv.org)
 
articles, the right of distribution was only given (or assumed) to arXiv
 
itself.
 
 
### Description
 
 
This is the project to create a dataset for grounding of formulae.
 
 
As a trial work, this dataset consists of an annotated long paper (20 pages in
 
PDF):
 
 
* Simeone, O.: A Very Brief Introduction to Machine Learning with Applications
 
to Communication Systems. IEEE Transactions on Cognitive Communications and
 
Networking 4(4) (2018)
 
 
The original XHTML file of the paper was taken from the [arXMLiv:08.2018
 
dataset](/resources/arxmliv-dataset-082018/), and we manually annotated all
 
937 identifiers (i.e., `<mi>` tags) in the document to the corresponding
 
mathematical objects (meanings).
 
 
### Download
 
 
[Download link](https://gl.kwarc.info/SIGMathLing/grounding-dataset-v1)
 
([SIGMathLing members](/member/) only)
Loading