Dataset for Grounding of Formulae
- Author: Takuto Asakura, André Greiner-Petter, Akiko Aizawa, and Yusuke Miyao
- Updated: 2020-03-26
Accessibility and License
The content of this dataset is licensed to SIGMathLing members for
research and tool development purposes.
Access is restricted to SIGMathLing members under the SIGMathLing
Non-Disclosure-Agreement as for most arXiv
articles, the right of distribution was only given (or assumed) to arXiv
This is the project to create a dataset for grounding of formulae.
As a trial work, this dataset consists of an annotated long paper (20 pages in
- Simeone, O.: A Very Brief Introduction to Machine Learning with Applications
to Communication Systems. IEEE Transactions on Cognitive Communications and
Networking 4(4) (2018)
The original XHTML file of the paper was taken from the arXMLiv:08.2018
dataset, and we manually annotated all
937 identifiers (i.e.,
<mi> tags) in the document to the corresponding
mathematical objects (meanings).
(SIGMathLing members only)