arXMLiv 2020 Dataset Released

The 2020 release to the arXMLiv data set has been published, including 1.58 million HTML5+MathML document conversions from arXiv.org.

Details can be found on the corresponding data set resource page.

The content of this data set is licensed to SIGMathLing members for research and tool development purposes subject to the SIGMathLing Non-Disclosure-Agreement.