SIGMathLing - Datasets and Resources

Resources hosted on the SIGMathLing Repository

  1. ar5iv corpus, 04.2024 release
  2. argot dataset 2021
  3. arXMLiv corpus 2020
  4. arXMLiv corpus, 08.2019 release
  5. arXMLiv word embeddings, 08.2019 release
  6. arXMLiv statements dataset, 08.2018 release
  7. arXMLiv word embeddings, 08.2018 release
  8. arXMLiv corpus, 08.2018 release
  9. quantity expressions
  10. arXMLiv word embeddings, 08.2017 release
  11. arXMLiv corpus, 08.2017 release
  12. NTCIR Math Information Retrieval data

Work-In-Progress Resources hosted on the SIGMathLing Repository

  1. Dataset for Grounding of Formulae
  2. Artifact Parameter Dataset

Resources hosted externally

  1. ACL-math-annotation

Additional resources are en route

see the plan for details.