From b72331279fe939bd5c98f96233957a1cb70e2734 Mon Sep 17 00:00:00 2001 From: Michael Kohlhase <m.kohlhase@jacobs-university.de> Date: Tue, 2 May 2017 07:26:23 +0200 Subject: [PATCH] a project --- projects/arXMLiv.md | 8 ++++++++ 1 file changed, 8 insertions(+) create mode 100644 projects/arXMLiv.md diff --git a/projects/arXMLiv.md b/projects/arXMLiv.md new file mode 100644 index 0000000..86a4dc7 --- /dev/null +++ b/projects/arXMLiv.md @@ -0,0 +1,8 @@ +--- +layout: project +menu_title: arXMLiv +title: arXMLiv +start: 2006 +people: mkohlhase,dginev +--- +The [Cornell e-print arXiv](http://arxiv.org) contains one of the largest corpora of scientific literature in the world. Unfortunately, its contents are locked up in the TeX/LaTeX format, which makes it nearly useless for knowledge management techniques. We translate it to XML to have a basis for uncovering it's structural semantics. -- GitLab