diff --git a/projects/arXMLiv.md b/projects/arXMLiv.md
new file mode 100644
index 0000000000000000000000000000000000000000..86a4dc7b920105293127133dac9652f020070142
--- /dev/null
+++ b/projects/arXMLiv.md
@@ -0,0 +1,8 @@
+---
+layout: project
+menu_title: arXMLiv  
+title: arXMLiv
+start: 2006
+people: mkohlhase,dginev
+---
+The [Cornell e-print arXiv](http://arxiv.org) contains one of the largest corpora of scientific literature in the world. Unfortunately, its contents are locked up in the TeX/LaTeX format, which makes it nearly useless for knowledge management techniques. We translate it to XML to have a basis for uncovering it's structural semantics.