Skip to content
Snippets Groups Projects
semantization.md 1.22 KiB
Newer Older
  • Learn to ignore specific revisions
  • Michael Kohlhase's avatar
    Michael Kohlhase committed
    ---
    layout: page
    
    title: Semi-Automated Semantization
    menu_title: Semantization
    
    Tom Wiesing's avatar
    Tom Wiesing committed
    menu_order: 105
    
    Michael Kohlhase's avatar
    Michael Kohlhase committed
    ---
    
    Michael Kohlhase's avatar
    Michael Kohlhase committed
    is the process of making the knowledge and structure in informal representations explicit,
    so that they can be acted upon by machines.
    
    Currently, 99% of the available mathematical kwnoledge is encoded in informal mathematical
    documents: journal articles, books, preprints, handwritten course notes or recordings of
    lectures. To make these accessible to
    [semantic services and knowledge managment systems](kminteract), we must semanticize them.
    
    The KWARC group engages in multiple projects to help along semantization. In the
    [sTeX](/systems/sTeX/) format, we enable authors to semantically prelaop LaTeX documents
    so that we can generate [OMDoc](/systems/OMDoc) representation from them (again via
    [LaTeXML](http://dlmf.nist.gov/LaTeXML)).
    
    In the [arXMLiv project](/systems/arXMLiv) we transform the
    [Cornell ePrint arXiv](http://arxiv.org) into XML with MathML and explicit document
    structure via [LaTeXML](http://dlmf.nist.gov/LaTeXML). In the [LLaMaPuN](/systems/lamapun)
    project we develop libraries for automatically identifying meaning structures in arXMLiv
    documents so that we will eventually be able to harvest OMDoc from the results.