Skip to content
Snippets Groups Projects
intro.tex 3.31 KiB
Newer Older
  • Learn to ignore specific revisions
  • \section{Introduction}\label{sec:introduction}
    
    Andreas Schärtl's avatar
    Andreas Schärtl committed
    
    To tackle the vast array of mathematical publications, various ways of
    \emph{computerizing} mathematical knowledge have been researched and
    developed. As it is already difficult for human mathematicians to keep
    even a subset of all mathematical knowledge in their mind, a hope is
    that computerization will yield great improvement to mathematical (and
    really any) research by making the results of all collected research
    
    readily available and easy to search~\cite{onebrain}.
    
    Andreas Schärtl's avatar
    Andreas Schärtl committed
    
    One research topic in this field is the idea of a \emph{tetrapodal
    search} that combines four distinct areas of mathematical knowledge
    and data. These four kinds being (1)~the actual formulae as \emph{symbolic
    knowledge}, (2)~examples and concrete objects as \emph{concrete knowledge},
    (3)~names and comments as \emph{narrative knowledge} and finally
    (4)~identifiers, references and their relationships, referred to as
    \emph{organizational knowledge}~\cite{tetra}.
    
    
    Tetrapodal search aims to provide a unified search engine that indexes
    each of the four different subsets of mathematical knowledge.  Because
    all four kinds of knowledge are inherently different in their
    structure, tetrapodal search proposes that each kind of mathematical
    
    Andreas Schärtl's avatar
    Andreas Schärtl committed
    knowledge should be made available in a storage backend that fits
    exactly with the kind of data it is providing. With all four areas
    available for querying, tetrapodal search intends to then combine
    the four indexes into a single query interface.
    
    As it stands, tetrapodal search is not really close to usable
    
    Andreas Schärtl's avatar
    Andreas Schärtl committed
    prototypes. Research is focused on providing schemer's, storage backends
    and indexes for the four different kinds of mathematical
    knowledge. The focus of the \emph{ulo-storage} project is the area of
    
    organizational knowledge.
    
    A previously proposed way to structure such organizational data is the
    
    Andreas Schärtl's avatar
    Andreas Schärtl committed
    \emph{upper level ontology} (ULO)~\cite{ulo}. ULO takes the form of an
    OWL~ontology and as such all organization information is stored as
    RDF~triplets with a unified schema of ULO~predicates~\cite{owl}.  Some
    
    effort has been made to export existing databases of formal
    
    Andreas Schärtl's avatar
    Andreas Schärtl committed
    mathematical knowledge to {ULO}. In particular, there exist exports
    from Isabelle and Coq libraries~\cite{uloisabelle, ulocoq}. The
    resulting data set is already quite large, the Isabelle export alone
    containing more than 200~million triplets.
    
    
    Existing exports from Isabelle and Coq result in a single or multiple
    RDF~files. This is a convenient form for exchange and easily versioned
    using Git. However, considering the vast number of triplets, it is
    impossible to query easily and efficiently as it is. Ultimately this
    
    Andreas Schärtl's avatar
    Andreas Schärtl committed
    is what \emph{ulo-storage} work is focused on: Making ULO data sets
    accessible for querying and analysis. We collect the RDF files spread
    from different Git repositories, import them into a database and then
    provide an API for accessing the data set.
    
    Andreas Schärtl's avatar
    Andreas Schärtl committed
    The main contribution of this project is twofold. First, (1) we built
    up various infrastructure components that can make up building blocks
    in a larger tetrapodal search system. Second, (2)~we ran sample
    prototype applications and queries on top of this interface. While the
    applications themselves are admittedly not very interesting, they give
    insight for future development of~{ULO}. The following section
    provides an overview of the involved components in \emph{ulo-storage}.