Commit b3d5e764 authored by Ulrich

text + evaluation

parent 578140a4
Number of documents: 20
Total:
QEs detected correctly: 267
missed QEs: 28
false QEs: 63
astro-ph0001117:
QEs detected correctly: 51
missed QEs: 7
false QEs: 3
astro-ph0008271:
QEs detected correctly: 9
missed QEs: 5
false QEs: 3
astro-ph0011211:
QEs detected correctly: 47
missed QEs: 6
false QEs: 2
astro-ph0012434:
QEs detected correctly: 5
missed QEs: 0
false QEs: 1
astro-ph0102308:
QEs detected correctly: 33
missed QEs: 3
false QEs: 5
cond-mat0001199:
QEs detected correctly: 19
missed QEs: 0
false QEs: 1
cond-mat0002133:
QEs detected correctly: 3
missed QEs: 0
false QEs: 0
cond-mat0003070:
QEs detected correctly: 0
missed QEs: 0
false QEs: 5
cond-mat0007243:
QEs detected correctly: 2
missed QEs: 0
false QEs: 1
cond-mat0009225:
QEs detected correctly: 5
missed QEs: 0
false QEs: 1
cond-mat0011175:
QEs detected correctly: 1
missed QEs: 0
false QEs: 2
cond-mat0102209:
QEs detected correctly: 8
missed QEs: 0
false QEs: 0
hep-ex0006009:
QEs detected correctly: 33
missed QEs: 0
false QEs: 0
hep-lat0007038:
QEs detected correctly: 19
missed QEs: 2
false QEs: 31
hep-lat0008012:
QEs detected correctly: 11
missed QEs: 1
false QEs: 2
hep-ph0010254:
QEs detected correctly: 11
missed QEs: 4
false QEs: 0
hep-th0011187:
QEs detected correctly: 0
missed QEs: 0
false QEs: 1
math0101174:
QEs detected correctly: 0
missed QEs: 0
false QEs: 3
physics0007034:
QEs detected correctly: 5
missed QEs: 0
false QEs: 0
physics0011001:
QEs detected correctly: 5
missed QEs: 0
false QEs: 2
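The counts above can be condensed into precision, recall and F1. The sketch below is not part of the commit. It assumes that "QEs detected correctly" are true positives, "missed QEs" are false negatives and "false QEs" are false positives; the names `Counts`, `precision`, `recall` and `f1` are hypothetical helpers introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Counts:
    """Counts for one document or for the whole corpus (assumed mapping)."""
    correct: int  # "QEs detected correctly" -> true positives
    missed: int   # "missed QEs"             -> false negatives
    false: int    # "false QEs"              -> false positives


def precision(c: Counts) -> Optional[float]:
    """Share of reported QEs that are real; None if nothing was reported."""
    reported = c.correct + c.false
    return c.correct / reported if reported else None


def recall(c: Counts) -> Optional[float]:
    """Share of real QEs that were found; None if the document has no QEs."""
    relevant = c.correct + c.missed
    return c.correct / relevant if relevant else None


def f1(c: Counts) -> Optional[float]:
    """Harmonic mean of precision and recall, written in terms of raw counts."""
    denom = 2 * c.correct + c.false + c.missed
    return 2 * c.correct / denom if denom else None


# Values copied from the report above.
TOTAL = Counts(correct=267, missed=28, false=63)
HEP_LAT0007038 = Counts(correct=19, missed=2, false=31)

if __name__ == "__main__":
    for name, c in [("Total", TOTAL), ("hep-lat0007038", HEP_LAT0007038)]:
        print(name, precision(c), recall(c), f1(c))
```

Under these assumptions the totals give a precision of about 0.81 and a recall of about 0.91. hep-lat0007038 alone contributes 31 of the 63 false QEs and has a precision of only 0.38. Documents without any reported or real QEs return `None` instead of raising a division error.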
<?xml version="1.0" encoding="utf-8"?>
<rdf:RDF xmlns:kat="https://github.com/KWARC/KAT/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:d="http://kwarc.info/semanticextraction/KAnnSpec#">
<rdf:Description><kat:annotation rdf:nodeID="kat_run"/></rdf:Description>
<rdf:Description rdf:nodeID="kat_run"><rdf:type rdf:resource="kat:run"/><kat:date rdf:datatype="xs:dateTime">2017-4-10T11:58:5.000Z</kat:date><kat:tool>KAT</kat:tool><kat:runid>0</kat:runid></rdf:Description>
......
This diff is collapsed.
......@@ -48,7 +48,7 @@ Quantity expressions are spotted as a second step. The results are stored
as separate annotations which contain references to the documents.
Rating the likelihood of the annotations is a subtask of this step.
This is especially relevant, when there is more than one annotation -- this means more than one possible meaning -- for an expression.
-In our experiments, we found up to BLA meanings for an annotation. However, the average during our evaluations is BLA meanings per annotation.
+In our experiments, we found up to 6 meanings for an annotation. However, the average during our evaluations is 1.2 meanings per annotation.
We will refer to a program as \textit{spotter}, if it creates annotations from the document.
In addition to that, we will call a program
......@@ -547,6 +547,7 @@ at the end:
\ednote{Discuss with Michael}
\subsubsection{Quantitative Evaluation}
\ednote{do this}
\begin{itemize}
\item how many documents were handled successful?
......
......@@ -25,7 +25,8 @@
\usepackage{graphicx}
\usepackage{subcaption}
\usepackage{wrapfig}
\usepackage{hyperref}
\def\UrlBreaks{\do\/\do-}
\onehalfspacing
......@@ -226,11 +227,11 @@ quantity expressions and the last recommendation mentions the runtime.
extension which not only involves the adaption of the detection schema but also of
the annotation format.
\item The evaluation of machine learning technologies for semantics extraction might also
-proof useful. A good starting point for that might be the implementation of a
+prove useful. A good starting point for that might be the implementation of a
scoring system based on machine learning. One can either evaluate it using the current
results of the rule based approach or by allowing manual disambiguation for the
users of the unit conversion service and use this data for training and testing.
-\item For his declaration spotter Jan Frederik Schäfer developed a XML-based pattern language.
+\item For his declaration spotter~\cite{janbsc} Jan Frederik Schäfer developed a XML-based pattern language.
Its use for the detection of quantity expressions can be evaluated which can either lead to
the implementation of an additional spotter or to a reimplementation of the current spotter
using the pattern language. An advantage of the pattern language is that the detection
......@@ -246,6 +247,13 @@ quantity expressions and the last recommendation mentions the runtime.
\section{Conclusions}
\newpage
\addcontentsline{toc}{section}{References}
\bibliographystyle{halpha} \bibliography{literatur}
\newpage
%\section*{Index of Abbreviations} \addcontentsline{toc}{section}{Index of
......@@ -257,13 +265,5 @@ quantity expressions and the last recommendation mentions the runtime.
\addcontentsline{toc}{section}{Curriculum Vit\ae} \input{cv.tex}
\newpage
\addcontentsline{toc}{section}{References}
\bibliographystyle{halpha} \bibliography{literatur}
\end{document}