Commit fa84e034 authored by Andreas Schärtl

Merge branch 'master' of gl.kwarc.info:supervision/schaertl_andreas

parents 9100d79b 96cadae9
@@ -37,6 +37,10 @@ Week 18 (27.04.-03.05.)
characters; the lowest of the range is 0xa0, which
is above 0x7c (the pipe symbol)
-> the IRIs in the Isabelle exports have the following
characters which are (according to automated tool
`iriok' [4]) not valid: '|', '\' and ' '
-> importing the files w/ fixed IRIs I get new errors
from graphDB (sigh)
@@ -63,3 +67,4 @@ References
[1] https://www.w3.org/TR/2014/REC-rdf11-concepts-20140225/
[2] https://tools.ietf.org/html/rfc3986
[3] https://tools.ietf.org/html/rfc3987
[4] https://gitlab.cs.fau.de/kissen/iriok
#! /bin/sh
# extract iris from rdf files passed on stdin; really it just returns all
# quoted strings
set -eu
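# match each quoted string separately; a greedy '".*"' would fuse several
# strings on one line into a single match
grep -o '"[^"]*"' | sed 's/"//g'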
#! /bin/sh
set -eu
# prepare DIRECTORY for import w/ the graphdb web
# interface; this is for our prototype, I can't imagine
# we'll use the web interface in production
#
# this script is a fork of xz-to-gz.sh, found in the
# same directory
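if [ "$#" -ne 1 ]; then
    echo "usage: $0 DIRECTORY" 1>&2
    exit 1
fi
directory="$1"
# note: the unquoted expansion below splits on whitespace, so this
# assumes no path under DIRECTORY contains spaces or newlines
files=$(find "$directory" -name "*.xz")
for file in $files; do
    echo "$file" 1>&2
    # the filename of the extracted (uncompressed) file; only strip
    # the trailing .xz suffix
    rdf_file=$(echo "$file" | sed 's/\.xz$//')
    # uncompress, fix iris, compress again; this replaces every '|'
    # in the file, not only those inside IRIs
    unxz "$file"
    sed -i 's/|/%7C/g' "$rdf_file"
    gzip "$rdf_file"
done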
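The loop above rewrites only '|', while the Week 18 notes also list '\' and ' ' as invalid. A minimal sketch of percent-encoding all three follows. The function name `encode_iri_chars` is hypothetical and not part of this commit. It expects one IRI per line on stdin (for example the output of the extraction script), because replacing every space in a whole RDF file would also damage the surrounding markup. Whether graphDB accepts the encoded IRIs is an assumption; the notes report new errors after the '|' fix.

```shell
#! /bin/sh
set -eu

# encode_iri_chars (hypothetical helper): percent-encode the three
# characters reported as invalid in the Isabelle export IRIs.
# Reads one IRI per line on stdin, writes the encoded IRI to stdout.
encode_iri_chars() {
    sed -e 's/|/%7C/g' \
        -e 's/\\/%5C/g' \
        -e 's/ /%20/g'
}

# usage sketch:
#   printf '%s\n' 'http://example.org/a|b' | encode_iri_chars
```

None of the replacements introduces a '|', '\' or ' ', so the order of the three expressions does not matter.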