**Evaluation note:** These built-in evaluation runs are provided as a sanity check that the generated GloVe models pass a basic baseline on the non-expert tasks in the default GloVe suite.
Evaluating the arXiv-based models competitively would require a set of test cases tailored to scientific discourse.
...
...