diff --git a/resources/arxmliv.md b/resources/arxmliv.md
index a5659cd61d5163b426a94c7e49d81346482b6622..df18affe134df1f3a4e26e792ace5732cb8a6a9d 100644
--- a/resources/arxmliv.md
+++ b/resources/arxmliv.md
@@ -43,7 +43,7 @@ The dataset is segmented in 3 different subsets, each corresponding to a severit
 
 This version of the dataset has had minimal manual quality control, and we offer no additional warranty beyond the latexml severity reported.
 
-We welcome community feedback on all of: data quality, representation issues, need for auxiliary resources (e.g. figures, token models), as well as organization and archival best practices. The conversion, build system, and data redistribution efforts are all ongoing projects at the KWARC research group.
+We welcome community feedback on all of: data quality, representation issues, need for auxiliary resources (e.g. figures, token models), as well as organization and archival best practices. The conversion, build system, and data redistribution efforts are all ongoing projects at the [KWARC research group](http://kwarc.info).
 
 A following release is planned for mid-2018, with an up-to-date arXiv dataset and community feedback incorporated. We anticipate annual dataset releases going forward.
 