Are there examples of metadata
(lists of what a site offers, which users search to help them decide what to download)
for non-uniform collections like the
UCI Machine Learning Repository
or openml.org?
Metadata is feasible for collections of files with uniform attributes,
e.g. metadata-standards-and-best-practices-for-data-dictionaries-for-csv-files-data,
but hard for kitchen-sink collections or libraries of books.
(I use the term "TOC", table of contents, interchangeably with "metadata";
experts, please suggest a better term.)
There's quite a range of TOCs for non-uniform collections, depending on:
- users: window shoppers / experts who know exactly what they're looking for
- how users want to search the TOC: click through web pages / download a list of files with attributes / ...
- static / extensible: who can add comments or new files?
- human readable / machine readable / both.
With such a wide range, standards, or even guidelines, are hard to agree on; no single kind of TOC can be good for everybody. But examples of TOCs that work well for particular corners of this range, with detailed descriptions, could help open-data providers improve their own TOCs: which way is up?
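To make one corner of that range concrete, here is a minimal sketch of a downloadable TOC that is both machine readable and human readable. Everything in it is an assumption for illustration: the entries are invented, and the names `TOC`, `search`, `to_text` and `to_json` are hypothetical, not taken from UCI, openml.org or any standard. Each entry has only a name and a description in common; the other attributes differ from entry to entry, as they would in a non-uniform collection.

```python
import json

# Hypothetical TOC entries, invented for illustration (not from any real site).
# Only "name" and "description" are required; other attributes vary per entry.
TOC = [
    {"name": "iris-like", "description": "small tabular flower measurements",
     "format": "csv", "rows": 150, "tags": ["tabular", "classification"]},
    {"name": "street-photos", "description": "labelled street images",
     "format": "png", "files": 12000, "tags": ["images"]},
    {"name": "old-novels", "description": "plain-text public-domain books",
     "format": "txt", "language": "en", "tags": ["text"]},
]

def search(toc, text="", **attrs):
    """Return entries whose name, description or tags contain `text`
    and whose attributes equal every key=value given in `attrs`."""
    text = text.lower()
    hits = []
    for entry in toc:
        haystack = " ".join(
            [entry["name"], entry["description"], *entry.get("tags", [])]
        ).lower()
        if text in haystack and all(entry.get(k) == v for k, v in attrs.items()):
            hits.append(entry)
    return hits

def to_text(entry):
    """One human-readable line, listing whatever extra attributes the entry has."""
    extras = ", ".join(
        f"{k}={v}" for k, v in entry.items()
        if k not in ("name", "description", "tags")
    )
    return f"{entry['name']}: {entry['description']} ({extras})"

def to_json(toc):
    """The same TOC as a machine-readable JSON string."""
    return json.dumps(toc, indent=2)
```

A window shopper might call `search(TOC, "images")`, while an expert might call `search(TOC, format="csv")`; `to_text` and `to_json` serve the human-readable and machine-readable ends from the same list. This says nothing about who may add entries or comments, which is a separate design decision.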
(Should this be a community wiki?)