Questions tagged [metadata]

Metadata is data/information that provides data/information about other data.
Metadata is essentially documentation of data.

There are three distinct types of metadata: descriptive metadata, structural metadata, and administrative metadata.

  1. Descriptive Metadata - describes a resource for purposes such as discovery and identification. It can include elements such as title, abstract, author, and keywords.
  2. Structural Metadata - is metadata about containers of data and indicates how compound objects are put together, for example, how pages are ordered to form chapters. It describes the types, versions, relationships and other characteristics of digital materials.
  3. Administrative Metadata - provides information to help manage a resource, such as when and how it was created, file type and other technical information, and who can access it.

Metadata is documentation that describes data.

Properly describing and documenting data allows users to understand and track important details of the work. Having said properly described and documented metadata about data facilitates search and retrieval of the data when deposited in a data repository/data silo/data warehouse.

Metadata is information about data. Similar to a library catalog record, metadata records document the who, what, when, where, how, and why of a data resource. Geospatial metadata describes maps, Geographic Information Systems (GIS) files, imagery, and other location-based data resources.

Metadata (meta data, or sometimes metainformation) is "data about other data", of any sort in any media. An item of metadata may describe an individual datum, or content item, or a collection of data including multiple content items and hierarchical levels, such as a database schema. In data processing, metadata provides information about, or documentation of, other data managed within an application or environment. This commonly defines the structure or schema of the primary data.

Why Do We Need Metadata?

Metadata are crucial for any potential use or reuse of data; no one can responsibly re-use or interpret data without accompanying metadata that explains how the dataset was created, why, where it is geographically located, and details about the structure and meaning of the data.

There are many uses for metadata, even beyond the simple discovery of datasets. Metadata can be used for understanding data, analysis and synthesis, maintaining longevity of a dataset for an organization, tracking the progress of a research project, and demonstrating the return on investment for research at an institution.

Key Points

  1. Data are not complete without a metadata record.
  2. Use metadata to understand and re-use data.
  3. Document everything about the data in the metadata record.
  4. Use mandated Federal metadata standards and tools to create metadata.
  5. Validate metadata to ensure they follow metadata standards.
  6. Share metadata with catalogs to improve discovery and access to the data.
  7. Metadata are an important component of a USGS data release.

For example, metadata would document data about data elements or attributes, (name, size, data type, etc) and data about records or data structures (length, fields, columns, etc) and data about data (where it is located, how it is associated, ownership, etc.). Metadata may include descriptive information about the context, quality and condition, or characteristics of the data. It may be recorded with high or low granularity.

The underlying concepts of metadata have been in use for as long as collections of information have been organized. For example, the information structure for materials in library card catalogs is a type of metadata that has served as a collection management and resource discovery tool for decades.

Metadata is essential for understanding information stored in data warehouses; using metadata makes it possible to create customizable elements for markup languages such as XML, HTML, XHTML, and SGML.

Metadata is machine understandable (read: machine readable) information for the web.

Metadata is such an imperative in data collection, entire data portals exist consisting solely of metadata about data, such as the City of Philadelphia's Benny the Metadata Catalog, and the Sunlight Foundation's Criminal Hall of Justice.

Core Components of a Metadata Record (According to FGDC)

Metadata Record Information - information about the metadata record including the language in which the record is written, a unique file identifier for the metadata record, the metadata standard used to organize the record, a point of contact for the metadata record, and the date that the metadata record written.

Identification Information – citation-level information about the data including the title, abstract, purpose for creation, status, keywords (theme and place), and extent (temporal, vertical and horizontal). Constraints Information – information about legal and security limitations to data access and use. Data Quality Information – information about the processes and sources used to develop the data and positional and/or accuracy assessments performed.

Constraints Information – information about legal and security limitations to data access and use. Data Quality Information – information about the processes and sources used to develop the data and positional and/or accuracy assessments performed.

Maintenance Information – information about the scope and frequency of data updates. Spatial Representation – information about the mechanism used to represent spatial data (grid, point, vector).

Reference System Information – information about the reference systems used to represent geographic position and time.

Content Information – information about the data set entities and attributes.

Symbology Information – information about the symbols used to represent spatial features.

Distribution Information – information about the data distributors and methods for obtaining the data.

Metadata Extension Information – information about custom, user-based, changes to the elements, domains or conditionality of the standard.

Application Schema Information – information about the schema or data models used to structure the data.

Citations, References and Resources for Metadata

Metadata - GIS Wiki
Geospatial Metadata - FGDC.gov
Wikipedia Entry
Understanding Metadata - What is Metadata, and What is it For? (PDF)
Geospatial Metadata Fact Sheet - FGDC 2011-07 (PDF)
Metadata - USGS Data Management
FSP FAQ: Metadata for USGS Scientific Data
What is Metadata? - Indiana University Knowledge Base
Metadata and Resource Description - W3C
Metadata and Describing Data
Metadata Definition and Examples

91 questions
12
votes
2 answers

Is there a specification for versioning a dataset?

In computer software, semantic versioning (or something like it) is something of a standard for how to version software releases. The Major.Minor.Patch semantics make clear how big of a change has occurred between the present and an earlier release.…
Thomas
  • 1,114
  • 6
  • 14
4
votes
1 answer

Will open data foster a common set of "metadata" standards?

This is just one example, but I'm sure that there can be many others. Government open data will have links to all the relevant license information, which will be found in the metadata. That represents an information "standard." Does this apply only…
Tom Au
  • 541
  • 2
  • 15
4
votes
1 answer

Is it a good idea to think of defining a DCAT vocabulary in other languages?

IMHO, Project Open Data does a great job defining a standard metadata vocabulary based on DCAT for government datasets. Would it be a good idea to start defining DCAT metadata terms in other languages, for example, "título" (Spanish) instead of…
defvol
  • 221
  • 1
  • 5
3
votes
0 answers

Better global elevation data

For my online 3D earth model on guadTree algorithm I need elevation data for all land and oceans. I recently discovered SRTM 1 Arc-Second Global, but it does not have data for the oceans and the most northern and southern latitudes. I studied the…
Ni55aN
  • 171
  • 1
3
votes
0 answers

Is there a taxonomy based on concepts of "fiscal" and "utility" data?

Is there are a theory, division, classification or taxonomy schema for open data? With classes like "fiscal" and "utility". Explaining and illustrating Perhaps "utility" (or "public utility" or day by day util data) is not the correct term... I…
Peter Krauss
  • 343
  • 1
  • 11
3
votes
0 answers

search for data by structure

Is there some way to search for data sets by structure, for example, I might want to search on the internet for time series data by time period, categorical data, share amount of total, or multiple columns of the above. In particular, I want to…
John Carlson
  • 231
  • 1
  • 3
2
votes
0 answers

Get specific metadata from a group of open databases

Use case: I am investigating common characteristics of chosen databases. By databases I understand collections of data, possibly with relations between them. I want to collect metadata for all databases that I could possibly find that meet some…
Sayid
  • 173
  • 3
1
vote
0 answers

How to simulate stock exchange data (OHLC chart) realistically?

The stock market is described in many places (e.g. here) as a random walk. Among the various arguments against this hypothesis, nowhere is mentioned what struck me to be the most obvious one: every stock has a lower bound of 0. A random walk,…
1
vote
0 answers

Number of images being used for the annual product

I am using the v2.1 annual nighttime light (NTL) product for my analysis. My study areas are several megacities, which include: Los Angles, Mexico City, Tehran, Paris, Cairo, Manila, Tokyo. I would like to ask you if you could tell me where to find…
Nikos
  • 173
  • 5
1
vote
1 answer

Adding organization metadata to RSS/Atom feed?

What is the most correct / typical / popular namespace to associate an organization name, logo, and maybe website metadata with an RSS 2 (or Atom) feed entry? Can we just use https://schema.org/Organization for that? Organization can be understood…
Roman Susi
  • 113
  • 3
1
vote
2 answers

Product features / components / attributes data set

I am desperately looking for a way to either create or find an existing data set for products that lists the most common features / components or attributes of that product. An example could be: DSLR…
cwinhall
  • 21
  • 1