This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
At its core, metadata data is data about data, or in plainer language it is a description about a thing.
What Metadata Does
Metadata serves a few basic purposes. Primarily it is used to organize things to make them findable. It also describes things so they can be understood without opening or using them.
What Metadata Does Not Do
Metadata is data about a thing, but it is not the thing itself. Sharing metadata does not result in the full item being shared. It simply lets others know the item exists and where it can be found.
Why Should I Care?
Metadata is pervasive in information systems, and comes in many forms. The core features of most software packages we use every day are metadata-driven. People listen to music via streaming services; post photos online in social media accounts and digital photo platforms, watch videos; manage finances with accounting software; connect with others via email, text, and social media; and store lengthy contact lists on their mobile devices. All of this content comes with metadata—information about the item’s creation, name, topic, features, and the like. Metadata is key to the functionality of the systems holding the content, enabling users to find items of interest, record essential information about them, and share that information with others.
Listen to Sarah and Deblekha talk about the history of Metadata
So what does metadata actually do?
Web pages often have metadata embedded in them. The links from one Web page to others and records of user behaviour—selecting individual pages to view from among lists of search results, for example—are types of metadata as well. Web search engines build up vast indexes that use page text and its attendant metadata to provide relevant search results to users.
Google goes even further. In 2012, it launched a knowledge base it calls the Knowledge Graph with 3.5 billion facts: metadata about 500 million people, places, and things, and the relationships between them. The Knowledge Graph and other structured metadata stored by Google are used to enhance search results and provide other value-added features such as sports scores, integration of search results with maps, and the knowledge cards that appear on the search results screen providing details on notable people and places.
Wikipedia, a free, crowdsourced online encyclopedia, both uses and generates metadata. The Wikimedia Foundation’s Wikidata project is an open and collaboratively edited knowledge base similar to Google’s Knowledge Graph. It stores factual information about topics in structured forms that can be pulled into Wikipedia articles or other information systems. The DBpedia project does the reverse, mining metadata from Wikipedia info boxes, categories, images, geospatial information, and links to generate an open resource of structured metadata that can be reused in countless ways.
Metadata is vital for business transactions, too. Retailers must track many details about the products they carry, including price, source, inventory quantity, and descriptive information. This is even more essential for online businesses; since online shoppers are unable to view items in person, they expect to be able to search by criteria such as keyword and object type, or to use facets to narrow a wide spectrum of products to a more manageable number.
Businesses routinely store metadata about searches and transactions, which enables them to analyze sales trends, predict future demand, pay sales taxes owed to governments, and more. This same metadata allows businesses to provide a more personalized shopping experience, with features such as purchase history, address books for multiple shipping locations, and product recommendations.
Manufacturers use metadata to track designs, parts, and materials, and to manage their research programs. The travel industry similarly relies on metadata about passengers, patrons, and bookings and about resources such as flights and hotel rooms.
News media use metadata to track events, coverage, and published content. All businesses use metadata for human-resource functions such as hiring, payroll, and performance management.
For the arts, metadata can be used for all of these purposes depending on need – discoverability, internal organization, providing technical information about your works, sales, and administration.
Next, Sarah Stang on Metadata is not Neutral – how to think about it critically.