<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"
        "http://www.w3.org/TR/REC-html40/loose.dtd">
<HTML LANG="en-US"><HEAD>
<TITLE>Meta Tag Snob: Metadata Schemas and Initiatives</TITLE>
<base href="http://www.bauser.com/websnob/meta/schemas.html">
<META NAME="description" CONTENT="A list of metadata initiatives supporting
the use of HTML META tags.">
<meta name="keywords" content="meta tag, robots, search engines, meta data
initiatives, meta data schemas">
<meta name="DCTERMS.created" scheme="DCTERMS.W3CDTF" content="2002-05-25">
<!--#exec cgi="../cgi/head.pl"-->
<script src="pop.js" type="text/javascript"></script>
</HEAD><BODY BGCOLOR="#000000" TEXT="#00FF00" LINK="#0000FF" VLINK="#FF00FF">

<p id="breadcrumbs"><a href="/websnob/" rel="Start">Websnob</a> &gt;
<a href="/websnob/meta/">META Tags</a> &gt;
<strong>Schemas and Initiatives</strong></p>

<p class="advert"><!--#exec cgi="../adverts/ad_1.pl"--></p>

<h1>Schemas and Initiatives</h1>

<p>After a few years of <samp>META</samp> tag free-for-alls, the librarians
got involved.</p>

<p>If you've never talked to a full-time, professional, library-science-
educated, certified geek-type librarian about metadata, you really should.
They're experts in the stuff. Most of them wrote their master's theses on
"issues" you've never even thought about. Metadata is their thing.</p>

<p>Librarians' biggest issue with online metadata is that it's all too
chaotic and proprietary. Proprietary <samp>META</samp> is horribly
inefficient, because it causes a search engine to miss information that
isn't using that engine's tags, and forces web authors to learn a dozen
different taggings schemes. Engine-specific tagging is one of the reasons
so many second-tier search sites don't have good databases.</p>

<p>In Real Life, librarians solved this problem years ago, by all agreeing
to use universal schemas like the Library of Congress catalog system or the
Dewey Decimal System. You can learn one system, and understand how to find
things in all the libararies that use that system. More importantly,
publishers can label their books using a system, and have it properly
indexed in libararies all over North America.</p>

<p>Since the mid-1990s, various groups of libary scientists have been
trying to create metadata schemes for the Internet. Some of those
<em>metadata initiatives</em> design "universal" schemas that could apply
to any resource, while others create specialized ones (often based on the
universal ones) that provide detailed metadata for specific subject areas.
Both types of initiatives seek the same goal: Creating a "write once, read
many" scheme that all web authors and web services can use. To date, no
such initiative has gained widespread acceptance, although a few
specialized ones have traction in their specialities.</p>

<p>While there are dozens (if not hundreds) of metadata schemas in
developement or in use, only a few have recommended using the <acronym
title="Hypertext Markup Language">HTML</acronym> <samp>META</samp> element
to label documents. (Most metadata initiatives are moving towards <a
href="/websnob/rdf/">RDF/XML</a>.) Most of the initiatives that do use
<samp>META</samp> tags have adopted the concept of <dfn>prefixing</dfn>:
Each indexing term is prefixed with two or three letters indicating which
metadata schema the tag belongs to. For example, here's a tag that's part
of a mythical "ABC" scheme:

<p><code>
&lt;meta name="ABC.keywords" content="sex, drugs, rock and roll"&gt;
</code></p>

<p>This page only lists metadata initiatives that I've confirmed as having
recommended HTML <samp>META</samp> tags.</p>

<h2 id="AC">A-Core</h2>

<p>A-Core is an apparently defunct initiative to create a metadata scheme
that describes the other metadata in a document. (For example, an "AC.name"
tag doesn't tell you who wrote a document. It tells you who wrote the
document's meta tags.) The HTML version of A-Core has/had five elements,
and used the prefix "AC".</p>

<p>Reference: <a
href="http://metadata.net/ac/draft-iannella-admin-01.txt">The A-Core:
Metadata about Content Metadata</a></p> 

<h2 id="DC">Dublin Core Metadata Initiative</h2> 

<p>The best known metadata intiative on the Web, the <a
href="http://dublincore.org/">Dublin Core</a> began in 1995 at a workshop
in Dublin, Ohio, and has been revised a few times since then. The current
version of the <acronym title="Dublin Core Metadata
Initiative">DCMI</acronym> element set has 16 base elements (which use a
"DC" prefix when used in <samp>meta</samp> elements) and 30 qualified
elements (which are more specific versions of the base tags, and which
<em>should</em> begin with the "DCTERMS" prefix; see below for information
about older versions of Dublin Core).</p>

<p>Dublin Core is noteworthy among metadata schemes in that it also
includes a set of predefined "encoding schemes" that can be used in the
<samp>scheme</samp> attribute of a <samp>meta</samp> element.
Officially-recognized Dublin Core encoding schemes also use a "DCTERMS"
prefix. For example, these two tags use the W3CDTF scheme, which indicates
the dates are encoded using the same format used by the <a
href="http://www.w3.org/">WorldWide Web Consortium</a>:</p>

<p><code>
&lt;meta name="DC.date" content="2000-01-01" scheme="DCTERMS.W3CDTF"&gt;<br>
&lt;meta name="DCTERMS.created" content="2001-01-01" scheme="DCTERMS.W3CDTF"&gt;
</code></p>

<p>(Earlier drafts of the Dublin Core standard encouraged concatenating
qualifiers onto their unqualified equivilents. (For example,
<samp>name="DC.date.created"</samp> instead of
<samp>name="DCTERMS.created"</samp>.) They also allowed page authors to
create custom qualifiers, but the current versions of Dublin Core allow
<em>only</em> <a href="list.html#dcmi">the 46 official values</a>, because
custom values aren't easily converted to other formats like <acronym
title="eXtensible Markup Language">XML</acronym>, and often don't "dumb
down" (correlate with unqualified elements) well.)</p>

<p>If the <samp>content</samp> of a Dublin Core element is a <acronym
title="Uniform Resource Identifier">URI</acronym>, the element may be
better expressed as a <a
href="/websnob/html4/relationships.html#DCMI"><samp>link</samp>
element</a>.</p>

<p>Reference: <a
href="http://dublincore.org/documents/dcq-html/">Expressing Dublin Core in
HTML/XHTML meta and link elements</a></p>

<h2 id="GEM"><acronym title="Gateway to Educational
Materials">GEM</acronym> Project</h2>

<p>The Gateway to Educational Materials Project is a U.S. Department of
Education initiative to locate and label online resources (such as lesson
plans) for teachers. GEM uses several of the <a href="#DC">Dublin Core
elements</a> and eight of its own elements. GEM's elements use the prefix
"GEM".</p>     

<p>Reference: <a
href="http://www.geminfo.org/Workbench/Metadata/GEM_Element_List.html">GEM
Element Set and Profiles Documentation</a></p>

<h2 id="WMDI">Weblog Metadata Initiative</h2>

<p>The <acronym title="Weblog Metadata Initiative"><a href=
"http://www.wmdi.org/" >WMDI</a></acronym> is apparently an attempt to
create a metadata scheme (using several Dublin Core elements, as well as
new elements using the "WMDI" prefix) just for weblogs. The project appears
to be dormant, and I'm not sure the proposed tags were ever used in the
wild.</p>

<p>Reference: <a
href="http://www.wmdi.org/HTML%20Meta%20Spec.htm">Specification for
Encoding WMDI Data in HTML &lt;meta&gt; Tags Version 0.1, 10/20/02</a></p>


<p class="advert"><!--#exec cgi="../adverts/ad_2.pl"--></p>
<!--#exec cgi="../cgi/menu.pl"-->
<!--#exec cgi="../cgi/2002"-->
</BODY></HTML>

