Problem statement: Cultural heritage, bibliographic and archival communities use different controlled vocabularies for the resources that they manage. These controlled vocabularies may not be recognized by very diverse user communities, and ignored by large commercial information hubs and Internet search engines. Metadata needs to flow among diverse environments and reach users wherever they are. The semantic, hierarchical, and granular relationships in controlled vocabularies are often lost when retrieved outside the environment in which they were created.
"Terminology Services are web-based services for controlled vocabularies. More than 4.5 million terms, 2.4 million concept links, and 2 million contextual data elements are accessible to your applications.
Each vocabulary is fully indexed and searchable. Vocabulary data is retrievable in multiple representations including the MARC authority format, used by libraries, and the SKOS Core Vocabulary used in Semantic Web applications."
Thom Hickey describes a project that is controlling millions of headings in OCLC by linking them to the NACO authority files
"Right now we are working our way through the a set of fairly easy 26 million headings, personal names that match an authority record on multiple subfields."
[I guess 'easy' is relative]
The WorldCat Identities project alongs users to search and browse over 25 million personal and corporate authors. Udsing data-maing techniques, OCLC Research has pulled information on authors from the WorldCat database, including alterantive forms of names, publication timelines (by & about), genres and subjects.


