Currently, The Neighborhood Writing Alliance is working on a project in which interns are adding sets of non-hierarchical keywords (sort of like a tag cloud in social media) to an internally accessible bibliographic database of the journal it has published for more than 20 years, The Journal of Ordinary Thought – Or JOT as it is called. The database is being created in a log-in controlled environ called CiteuLike. This application works like other reference-maintenance software available. Users, who are given controlled log-in web-based profiles, build collections and can add multiple levels of information to each bibliographic record. One of the types of information users can apply to records is tags.
I am working as Metadata Specialist on this project, overseeing the work of the interns, editing tag-sets for better search potential and presenting examples through assorted instructional techniques best practices/policy for adding tags to the collection. One of the questions anyone who considers subject access in library catalogs is this notion of “aboutness” – that of determining what a piece of writing or other cultural artifact is about. In other words, if we could attach a subject to some cultural artifact, what would it be, how many subjects can one artifact have and how do we decide? To make matters more abstract, “subjects” themselves are also cultural products based on factors such as who might be in charge at the time, who is most likely to be the common users or viewers of said artifacts and whether there are requisite resources (money and other factors) at the time of creation of the bibliographic record to add or attach all possible permutations. For example, here is a link to a search for manuscript papers connected with Abraham Lincoln in Library of Congress’ holdings. If one clicks on Andrew Johnson Papers, 1783-1947, it is apparent the record contains a summary of that collection’s contents. This summary works to tell what the collection is about. On one level, this bibliographic record contains pure data referring to the collection. But on another level, the writing of the summary is a human-decided process that involves processing (thinking) and writing (also a human experience). It is not obvious what a piece of writing is “about” – even if the writer or bibliographic record creator states so – nor how it will affect the reader or viewer.
These are the fun challenges in front of us on this project. It is underway and progress is being made. The document which holds the tagging Best Practices [which Merriam-Webster’s Collegiate Dictionary defines as performances or forms which excel all others]* is being written along with some other helpful guidelines by way of examples with specific explanations. I think we will each learn a little something along the way.
PS: As always, dialogue is welcome.
* Merriam-Webster’s Collegiate Dictionary, 10th Ed. Springfield: Merriam-Webster, Incorporated, 2001. 108, 912
Your post seems to indirectly discuss some of the differences between humans and machines. A discussion that can be called infinitely relevant to information management with the current rush for linked data, with literal tagging in code having been born from an effort to create and growing toward artificial intelligence. The differences you mention are significant and thought provoking. Thanks.
This article on tagging Vs categories for Web Content is fascinating. http://www.wpbeginner.com/beginners-guide/categories-vs-tags-seo-best-practices-which-one-is-better/