Tuesday, November 07, 2006

More structured metadata

I often encounter people who see my job title (Metadata Librarian) and assume I have an agenda to do away with human cataloging entirely and rely solely on full-text searching and uncontrolled metadata generated by authors and publishers. That’s simply not true; I have no such goal. I am interested in exploring new means of description, not for their own sake, but for the retrieval possibilities they suggest for our users. So here are a few statements that begin to explain my metadata philosophy:

I want more automation. Throwing more money at a manual cataloging process is not a reasonable solution. First of all, it would take waaaaaaayyyyy more money than we can even dream of getting, and second, much metadata creation is not a good use of human effort. Let’s automate everything we can, saving our skilled people for the tasks current automation means are furthest from performing adequately. Let’s get more objective types of metadata, such as pagination, from resources themselves or from their creators (including publishers). Let’s build systems that make data entry and authority control easy. Yes, there will be some mistakes. There will be mistakes if the whole thing is done by humans too. Are catching the few mistakes that will happen from these automated processes more important than devoting our human effort to that extra few resources? More automation means more data total, and the sorts of discovery services I have in mind need lots of that data.

I want more consistency. Users can’t find what’s not there. While we can’t prescribe all records for all resources everywhere have to have a large number of features (I’m against metadata police!), the more of those features that are there mean more discovery options for those users. Imagine a system that provides access to fiction based on geographic setting. Cool, huh? I read one book recently set in Cape Breton Island and can’t wait to get my hands on more. We can’t do that very well today because that data is in very few of our records, and when it is there, isn’t always in the same place. The more consistent we are with our metadata, the better able we’ll be to build those next-generation systems.

I want more structure. I’m a big fan of faceted browsing. The ability to move seamlessly through a system, adding and removing features such as language, date, geography, topic, instrumentation (hey, I’m a musician…), and the like based on what I’m currently seeing in a result set is something I believe our users will be demanding more and more. But we can’t do this if that information isn’t explicitly coded. Instrumentation (e.g., “means of performance”) as part of a generic “subject” string isn’t going to cut it. Geographic subdivisions (even in their own subfield) that are structured to be human- rather than machine-readable also aren’t going to cut it. Nor are textual language notes, [ca. 1846?], or most GMDs. Many of these things can be parsed, and turned into more highly structured data with some degree of success. But why aren’t we doing it that way in the first place? More structure = better discovery capabilities.

What this all means is I’m glad there are lots of extremely bright people with all sorts of perspectives and skills thinking about improved discovery for library materials, but that doesn’t necessarily mean throwing out metadata-based searching. The sorts of systems I envision require more, more highly structured, more predictable, and higher-quality metadata. I want more, not less.

I’ll stand on one last (smallish) soapbox before wrapping this up. In many communities (including both search engines and libraries), discussions about retrieval possibilities often center around textual resources. However, not everything that people are interested in is textual. That’s of course not a surprise, but I’m shocked at how often discovery models are presented that rely on this assumption. I’m all for using the contents of a textual resource to enhance discovery in interesting ways, but we need systems that can provide good retrieval for other sorts of materials too. Let’s not leave our music, our art, our data sets, our maps hanging out to dry while we plow forward with text alone.

25 comments:

Steve Lawson said...

"AMEN!"

(Didn't want to leave you hanging.)

Anonymous said...

Thank you. We need to "smarten up", not dumb down, cataloging/metadata records. Too many people seem to think that we arguing for updating cataloging practices to work better with contemporary technologies are arguing for a 'dumbing down'. Couldn't be more opposite.

Jonathan

Thom Pease said...

Check out Free Library of Philadelphia, Fleischer Collection MARC records which have instrumentation requirements, based on the Daniels Orchestal Music model. (i.e. 1-2-2-2, etc.)

And for audio information...how about getting the timing by actually putting CDs into the computer and letting the application extract it, rather than absurdly ca.-ing eveything; that would be better for machines too. And I don't get me started on name-title analytics. Music and sound recordings continue to be round pegs in a square box world, at least for bibliographic descrption and access purposes. All-Music Guide is not perfect, but at least they got the model right for their information.

markson said...

Information shops are normally actualized on minimal effort office servers that are UNIX or Windows/NT based.Data Analytics Courses

saketh321 said...

I think this is a really good article. You make this information interesting and engaging. ExcelR Data Analytics Courses You give readers a lot to think about and I appreciate that kind of writing.

Anonymous said...

Great blog post,
Digital Marketing Course in Telugu

tejaswani blogs said...

Recently, I came upon your site, which has left a lasting impression on me, and this particular post has piqued my curiosity. I'm looking forward to reading your fantastic article.

Digital Marketing Training In Telugu

hge said...


Web basics for Digital Marketing
Table of Contents

Web basics for Digital Marketing
What is Internet? – Web basics for Digital Marketing
What is a HTTP? – Web basics for Digital Marketing
What is a Web Browser?
Domain names & Domain extensions
Domain Extension:
Internet category Examples:
Country Code Examples:
Domain Name Registration (Backend Process):
User side Steps:
How to choose domain name?
Web Hosting
Types of Web Hosting Services:
Shared Hosting:
Dedicated Hosting:
Cloud Hosting:
What to consider while choosing a hosting?
Which of the 3, Shared, Dedicated, Cloud hosting is preferred?
Web Development – Web basics for Digital Marketing
Types of Websites:
Static Websites:
Dynamic Websites:
Front End Coding:
Back End Coding:
What is CMS?
Why WordPress?
WordPress Installation:
WordPress Dashboard
Installing and customizing Themes.
Creating Categories, Pages and Posts
Creating Categories, Pages and Posts
Adding Widgets
Install Plugins

The Web basics for Digital Marketing will be discussed in this chapter.



We being digital marketers, we are expected to have strong web basics for digital marketing.

physicians email list said...

it decision makers list

racesite.pro said...

Great work ! This is the type of information that are supposed to be shared across the internet. 경마

traininginstitute said...

I have been searching to find a comfort or effective procedure to complete this process and I think this is the most suitable way to do it effectively.
data science training in malaysia

shubham 3ri tech said...

3RI Technologies data analytics courses in pune
3ri Technologies has been committed to providing quality education and knowledge to the data analytics courses in pune We recognize that different students have different training requirements, so we provide a variety of courses to different student groups. 3ri technologies in pune provide the ideal platform to meet the demands of the constantly evolving analytics courses in pune market. Our training ensures IT professionals, business users and decision-makers have the knowledge they need to drive an enterprise effectively..

Unknown said...

I feel very grateful that I read this. It is very helpful and very informative and I really learned a lot from it.ethical hacking training in noida

shubham 3ri tech said...

data Analytics courses in pune
3RI Technologies is the leading institution offering instructor-data analytics courses in pune Fresh graduates and working professionals can also enroll in it.
data analytics Courses in Pune

SCARLET BROWN said...

Thanks for the sensible critique. Me and my neighbor were just preparing to do some research about this. We got a grab a book from our area library but I think I learned more clear from this post. I’m very glad to see such fantastic information being shared freely out there.

야한소설
대딸방
스포츠마사지
출장마사지

Unknown said...

Excellent effort to make this blog more wonderful and attractive. data science training in mysore

Unknown said...

A debt of gratitude is in order for sharing the information, keep doing awesome... I truly delighted in investigating your site. great asset... data scientist course in mysore

Unknown said...

I was taking a gander at some of your posts on this site and I consider this site is truly informational! Keep setting up.. data scientist course in mysore

Unknown said...

Uncommonly in ordinary extraordinarily captivating post. I was looking for such an information and partook in the experience of examining this one. Keep on posting. A responsibility of appreciation is for sharing.data analytics course in bhubaneswar

PMP Training in Malaysia said...

360DigiTMG, the top-rated organisation among the most prestigious industries around the world, is an educational destination for those looking to pursue their dreams around the globe. The company is changing careers of many people through constant improvement, 360DigiTMG provides an outstanding learning experience and distinguishes itself from the pack. 360DigiTMG is a prominent global presence by offering world-class training. Its main office is in India and subsidiaries across Malaysia, USA, East Asia, Australia, Uk, Netherlands, and the Middle East.

Unknown said...

This was really one of my favorite website. Please keep on posting. data scientist course in surat

satyaET said...

Visitnaturesbox.in

Mahil mithu said...

Well, I really appreciated for your great work. This topic submitted by you is helpful and keep sharing...
Experienced Family Lawyers
Multi State Family Law Attorneys

Rosario said...

Hi I have read a lot from this blog thank you for sharing this information. We provide all the essential topics in Web Development Course In Gurgaon for more information just log in to our website Web Development Course In Gurgaon


Mobile app development company said...

It is nice to read such high-quality content. I would like to share information about Mobile app development companies in Pune . Appconsultio is a mobile app development company that specializes in creating high-quality, user-friendly apps for businesses of all sizes. We have a proven track record of success, and our apps have been downloaded millions of times around the world. We are passionate about creating apps that make a difference, and we are committed to providing our clients with the best possible service.