Sunday, January 20, 2008

Musings on RDA, LC Working Group Report, and various other random things

I’ve been staying on the outskirts of the flurry of activity the last few months surrounding RDA and the Library of Congress Working Group on the Future of Bibliographic Control. Various things have been swimming around in my brain during this time, and I think now they’re finally ready to come out. I don’t know that I have any conclusions or suggestions for future directions, but that’s what I see as the function of this blog – to think through things that may or may not end up anywhere interesting.

I read the LC Working Group on the Future of Bibliographic Control’s draft report, and submitted comments via the web form designed for quick email questions rather than substantive feedback in time for the Working Group’s consideration. I didn’t post my comments on the blog, as many others did, as I felt my comments were pretty boring – they mostly were of the type “this paragraph seems to say x, but I wonder if you really meant to say y….” Overall, I was impressed with the report, and thought it represented an admirable vision for the directions in which libraries should be heading. It also struck me as largely avoiding library politics, although I thought it was odd a specific reference to FAST disappeared between the draft and final reports – I wonder what that was about? I liked the boldness of the report pushing attention for special collections, and the tough questions about the continued utility of MARC and LCSH.

But I, like many others, found a bit of schizophrenia in some of the specific recommendations. The report is not afraid to take a bold stand on MARC, but stops well short of recommending a move away from tens of thousands of distributed copies of bibliographic records, and (new in the final version, I think) questions RDA’s move away from ISBD. The report recommends moving quickly to work on new bibliographic frameworks but even more forcefully says that RDA should wait before proceeding. It provides many recommendations discussing how to improve moving information in and out of the catalog but provides little in the way of rethinking the function of the catalog itself. I believe some of this inconsistency is the result of trying to address comments the WG received (although I don’t see any changes related to any of my comments in there!), but most of it is probably due to the fact that this is a committee effort, written and revised on a short schedule. The biggest disappointment for me in the final report was that my favorite recommendation from the draft lost all of its power. In the draft report, one of the recommendations relating to LIS curricula described some extremely technical and theoretical topics as essential to offer. I believe cultivating individuals with both system and information expertise is the single most effective things we can do to ensure libraries play a part in the future information environment. In the final report, this recommendation was sanitized to simply say LIS curricula should include “advanced knowledge and topics.” Bleah. That could mean anything.

One other thing regarding the LC WG report: representatives from both Google and Microsoft served on the Working Group, but I see little if any evidence in the report that these individuals contributed points of view that haven’t been making the rounds within the library community already. That’s unfortunate. We need some outside points of view in this community.

I know the term “bibliographic control” has been questioned in relationship to this report. Roy Tennant suggests “descriptive enrichment instead. I recognize the problems with bibliographic control – it sounds so authoritarian in the face of the open vision the report outlines. But all labels are words, and words have baggage. I’m not clever with names (my dog is named Daisy, if that gives you a sense of how un-creative I am in this area), but I’m skeptical that any brief name could capture what we’re trying to do here. “Descriptive enrichment” to me calls up images of armies of humans manually adding things to records, an image I think we don’t want to be promoting. So I’ll remain neutral on the name issue – if someone comes up with a new one that folks like, I’d be happy to start using it. But I’m unlikely to be the one thinking that new label up.

I read many of the responses to the LC WG report that appeared on blogs, and found myself agreeing with many of the points made, and disagreeing with others. Pretty standard reaction, I suspect. I found OCLC’s response quite odd, however. It had the general tenor of “we’re doing all that stuff already, don’t worry, just trust us…” while at the same time oversimplifying the issues in a way I found totally inappropriate for a response to a committee of experts. For example, the OCLC response touts its FRBR work as testing the WG didn’t realize was happening, but it glosses over the fact that the Work-level clustering and other FRBR-like things OCLC has been doing aren’t true FRBR implementations. This community needs clarity and hard truths on these issues right now, not something that’s been reviewed by marketing. OCLC Research and RLG Programs are now and have been doing extremely interesting things recently, but few if any of them make their way to the productized mainstream of OCLC in ways that promote the state of the art or even fit well with the vision outlined in the LC WG report. I hope OCLC takes the report’s recommendations to heart in the same way LC and the rest of us are trying to do.

I found RDA’s reaction (or lack thereof) to the LC WG report to be of note as well. The folks behind RDA (the “Committee of Principals” for those of you in the know for such things) have on the RDA web site a response dated the same day the WG closed its call for comments. Presumably they submitted this document as an official comment in the appropriate time frame. The response, as the preface to the final LC WG report notes, smacks of “we’re too far along to stop now,” which in my mind is equivalent to “we/he/she have worked really hard, so what we came up with must be good,” which I believe is completely and totally bogus. It also lays the guilt trip on LC – saying basically “we’d hate to lose your input.” What the response doesn’t do is address directly (aside from listing a few pseudo-FRBR implementations that one can’t imagine the LC WG didn’t know about) any of the concerns raised in the report. It looks to me like more of people talking past each other and being defensive rather than trying to find common ground. Of course, the RDA response says they won’t be stopping development, which is no surprise at all. (Really, did anyone think they would? Wishful thinking doesn’t count.)

Through all this, I remain agnostic about RDA. I figure at some point I’m going to have to form an opinion, but frankly, I haven’t had the time to invest to develop an informed one. I haven’t read the last set of drafts (released December-ish), and with previous drafts I had trouble devoting the mental energy to them to see the forest of general vision and effectiveness for the trees of specific rules. I like the idea of more explicit connections to FRBR being behind the new organization, but it looks awfully complex. FRBR of course is complex, but I can’t help wondering if there’s another way to make the connection. I also understand (but again, haven’t seen myself) that the new drafts and/or supporting documents use terminology from the DC Abstract Model, including “literal value surrogate” and the like. I’m as intimidated by the terminology as the next person, but I do think it’s worth it to introduce some intellectual stringency to this process. I’m just not sure how to do that and still make the documents accessible.

Martha Yee has put online a set of cataloging rules she’s developed as a response to what seems to be the insanity surrounding us. I’ve long thought Martha was a clear voice in pushing against the book-centric focus of the cataloging community and realizing the importance of the display of information to users (in addition to just how we store it), but I’ve found myself disagreeing strongly with some of her more recent work that seems not to understand the state of the art with regards to search engines, information retrieval, or artificial intelligence. I haven’t read her cataloging rules yet, but I’m encouraged that she’s come up with some sort of concrete alternative (rather than just complaining, like the rest of us do), and apparently seems to be working towards an RDF model for her cataloging rules – bravo! I think any new set of rules, to be successful, however, need to be written to take advantage of current machine processing technologies. Not having read either the latest RDA drafts or Yee’s rules, I can’t say whether they do this or not. One can only hope.

And hope is where I am for the future of libraries. We have a lot going on right now in libraries, and I consider that a good thing. To use an old adage, we can’t be so afraid we’ll make a mistake that it prevents us from doing anything at all. Because we will make mistakes. We’re human. No matter how many people we involve, no matter how many levels of review we have, there will be things we try that don’t work out. If we realize that ahead of time we’ll be able to recover and try new things that will work. We’ve done so much already, and we have in our community an enormous number of insightful, dedicated individuals with a vision for where we’re going. Now we just have to find a way to let that vision emerge from the bureaucracy and the power of inertia.

5 comments:

Lorcan Dempsey said...

I guess that I sometimes hear FRBR used to refer to the full range of entities proposed in the original report, and sometimes I hear it used as short-hand for a 'work-based' approach.

Jenn Riley said...

I hear that as well, and overall I think that's OK. But the LC WG report pretty clearly says that Work clustering is just one part of the FRBR model and more work needs to be done testing other parts of it.

Jonathan Rochkind said...

Re "bibliographic control", lately I've been saying "metadata control" instead. I've got no real problem with the word 'control'.

Re FRBR terminology, I think it _is_ kind of a problem that frequently 'frbr' is used to mean no more than "putting things into work sets". Because then people think that's what frbr -is-, and don't realize that in fact the importance of frbr is in providing an explicit externalized formal model of the data in our domain.

Jonathan Rochkind said...

And well said on OCLC's response, I think that's exactly it. This is exactly the wrong time to be oversimplifying and telling everyone "Don't worry, everything is fine, and pretty straightforward, and we're taking care of it." Because it ain't so, and it will take community engagement to really take care of it.

Dean Giustini said...

HiJenn,
A fascinating post about RDA and the various responses to it. What I find interesting is the linkage of better descriptive approaches of semantic web technologies and metadata standards in the library community.

Allan Cho and I tried to find common points or linkages in this discourse in our paper 'Web 3.0 and health librarians' for some of the reasons you mention in your post.
Dean