CCQ 43:3/4 on Semantic Web May 25, 2007
Posted by jrochkind in cataloging.add a comment
I’m finally getting around to looking at the Cataloging and Classification Quarterly vol. 43 iss. 3/4. It’s a special issue on semantic web technologies for libraries. I think it could be really good background reading for the discussions some of us are trying to have.
It looks like it’s got some great stuff in it! I recommend everyone take a look. I am particularly excited to read “Library Cards for the 21st Century” by Charles McCathieNevile and Eva Méndez:
“This paper presents several reflections on the traditional card catalogues and RDF (Resourc Description Framework), which is “the” standard for creating the Semantic Web… The central theme of the discussion is resource description as a discipline that could be based on RDF. RDF is explained as a very simple grammar, using metadata and ontologies for semantic search and access. RDF has the ability to enhance 21st century libraries and support metadata interoperability in digital libraries, while maintaining the expressive power that was available to librarians when catalogues were physical artefacts.”
Haven’t read it yet, looking forward to. (I still say that almost all libraries in 2007 are ‘digital libraries’)
My library has online access to CCQ via Haworth Press.
“Broken”, huh? May 25, 2007
Posted by jrochkind in Theory, cataloging.5 comments
Jonathan, you say “our current metadata environment is seriously and fundamentally broken in several ways”. What are the ways in which it is broken? I would say the cataloguing community have just been overtaken by a tsunami of change in the last ten years (mainly the shift to digital information) and is still working out how best to respond and adapt.
I suspected someone would ask that of me after the last post. A definitive argument/explanation for why/what is broken in our current environment has yet to be written, and is not an easy thing to do. All I can do is provide a sketch of some notes toward that thesis, which I’ll try to do here.
My Cataloging/Metadata Credo May 24, 2007
Posted by jrochkind in Theory, cataloging.11 comments
I think our current metadata environment is seriously and fundamentally broken in several ways.
I do NOT think the solution lies in getting rid of everything we’ve got, or in nothing but machine-analysis of full text. I think the solution requires continual engagement by metadata professionals, which will be continually needed. We will always need catalogers—that is, metadata professionals involved in the generation and maintenance of metadata. Because that’s what catalogers are and have always been. (more…)
‘Access Points’ as Identifiers May 22, 2007
Posted by jrochkind in Theory, cataloging.3 comments
An essay I originally posted to rda-l on 14 February 2007, and put here now mainly to have a persistent URL to easily access it. I made a few minor edits for clarity while I was at it. (So this is perhaps a new Expression of my essay, if you’re keeping track).
“Access points” as “Textual identifiers” ?
ruby trick question May 16, 2007
Posted by jrochkind in programming.1 comment so far
Okay, back to nuts and bolts programming.
Can anyone explain exactly what’s going on when ruby does, like “20.minutes.ago”. I mean, #minutes must be a method on numeric values, right? So why can’t I find it included in the rdoc for Integer or Numeric? And then #minutes returns some kind of object that has an #ago method. So, um, what kind? I don’t get it. I like to understand what’s going on.
Expression and Manifestation May 15, 2007
Posted by jrochkind in Theory, cataloging.7 comments
Random musings on the two confusing/controversial FRBR Group 1 entities, which started out as a comment on another forum, but I figure, hey, why not put it here.
Future of Bibliographic Control May 14, 2007
Posted by jrochkind in Theory, cataloging.add a comment
I’m finding that the LC hearings of the Working Group on Bibliographic Control are producing some very valuable discussion. I hope that the report the working group ends up producing will be equally valuable–and I hope that somehow, it can actually effect our discourse and practice, instead of just disappearing into a black hole as most similar contributions over the past 15 years seem to.
In the meantime you can, and I highly encourage you to, read Mark Linder’s notes on the meeting, as well as Diane Hillman’s.
Serials Coverage: Z39.71 vs. ONIX Coverage May 7, 2007
Posted by jrochkind in Practice, Theory, cataloging.3 comments
Serials Coverage
I have an issue I’d like to put on the radar of ILS developers generally, especially open source ILS developers, especially apropos since the Evergreen Serials module is in the process of being developed.
When trying to integrate my Link Resolver with my ILS recently, I wanted to accomplish a task that seems like you’d often want to accomplish: When given a particular journal citation (say, issn, volume, and issue), identify if we have it in print, and identify the particular ILS record(s) that correspond to that serial holding in print.
In our environment, this turned out not to be possible to do in a reasonably confident way. Part of the problem is the Z39.71 standard, which is used to express serials coverage/holdings in a human readable format. While z39.71 holdings statements are theoretically intended to be consistent and maybe even machine-processable—anyone who has tried to machine process them will have discovered they aren’t really suitable for recovering the sort of information needed to perform my task, for example.
On top of that, in many actual ILS environments, catalogers end up entering z39.71 purely by hand. I don’t know if there is even a way to validate z39.71 holdings statements automatically (I suspect there is not, an obvious problem in itself), but I’d guess that in a typical environment around half of z39.71 statements in a corpus are probably not strictly legal z39.71. Whether through typo, cataloger misunderstanding of the standard, or simply lack of concern with following the standard I don’t know, probably a different mix in different institutions.
RDA, JSC, DCAM, RDF, FRBR May 4, 2007
Posted by jrochkind in Theory, cataloging.4 comments
And other impenetrable acronyms.
I share the generalized optimism toward the recent announcement of the DC/RDA joint project.
It’s confusing to talk and think about these sorts of ideas, because to talk about metadata like this, you need to talk so very abstractly. We try to mean very precise things, but we don’t always have the precise words to describe them, or to be understood by people who may not mean the same things by the same words.
I’ve been confused by the DCAM for a while, myself. As I keep circling around and around trying to understand what’s going on, at this particular stage in my circling I’ve found this paper, Towards an Interoperability Framework for Metadata Standards, by Nilsson et al, to be very helpful, and I think I’m getting closer to understanding what DCAM is. When I go and look at my comments made to Pete Johnston’s blog post, linked above, around five months ago, already I wouldn’t ask those same questions now (although I can’t exactly answer them in clear language either–so tricky to talk about this stuff!)
I do start to wonder, though: Is DCAM trying to solve the _exact_ same problem RDF is? Is there any reason to have both? What does DCAM have that the “RDF suite” does not? Nilsson et al do say that “The RDF suite of specifications, however, follow a more similar pattern to the framework presented here.”
Erik Hatcher on what’s needed May 3, 2007
Posted by jrochkind in General.add a comment
Erik Hatcher’s essay on their experiences prototyping blacklight at UVa ought to be required reading for anyone interested in the future of library digital services.
To my mind, the most important point he makes is this:
Let me reiterate that what I see needed is a process, not a product. With Solr in the picture, we can all rest a bit easier knowing that a top-notch open source search engine is readily available… a commodity. The investment for the University of Virginia, then, is not in search engine technology per se, but rather in embracing the needs of the users at a fine-grained level.
This is a point I see many library decision makers not fully grasping. It’s not about buying a product (whether open source _or_ proprietary), it’s about somehow getting multiple parts of the libraries on board in a coordinated effort to focus our work where it matters. The tech may make this possible for the first time–and some tech may be better than other tech–but tech can’t solve things for you. Just plunking your money down for the ‘right’ product from a vendor (Yes, even if that vendor is OCLC!) can not be an end point.
But organizational strategy is a lot harder than just buying an expensive product, unfortunately.