maps, territories, and discretion

Lorcan Dempsey mentions in passing that decisions of which manifestations belong to the same work set is”discretionary at the edges”:

A note on ‘discretionary’. We cluster stuff based on aggregate cataloger choices. I like Tim Spalding’s characterization of the ‘cocktail party test’ in a blog entry about works and LibraryThing.

Regarding ‘discretionary’, I think this is exactly right. It’s important to note that the ‘work set’ is a subjective and contextual choice, not some objective piece of data waiting to be discovered. But that doesn’t mean it’s useless, it’s very important because (in Western culture at least?), the concept of ‘work’ exists, and is of value to users — it’s socially constructed, its got grey edges, reasonable people may disagree in edge cases, but that doesn’t mean it doesn’t exist or isn’t useful!  (For instance, when a patron comes in asking for a copy of Hamlet, try telling her “Sorry, there’s no such thing as ‘Hamlet’, please come back when you can tell me a particular edition published in a particular place at a particular year that you want.”  Ha!)

The FRBR report says: “The concept of what constitutes a work and where the line of demarcation lies between one work and another may in fact be viewed differently from one culture to another.”  Quite right.

(This element of ‘discretion’ is present to some extent in ALL models of reality — and our bibliographic description is indeed a model. And always has been, even if it hasn’t always been formalized, even if it’s been based on traditional implicit shared understanding, not spelled out. The ‘map’ is never the ‘territory’, just a useful abstraction/approximation, with certain discretionary choices made as to be useful to a certain community/context).

So traditional cataloging tries to make these work distinctions (to the extent that they are implied in AACR2 choices like ‘uniform title’, and hopefully more explicit in RDA, but I can’t say for sure) by setting out precise instructions meant to result in choices that match that cultural determination of ‘work’.

LibraryThing tries to do it instead by just relying on members of that culture using their intution, and averaging out everyone’s choices and relying on them to reach consensus through discussion.

Neither is more ‘correct’, and neither is more ‘FRBR’, just two different approaches to trying to create a collective decision about work sets that is useful to users.  Both are discretionary and subjective.

Sometimes this is hard for those in the library community to understand; we seem to attract people who want there to be a ‘right’ and ‘wrong’ answer, who want “but is it REALLY the same work?” to be based on some kind of objective reality with an absolute discoverable ‘correct’ answer.  Sorry, that’s just not reality, modelling reality is inherently full of discretionary choices. But we can, as we traditionally do in library cataloging, set out rules and guidelines to make it as likely as possible that different people at different places will make the same choices.

But we really ought to explain why those rules and guidelines are what they are, and what the goal is. In order to better allow catalogers to use their professional judgement to make the choices most likely to accomplish that goal.

AACR2 rules that kind of sort of provide guidelines for making ‘work set’ decisions, but couch them in terms of simple orthographic decisions for ‘uniform title’, without ever even mentioning that this is really a decision about work sets — gee, it definitely doesn’t help us understand what we’re doing, and try to do it to serve the users as well as possible.  (It also doesn’t help that ‘uniform title’ is meant to express (at least) several other things in addition to ‘work set’ — we really need to record the ‘is in work set X’ decision as it’s own discrete reconstructable data element.) Anyone know if RDA improves on any of this? I still haven’t had the fortitude to try and make it through the RDA draft.

Work-centric or manifestation-centric display?

Lorcan also points out that:

Interestingly, Goodreads and LibraryThing seem to default to a work-based view: the entry is at the work level…   Amazon seems to default to a particular ‘manifestation’ or ‘expression’… Google Books seems to do something similar…. Worldcat.org is more like Amazon and Google. At the moment, it aims to show the most highly held member of a work set in a result, and then link to other editions from that…

I’d be interested if Worldcat is considering trying to make a ‘work’ view the default ‘landing page’ from a search, a bit more like LibraryThing. I suspect this would actually be of more general use than the library legacy practice of always showing individual manifestations as search ‘landing pages’.

Lorcan says: “There are reasons for taking these various approaches and each service make decisions based on what it is trying to do, and the view it takes of its user interests.”  Certainly true as far as it goes — but I’ve never seen a written out clear analysis of what the reasons for the traditional library manifestation-centered display are, what they are trying to accomplish, what user interests we believe they are meeting.

I suspect that in fact this choice isn’t based on any actual clearly thought attempt to meet certain user interests — but instead just because we’ve always done it that way. Because in the card catalog world it was impractical to do otherwise. And in the online world, it takes a bit more work to do otherwise.  Not because doing it this way actually is necessarily optimal for meeting identified user interests.

Please note that I’m not saying that we should ‘catalog at the work level’, whatever that would mean. Our cataloging practices certainly still need to describe manifestations, and there need to be different records for different manifestations. (On the other hand, something like subject cataloging probably is best done once at the work-level, not duplicated effort for every manifestation.) But a work-centric display can still be provided — if there is sufficient data recorded to allow software to reconstruct cataloger decisions about work-set groupings!  Current practice makes this difficult.

(And note that’s why Amazon and Google don’t have workset-centric displays. They don’t have the data to do it! Even Google’s vaunted algorithmic prowess can’t, apparently, determine work set groupings reliably enough to make a work-centric display. At least not within the resources Google is willing to throw at the problem. LibraryThing can do it because of volunteer human labor!  Library cataloging theoretically relies on such human labor, and we certainly spend an enormous amount of person hours in such labor — but don’t actually capture the fruit of that labor in unambiguous enough form to make it easy for the software to take advantage of. Shame on us.)

2 thoughts on “maps, territories, and discretion

  1. Jonathan,

    The main issue that I see in terms of serving the users with Work views is that of translations: do we really want to show users the “original” when the original is in a language they cannot read? In Librarything, when you search on “War and Peace” you get a set of the English-language translations of that work, not a Work page for “Voĭna i mir” that represents the original and all of the translations. This is a level that FRBR does not have: it isn’t expression, because it can include multiple translations, each of which is a unique FRBR expression.

    It looks like WorldCat’s FRBR will default to showing the work in the “language of the catalog” (English, in this case), although the set includes other languages as well. http://frbr.oclc.org/research/pages/000000754.html

    While “Work” may be a useful concept at a research level, in terms of the person looking for something to read, it is less so. We may need to define a new grouping that means something like: all of the versions of this in my language of choice. We surely are going to have to show the user a work title that is in their language of choice. This seems to complicate the UI for the presentation of the FRBR work, especially for catalogs that wish to serve an international audience.

    BTW, OpenLibrary will soon be unveiling its Work pages.

  2. Well, here’s a little fantasy design. This would be better with graphical mockups for those less verbally inclinded, but I’m on vacation and don’t feel like spending the time. (you’d think librarians would be be very verbally inclined, but imagining an interface from a verbal description seems hard for folks, sorry!)

    So, when you do a search, what you get back are “hits” representing the work. Each hit will give you the name/author in your language of interface choice, if the title/author are present in your language of choice. Otherwise, the title/author will be given in the ‘cataloging language’ and/or original language the book is written in, if that’s all that’s there.

    Each hit will also have some ‘aggregate’ info about the work, if there’s more than one copy in the database being searched. How many copies there are, what languages are represented, what formats are represented (print, microform, audiobook, digital, whatever).

    If you want, you can restrict your search to _only_ return works that are in a certain language (like English). The catalog knows your language of choice if you set preferences, so gives you an easy way to choose ‘just English stuff’, or you can click a ‘more options’ link to select some other language. You can click a checkbox for ‘stuff in English only’ before you search — OR after you’ve done a search that includes some works that are not available in English, you get a prominent link at the top of the search results “limit to only works available in English” (exact verbabe may very).

    (You can also limit to ‘only works available in print’ or ‘only works available digitally AND in English’ if you like. Figuring out how to make an understandable interface that allows this kind of sophistication will take a bit of trial and error and user testing, but it’s totally possible. I have some ideas that I’ll mock up some day.)

    Once you click on a ‘hit’ on the results page, you get a listing that gives you info about the work (including a summary in your language of choice, if present), and also a listing of all of the individual items in that work. The items can be grouped in various ways — by language, by format. Not sure what the ‘default’ grouping would be. On this work page listing items, you can also choose to show ‘only items in English’ or ‘only items in microform’ (ha, like anyone ever wants that, just an example). IF on your original search you had chosen ‘only works available in English’ before clicking on a hit, then the items-for-a-work listing is going to default to only showing items in English … but there’s a link on the page “10 more items not in English are available. Click here to show them.”

    So something like that, eh? Is a way to have a work-centric display/interaction that still helps users find what they actually want. When you say “This seems to complicate the UI for the presentation”, do you mean complicate the _implementation_ or complicate things for the user? It definitely is more work to implement (and requires metadata that we sometimes have and sometimes don’t). I don’t think it complicates the interface, I think most users will actually find something like this to be SIMPLER.

    Becuase, especially, don’t forget that our current typical manifestation-centric displays don’t exactly make it easy for users to sort through multiple manifestations of a work to find the one of use to them in the first place! Many of our typical interfaces don’t even allow the user to limit by language of manifestation, and individual manifestations of the same work can be scattered throughout a result set, they aren’t necessarily even adjacent! If you think about how to solve these problems — I think you arrive at a work-centric (instead of manifestation-centric) interface along the lines of what I’ve described.

Leave a comment