<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Bibliographic Wilderness</title>
	<atom:link href="http://bibwild.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://bibwild.wordpress.com</link>
	<description>Gone to Croatoan</description>
	<lastBuildDate>Tue, 03 Nov 2009 17:22:19 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='bibwild.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/ea716c9e33850137ee97dd84756506f2?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>Bibliographic Wilderness</title>
		<link>http://bibwild.wordpress.com</link>
	</image>
			<item>
		<title>digital media in dissertations</title>
		<link>http://bibwild.wordpress.com/2009/11/01/digital-media-in-dissertations/</link>
		<comments>http://bibwild.wordpress.com/2009/11/01/digital-media-in-dissertations/#comments</comments>
		<pubDate>Mon, 02 Nov 2009 03:27:00 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=1013</guid>
		<description><![CDATA[For personal (rather than work) interests, I was interested in the dissertation written by an acquaintance.
Harmony in Bulgarian music
by        Kirilov, Kalin Stanchev, Ph.D., University of Oregon, 2007       , 531 pages; AAT 3294000
I found it in Proquest Dissertations &#38; Theses no problem, and [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=1013&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>For personal (rather than work) interests, I was interested in the dissertation written by an acquaintance.</p>
<blockquote><p><strong>Harmony in Bulgarian music</strong><br />
by        <a href="void(0);"><em>Kirilov, Kalin Stanchev</em></a>, Ph.D., University of Oregon, 2007       , 531 pages; AAT 3294000</p></blockquote>
<p>I found it in Proquest Dissertations &amp; Theses no problem, and shortly had the PDF. Ain&#8217;t the 21st century grand?</p>
<p>But wait, reading the text, it turns out that the dissertation has accompanying CDs.  Listed in the table of contents as &#8220;POCKET MATERIAL: Three Compact Discs&#8230;. Inside Back Cover.&#8221;</p>
<p>But of course I can&#8217;t get the CD&#8217;s from Proquest. That got me started thinking, what if Proquest excepted digital attachments with dissertations? But then I realized they&#8217;d have to get into the much of digital archiving, deciding what formats they accept and developing a plan to maintain them as readable. (This might be unreasonable to expect from a business that currently doens&#8217;t seem to even bother OCR&#8217;ing it&#8217;s digital dissertation PDFs, at least this one wasn&#8217;t).</p>
<p>Then I wondered if maybe the University of Oregon had a dissertation repository that might actually have those CD&#8217;s online.  I mean, they&#8217;re already digital materials, no need for &#8217;scanning&#8217;, just an easy CD rip.  But the likelyhood of this existing didn&#8217;t seem high enough to overcome my laziness and send me on an investigation to see if it existed. (Shoudln&#8217;t <em>that</em> be easier too?)<br />
I wonder if any universities are making available digital attachments (&#8216;pocket material&#8217;) that go with their dissertations.</p>
<p>The technical issues aren&#8217;t much, but the legal issues are probably more of a barrier:  it&#8217;s probably fair use to attach a CD to a single copy of a dissertation regardless of copyright, but not neccesarily to put the same recordings in an online archive, or to make them publically available.</p>
<p>I probably couldn&#8217;t even ILL the dissertation in question; most universities won&#8217;t send their physical dissertations through ILL, will they?  I guess I&#8217;d have to go there and listen to it, or track down the author of the dissertation and ask him for a copy (that will get a lot harder in 100 years, naturally).</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/1013/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/1013/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/1013/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/1013/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/1013/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/1013/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/1013/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/1013/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/1013/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/1013/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=1013&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/11/01/digital-media-in-dissertations/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>interests&#8230;</title>
		<link>http://bibwild.wordpress.com/2009/10/14/interests/</link>
		<comments>http://bibwild.wordpress.com/2009/10/14/interests/#comments</comments>
		<pubDate>Wed, 14 Oct 2009 22:26:38 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=1007</guid>
		<description><![CDATA[&#8230;Of readers, publishers, authors, and libraries all more or less came to a compromise in inherited publishing market. But the digital age is upsetting that compromise, that&#8217;s for sure.
As digital collections grow, Mr. Sargent said he feared a world in which “pretty soon you’re not paying for anything.” In part because of such concerns, Macmillan [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=1007&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>&#8230;Of readers, publishers, authors, and libraries all more or less came to a compromise in inherited publishing market. But the digital age is upsetting that compromise, that&#8217;s for sure.</p>
<blockquote><p>As digital collections grow, Mr. Sargent said he feared a world in which “pretty soon you’re not paying for anything.” In part because of such concerns, Macmillan does not allow its e-books to be offered in public libraries. The company publishes authors like Janet Evanovich, Augusten Burroughs and Jeffrey Eugenides.</p></blockquote>
<p><a href="http://www.nytimes.com/2009/10/15/books/15libraries.html?_r=1&amp;hp">http://www.nytimes.com/2009/10/15/books/15libraries.html?_r=1&amp;hp</a></p>
<p>I&#8217;d note that (at least in the US) the<a href="http://en.wikipedia.org/wiki/First-sale_doctrine"> first sale doctrine</a> would make it <em>impossible</em> for publishers to prohibit libraries from buying and lending physical books &#8212; we legally have that right.  But electronic books, covered by licensing agreements and not covered by the first sale doctrine? Apparently they can tell us we aren&#8217;t allowed to buy them and lend them.</p>
<p>(Could they tell an individual that once purchased (or &#8216;licensed&#8217;), she couldn&#8217;t let anyone else read it on her e-reader, they had to buy their own copy? Maybe.)</p>
<p>Although according to wikipedia, this is actually something of a legal gray area, not entirely decided. Maybe the first sale doctrine does apply to software in general, and e-books in particular. I bet e-books would make a better test case (for those who want to see that it does apply) than software, since they are so analagous to the print books the first sale doctrine was actually intended for.   It would be nice if some library was willing to push it, buy an e-book and lend it out, insisting that the first sale doctrine gave them that right,  even if a publisher insisted they weren&#8217;t&#8217; allowed to do so.</p>
<p>That it&#8217;s libraries involved makes things even <em>more</em> confusing, because there are, according to wikipedia, special exemptions for libraries in certain provisions which specifically exempt computer software from loan or rental under the first sale doctrine.</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/1007/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/1007/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/1007/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/1007/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/1007/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/1007/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/1007/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/1007/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/1007/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/1007/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=1007&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/10/14/interests/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>Opening for dept head at MPOW</title>
		<link>http://bibwild.wordpress.com/2009/10/14/opening-for-dept-head-at-mpow/</link>
		<comments>http://bibwild.wordpress.com/2009/10/14/opening-for-dept-head-at-mpow/#comments</comments>
		<pubDate>Wed, 14 Oct 2009 20:36:27 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=1004</guid>
		<description><![CDATA[A position has been posted for the head of the department in which I work.
Posted in General       <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=1004&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>A <a href="https://hrnt.jhu.edu/jhujobs/job_view.cfm?view_req_id=41440&amp;view=sch">position has been posted</a> for the head of the department in which I work.</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/1004/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/1004/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/1004/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/1004/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/1004/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/1004/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/1004/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/1004/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/1004/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/1004/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=1004&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/10/14/opening-for-dept-head-at-mpow/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>user behavior</title>
		<link>http://bibwild.wordpress.com/2009/10/08/user-behavior/</link>
		<comments>http://bibwild.wordpress.com/2009/10/08/user-behavior/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 17:52:32 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=1002</guid>
		<description><![CDATA[Thanks to Lorcan Dempsey for pointing out this very interesting report on &#8220;Discoverability&#8221; from the University of Minnesota. 
The report basically analyzes the research-acquiring behavior of their users (and academic library users in general via the literature), and comes up with some trends and suggestions for library strategic directions to meet their needs. I recommend [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=1002&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Thanks to <a href="http://orweblog.oclc.org/archives/002012.html">Lorcan Dempsey</a> for pointing out this <a href="http://conservancy.umn.edu/handle/48258">very interesting report on &#8220;Discoverability&#8221; from the University of Minnesota. </a></p>
<p>The report basically analyzes the research-acquiring behavior of their users (and academic library users in general via the literature), and comes up with some trends and suggestions for library strategic directions to meet their needs. I recommend it highly, it&#8217;s got a nice executive summary (which is pretty much all I&#8217;ve made it through so far, but I plan to read more). (Incidentally, why does cut and paste from the PDF result in gibberish? Very annoying. Speaking of usability.)</p>
<p>Somewhere I forget recently I read a piece of science journalism that had a quote from a scientist along these lines:  &#8220;There are two kinds of interesting research findings. Sometimes you discover something you did not expect at all, and sometimes you verify something you suspected but didn&#8217;t yet have sufficient evidence for.&#8221;</p>
<p>Most (but not all) of what&#8217;s in the report is in the second category for me, but of course that doesn&#8217;t impeded it&#8217;s use, the evidence-based findings is important.</p>
<p>Much of what&#8217;s in the report resonate with existing ideas I had for intermediate term development of library digital services, with implementation ideas often made possible by Umlaut.</p>
<h2>Use of non-library discovery interfaces</h2>
<blockquote><p>Trend 1. Users are discovering relevant resources outside traditional library systems.</p>
<p>[...]</p>
<p>[Suggestion:] &#8230;We need to ensure that items in our collection are and licensed resources are discoverable in non-library environments.</p></blockquote>
<p>I&#8217;d on to this &#8220;ensure that library services for making use of items are accessible even when the user starts from a non-library environment.&#8221; (Eric Lease Morgan has talked a bit about this).</p>
<p>One of my long-standing goals for Umlaut I think resonates with this. Umlaut is designed from the start to be a &#8216;landing page&#8217; for library services for a known item. No matter where you find an item, if you want to find out how you can get it from the library or what library services exist from it, Umlaut will do that for you.</p>
<p>Umlaut, like any link resolver, traditionally does this by working with licensed vendor discovery services that send an OpenURL link to Umlaut. But the problem is that users are using many discovery interfaces that don&#8217;t this, and are unlikely to do this in the near term (for various reasons, including business interests of the operators of those services). So what can we do?</p>
<p>Well, one thing I really want to do is customize LibX to work optimally with Umlaut, adding links to the Umlaut page to the third party discovery interface. (or even Umlaut-provided services directly on the third party page).  Find a book in Amazon, Google Books, or a variety of other places? No problem, we can still connect you to Library availability and services in one click (or zero clicks if the info is inserted directly on the page!).</p>
<p>It&#8217;s unfortunate that a browser plugin (which works only with IE or FF, not Safari, not custom smartphone web browser, etc) is required for this, but I can&#8217;t think of much other way around. Another possible &#8216;fallback&#8217; interface could be providing the URL of the page you are looking at to a server-side application, which does LibX-style processing on the server, and then tells you what the library can do for you for items found on that page. This might be the best &#8216;fallback&#8217; option for users who can&#8217;t install a LibX style plugin.</p>
<h2>Delivery</h2>
<blockquote><p>Trend 2: Users expect discovery and delivery to coincide. Searchers do not distinguish between discovery and delivery in their web searches&#8230;</p>
<p>[Suggestion] &#8230;systems, data, and information should be optimized for fulfillment.</p></blockquote>
<p>This gets to another long standing desire I&#8217;ve had &#8212; to unify my libraries various delivery mechanisms in one simple interface with as few clicks as possible.</p>
<p>We offer a variety of pretty useful delivery mechanisms. Sure, sometimes there&#8217;s immediately available electronic text.  When there&#8217;s not, there&#8217;s ILL.  For some combinations of user type and material type, we&#8217;ll also deliver physical copies in our stacks directly to your office; or make a scan of a chapter or article from a volume in our stacks and email it to you. But other materials are in-library use only &#8212; some of these you can pull of the stacks yourself, and others of these you need to request a pull and view it in a special office (eg special collections; also some AV materials).</p>
<p>Pretty darn useful, but we have a variety of different forms and interfaces (at least 6 different ones, if not more) to make a variety of different requests.  You&#8217;ve got to know they exist, find em, fill em out.</p>
<p>Instead, I&#8217;m imagining a &#8216;delivery menu&#8217; (brand it like a chinese takeaway menu if you want to be cute) that figures out, based on who you are, whether the item is in our stacks or not, and what type of material it is, tell you exactly what you can do with this. You can view this in the library. You can check this out. You can have it pulled for you and waiting at the circ desk to check out. You can have it delivered to your office. You can get a photocopy of a specific article emailed to you. Etc. And present all this information &#8212; and provide actions to choose an option &#8212; in as few clicks as possible.</p>
<p>Combined with the prior note, we can imagine that a user finds an item of interest on Amazon, and then in as few clicks as possible (0-3) finds out what delivery options are available to her, and chooses one of them, knowing how long we approximate it will take to get to her depending on her choice.  Of course, this &#8216;delivery menu&#8217; would be available in library discovery interfaces too, but the real power is in combining this with acknoledgement of the &#8220;using non library discovery services&#8221; trend.</p>
<p>This is totally do-able, especially on the Umlaut platform. The real challenge is on the business process end, not the technical end. (Consolidating and rationalizing all our delivery options, potentially requiring changes in staff workflow or policies to make everything make sense).</p>
<h2>Mobile</h2>
<blockquote><p>Trend 3. Usage of portable Internet-capable devices is expanding.  Rather than just supplementing the desktop computer, mobile devices are poised to become the primary means of Internet access for a critical mass of users.</p></blockquote>
<p>This one is trickier to figure out how to address, especially when combined with this reccommendation from the report:</p>
<blockquote><p>&#8230;we should strive to be end-user device/platform agnostic.</p></blockquote>
<p>Taking account of that I wouldn&#8217;t actually move to &#8220;develop an iPhone app&#8221; for it, as seems to be a popular trend. We don&#8217;t have the resources to develop and maintain custom apps for every possible advanced mobile device in use.</p>
<p>Instead, I&#8217;d develop special stylesheets for our core services that divide and format pages appropriately for an iPhone or similar next generation smartphone &#8220;high resolution mobie display, significantly smaller than a laptop or desktop.&#8221;  [Look for an upcoming article in Code4Lib journal adressing how you start doing this.]</p>
<p>Additionally, I&#8217;ve thought before about developing some SMS (aka &#8216;txt message&#8217;) interfaces to meet the lowest common denominator of cellphone mobile net access.  I would like it if you could text an ISBN (or ISSN, or even DOI) to Umlaut, and Umlaut would text you back with whether the library has it and what you can do. &#8220;Reply with the numeral 1 to place an ILL request for this item.&#8221; (Or other appropriate options.). Also, if you happen to have a camera cell phone, why type in an ISBN when you can snap a picture of a barcode, and MMS it to Umlaut instead?</p>
<p>Again, totally do-able, especially with Umlaut as a platform.</p>
<h2>Recommendation Systems</h2>
<blockquote><p>Trend 4: Discovery increasingly happens through recommending. Facilitating discovery requires us to develop and implement systems that push relevant content to users and allows users to share content with others.</p></blockquote>
<p>This one is harder.</p>
<p>The study recommends:</p>
<blockquote><p>We should capture the data necessary to provide targetted suggestions to users and defer to network-level systems where a critical mass already exists.</p></blockquote>
<p>Umlaut was in fact originally intended by Ross Singer to capture that data to provide those systems, but those features were never really matured, and are not currently present in Umlaut. Umlaut as a platform is still potentially a key point to capture data &#8212; but the study&#8217;s point that we need to move this data to aggregated systems with a critical mass is key. Just data from my institution is not going to cut it to algorithmically provide useful recommendations.</p>
<p>Perhaps an Umlaut that captures data and then sends it to a cross-institution SOPAC installation? (I&#8217;m not sure if the SOPAC infrastructure can handle article, rather than book/title, citation data or not).</p>
<p>Ex Libris&#8217;s bX service is designed to do this too,  although currently can only take source data from a stock SFX installation (not from Umlaut); it could still be used to provide recommendations in Umlaut, if we wanted to pay for it.  I expect more vendors to start adding such services.</p>
<p>As a stop-gap, Umlaut currently <em>does</em> provide links on it&#8217;s &#8220;landing page&#8221; to the recommendation services we were already paying for: Scopus and ISI Web of Knowledge. On a landing page for an item, Umlaut gives you one-click access to Scopus or ISI&#8217;s &#8220;similar items&#8221;.  (which are not based on usage, but based on reference and metadata similarity).</p>
<h2>Non-traditional objects</h2>
<blockquote><p>Trend 5. Our users increasingly rely on emerging nontraditional information objects. The format of useful and discoverable information is much broader than those traditionally offered through the libraries; users increasingly rely upon multimedia objects, data sets, blogs, and other &#8220;grey&#8221; objects to meet their information needs.</p></blockquote>
<p>Okay, this one is a stumper. Nothing in my pre-existing idea bag meets it, I&#8217;ve got nothing.</p>
<p>The issue is that my services, such as Umlaut, really rely on pre-existing databases/knowledge bases the library has of items and what we can do with them. Including both very traditional databases (the catalog) and more recent ones (the link resolver knowledge base).</p>
<p>But almost all of these databases can really only &#8216;control&#8217; pretty traditional information.  If a user comes to my service with a dataset she&#8217;s interested in, my software doesnt&#8217; really have any good way to figure out what we can do for her with that dataset, if we have it in the library, if she can get it ILL, where it is on the internet. I&#8217;m pretty much at a loss. (If the dataset has a DOI, and that DOI can be provided, I&#8217;m in a bit better shape).</p>
<p>I&#8217;d note that the study&#8217;s recommendations don&#8217;t provide much actionable advice on this one either. It&#8217;s a toughy, and requires rethinking larger swaths of library operation to address, I can&#8217;t identify many intermediate-term ways to address it, although maybe it just needs some more clever thinking.</p>
<h2>Umlaut</h2>
<p>I remain pleased at how well-positioned Umlaut is as a platform to address most of the trends identified.  Umlaut exists as a flexible platform for &#8220;known item services&#8221; &#8212; for making a &#8216;landing page&#8217; (or inserting services on a foreign page via javascript) for giving the user delivery/access options and other library services for a known item &#8212; regardless of where the user found the known item, through a library search service, a licensed vendor search service, or a third party web discovery service.</p>
<p>I am more and more convinced that this is a key piece of library infrastructure for the foreseeable future, and the investments we&#8217;ve made in it so far are very worthwhile.</p>
<p>But while the platform is there as an appropriate place to add features, as identified above, actually adding the features takes time and resources. I hope I get the time to work on some of them in the intermediate future. Perhaps this report will help make it more clear to some resource allocators the necessity of some of these directions.</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/1002/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/1002/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/1002/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/1002/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/1002/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/1002/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/1002/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/1002/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/1002/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/1002/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=1002&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/10/08/user-behavior/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>cataloging and &#8216;citations&#8217;</title>
		<link>http://bibwild.wordpress.com/2009/09/30/cataloging-and-citations/</link>
		<comments>http://bibwild.wordpress.com/2009/09/30/cataloging-and-citations/#comments</comments>
		<pubDate>Thu, 01 Oct 2009 00:20:51 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=999</guid>
		<description><![CDATA[So my understanding is that many &#8216;entries&#8217; in a cataloging record are meant to be &#8216;citations&#8217;. They are meant to unambiguously identify the work cited.   In the age when cataloging rules were created, what you&#8217;d do with that unambiguous citation was simply look it up in a printed or card catalog.
But the very precise rules [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=999&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>So my understanding is that many &#8216;entries&#8217; in a cataloging record are meant to be &#8216;citations&#8217;. They are meant to unambiguously identify the work cited.   In the age when cataloging rules were created, what you&#8217;d do with that unambiguous citation was simply look it up in a printed or card catalog.</p>
<p>But the very precise rules involving &#8216;main entry&#8217; and &#8216;uniform title&#8217; should, I believe, allow software to unambiguously find the target of the citation in a database, if it&#8217;s there.</p>
<p>I am at the very beginning stages of figuring out how to do this exactly, it&#8217;s not exactly simple.</p>
<p>If it turns out that you can&#8217;t even do this, I&#8217;m <em>really</em> going to think that much of the very complicated and time-consuming cataloging rules are irrelevant in the post-card-catalog age. But we&#8217;re not there yet.</p>
<p>Initial signs, however, aren&#8217;t very good. Take this example from <a href="http://www.oclc.org/bibformats/en/7xx/76x-78x.shtm">OCLC docs on 76x-78x linking fields.</a></p>
<blockquote><p>The first choice for identification is the uniform title. If available, use the entire uniform title (e.g., title and qualifier) to identify the related publication. If the uniform title is unavailable, use the main entry and title proper. For example, if OCLC record number 6597310 has the following uniform title:</p>
<table border="0" cellspacing="0" cellpadding="1">
<tbody>
<tr valign="top">
<td width="30" align="left">130</td>
<td width="10" align="right">0</td>
<td width="17" align="left"></td>
<td width="90%" align="left">Monthly digest of statistics (Zimbabwe. Central Statistical Office)</td>
</tr>
</tbody>
</table>
<p>It would be linked to the related publication in <a href="http://www.oclc.org/bibformats/en/7xx/780.shtm">field 780</a>.</p>
<table border="0" cellspacing="0" cellpadding="1">
<tbody>
<tr valign="top">
<td width="30" align="left">780</td>
<td width="10" align="right">0</td>
<td width="17" align="left">0</td>
<td width="90%" align="left"><span style="font-family:Arial,Helvetica,sans-serif;">‡</span>t Monthly digest of statistics (Zimbabwe. Central Statistical Office) <span style="font-family:Arial,Helvetica,sans-serif;">‡</span>w (OCoLC)6597310</td>
</tr>
</tbody>
</table>
</blockquote>
<p>Okay, fair enough. And a referenced uniform title should indeed allow us to unambiguously identify records belonging to the cited work.  But wait. That title is clearly a uniform title, it&#8217;s given in a 130.</p>
<p>But in the 780 example then&#8230; shouldn&#8217;t that title be in subfield &#8217;s&#8217;, not &#8216;t&#8217;? 780 subfield s is <a href="http://www.oclc.org/bibformats/en/7xx/780.shtm">clearly documented</a> as &#8220;uniform title&#8221;, right?</p>
<p>But wait, $t says: it is indeed used for title elements from a 245 <em>or</em> a 130.  Subfield &#8216;u&#8217; is only used for field 240 entered uniform titles.</p>
<p>So wait, when citing a work in a 780, you put a uniform title in subfield s if it&#8217;s title-main-entry, but you put it in subfield t if it&#8217;s author main entry? And when you find a title in t, there&#8217;s no way to know if it&#8217;s a uniform (controlled) title, or a transcribed (245) title?</p>
<p>Um. So, um.  I am kinda speechless. If you&#8217;re going to spend all these expensive cataloger hours following very precise rules, wouldn&#8217;t it be sensible to make the rules result in data that can actually be interpreted to do what&#8217;s it&#8217;s supposed to do?</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/999/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/999/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/999/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/999/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/999/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/999/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/999/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/999/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/999/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/999/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=999&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/09/30/cataloging-and-citations/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>More MARC issues: 700</title>
		<link>http://bibwild.wordpress.com/2009/09/28/more-marc-issues-700/</link>
		<comments>http://bibwild.wordpress.com/2009/09/28/more-marc-issues-700/#comments</comments>
		<pubDate>Mon, 28 Sep 2009 19:35:57 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=995</guid>
		<description><![CDATA[So, okay, here&#8217;s another puzzle for the catalogers.
A 700 (or 7xx in general) could be an &#8216;analytic&#8217;, representing one element that&#8217;s the contents of the item cataloged. OR could just represent a contributor (who isn&#8217;t &#8216;main entry&#8217;) to the work. An &#8216;analytic&#8217; will mention the particular part of the work contained, generally in controlled form.
Now, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=995&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>So, okay, here&#8217;s another puzzle for the catalogers.</p>
<p>A 700 (or 7xx in general) could be an &#8216;analytic&#8217;, representing one element that&#8217;s the contents of the item cataloged. OR could just represent a contributor (who isn&#8217;t &#8216;main entry&#8217;) to the work. An &#8216;analytic&#8217; will mention the particular part of the work contained, generally in controlled form.</p>
<p>Now, I want to treat this differently depending on if it&#8217;s an analytic or not. For instance, just plain contributor names should be listed as &#8216;contributors&#8217;, along with links to collocate on controlled form of name. But if it&#8217;s an analytic, I STILL want to seperate out the person&#8217;s actual name as &#8216;contributor&#8217; (and let you collocate in general just by their name).  But I ALSO  say what part of the work they contributed, and give a link to look up other records for that analytic entry (the part).</p>
<p>So 7xx field have second indicator two. Which oddly gives you two possibilities. You can note that it definitely <em>is</em> an analytic entry. Or you can note that you don&#8217;t know either way. <em>Very strangely</em> there is no way to even note that you definitely know it&#8217;s not! Second indicator blank just means &#8220;no information.&#8221; So it might still be an analytic.</p>
<p>Of course, even if the indicators gave you a way to record that it definitely wasn&#8217;t, no doubt we&#8217;d still have plenty of records whose second indicator gave no information.</p>
<p>So&#8230;.   how can I tell if a 7xx is an &#8216;analytic&#8217; or not?  Can I assume that it&#8217;s an analytic if and only if subfield t is present? Are there any cases where it is an analytic but there&#8217;s no subfield t, or where it&#8217;s not an analytic but there <em>is</em> a subfield t?</p>
<h3>Addendum:</h3>
<p>The 730 field specifically is even worse. I don&#8217;t know if there&#8217;s any way for me to tell if it&#8217;s an analytic or not?  I mean, if second indicator is 2, it is. And if second indicator is blank&#8230; absolutely no way to tell.</p>
<p>What the heck could a 730 be other than an analytic? Anyone have examples?</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/995/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/995/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/995/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/995/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/995/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/995/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/995/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/995/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/995/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/995/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=995&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/09/28/more-marc-issues-700/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>Principle of avoiding &#8220;false promises&#8221; in interfaces</title>
		<link>http://bibwild.wordpress.com/2009/09/24/principle-of-avoiding-false-promises-in-interfaces/</link>
		<comments>http://bibwild.wordpress.com/2009/09/24/principle-of-avoiding-false-promises-in-interfaces/#comments</comments>
		<pubDate>Thu, 24 Sep 2009 15:53:00 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=992</guid>
		<description><![CDATA[So lately I keep thinking about this idea I think of as a &#8220;false promise&#8221; in a user interface.  Not sure if other people already recognize this and refer to the concept by some other label, let me know if you know they do.
But the idea is that your software shouldn&#8217;t suggest by it&#8217;s input [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=992&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>So lately I keep thinking about this idea I think of as a &#8220;false promise&#8221; in a user interface.  Not sure if other people already recognize this and refer to the concept by some other label, let me know if you know they do.</p>
<p>But the idea is that your software shouldn&#8217;t suggest by it&#8217;s input that it can do something that it really can&#8217;t do at all.   This becomes especially tricky when we&#8217;re dealing with our library data and systems that in fact <em>can&#8217;t</em> do a lot of things.  Some examples will help.</p>
<h3>SFX &#8216;citation linker&#8217; input screen</h3>
<p>SFX by default has a screen that let&#8217;s you input an article citation, and then SFX will try to find links or other information for it.  (I don&#8217;t want to put a link to mine here cause I don&#8217;t want to attract the robots).</p>
<p>Now, to begin with, this is both an annoying process for the user, and an error-prone process for SFX. But I want to draw your attention to two particular fields on that screen: &#8220;Author&#8221; and &#8220;Article Title&#8221;.</p>
<p>The default input screen asks you to input an &#8220;Author&#8221;. However, in (estimating) 95%-99% of cases, SFX can&#8217;t actually <strong>do</strong> anything with that author or title you&#8217;ve input at all. It doesn&#8217;t help SFX find a match, it doesn&#8217;t effect SFX&#8217;s functionality at all.</p>
<p>So our interface implies that the user ought to enter author and title &#8212; a painful and annoying process for the user.  The <strong>&#8220;false promise&#8221; </strong>here, in my opinion, is that this will <strong>do anything at all.</strong> Now, granted, in a tiny minority of cases it will, which is why SFX puts the field there. But that means we&#8217;re making a &#8220;false promise&#8221; in the vast majority of cases, in my opinion. We&#8217;re &#8220;leading the user on.&#8221;</p>
<h3>MARC relator codes</h3>
<p>This might be a better example. So MARC fields for listing controlled authors or other contributors (100 and 700) theoretically allow the data to say particularly what relationship the contributor has to the work at hand. (Author? Editor? Illustrator? Performer on a musical composition? Composer? Wrote a preface?).</p>
<p>Most OPAC interfaces don&#8217;t do much with this. But if you start thinking of what you might want to do, an initial naive approach might be to allow the user to limit a search by these relator codes. Don&#8217;t just give me any record that has Noam Chomsky in any 100 or 700 &#8212; that&#8217;s what our traditional interfaces do, but for prolific people it might give me too much. I really only want books where Noam Chomksy wrote a preface.</p>
<p>So, okay, maybe you go ahead and provide this limit in your search interface.  The problem is that the vast majority of our data doesn&#8217;t have these relator codes. So if you just do a search for Noam Chomksy with relator code for &#8216;wrote a preface&#8217;, you&#8217;re going to miss <strong>most</strong> of the books that Noam Chomsky really <strong>did</strong> write a preface for.</p>
<p>You <strong>might</strong> miss it because Noam Chomsky is in a 700 field with no relator code. Or you might miss it because we don&#8217;t often record people who wrote prefaces at all.</p>
<p>In either case though, I think the interface was making a &#8216;false promise&#8217;, it suggested you could search limiting by role of the contributor, but our data doesn&#8217;t really support that at all. The results are going to be misleading if the user assumes the interface really can do what it suggests it can.</p>
<h3>So?</h3>
<p>So what do you think? Any other examples you can think of of &#8216;false promises&#8217; that our interfaces make?</p>
<p>Identifying the &#8216;false promises&#8217; is easier than fixing them. Usually they are there because of limitations in our software or data that are not easy or cheap to resolve.  If you really get rid of all of the false promises, you have to get rid of much of your functionality!  Or pepper it with disclaimers and limitations that most users won&#8217;t read anyway, and just make us look kind of incompetent if they do. (&#8220;WARNING: You can TRY to search on relator code, but your results will only include a tiny percentage of things that really matched your search.&#8221;)</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/992/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/992/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/992/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/992/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/992/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/992/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/992/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/992/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/992/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/992/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=992&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/09/24/principle-of-avoiding-false-promises-in-interfaces/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>A reasonable display for series data in MARC?</title>
		<link>http://bibwild.wordpress.com/2009/09/24/a-reasonable-display-for-series-data-in-marc/</link>
		<comments>http://bibwild.wordpress.com/2009/09/24/a-reasonable-display-for-series-data-in-marc/#comments</comments>
		<pubDate>Thu, 24 Sep 2009 15:39:15 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=990</guid>
		<description><![CDATA[So I know plenty of catalogers read my blog  (or used to).  Appreciate any feedback or advice you have on this.
Basically, I&#8217;m trying to figure out how to actually do a useful user-friendly display of  &#8217;series&#8217; information from MARC records.
My assumptions
So we have 440, 490, and 8xx.  There&#8217;s a distinction between &#8220;transcribed&#8221; series, and controlled [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=990&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>So I know plenty of catalogers read my blog  (<a href="http://bibwild.wordpress.com/2009/02/02/bibliographic-wildernesss-readers/">or used to</a>).  Appreciate any feedback or advice you have on this.</p>
<p>Basically, I&#8217;m trying to figure out how to actually do a useful user-friendly display of  &#8217;series&#8217; information from MARC records.</p>
<h3>My assumptions</h3>
<p>So we have 440, 490, and 8xx.  There&#8217;s a distinction between &#8220;transcribed&#8221; series, and controlled (aka &#8220;traced&#8221; or &#8220;access point&#8221;).  I know that the controlled data is meant to be used for collocation.  I am assuming that the &#8220;transcribed&#8221; data is better for user display though.  Is this right?   (I&#8217;ll refer to these two concepts as &#8220;displayable&#8221; and &#8220;controlled&#8221;).</p>
<p>So if we&#8217;ve got a 440, then that is both displayable and controlled.</p>
<p>But current practice going forward is not to use 440, but instead to use a 490 for displayable, and a 8xx for controlled.</p>
<h3>So what should the interface do?</h3>
<p>So thinking about an individual record display. I can&#8217;t just list all 440, 490, and 8xx fields under &#8220;Series&#8221;, because in the case of 490/8xx, that&#8217;ll lead to me displaying the <em>same</em> series twice. Once in transcribed form, and once in controlled form. This is confusing and doesn&#8217;t make sense.</p>
<p>So what I&#8217;m thinking is that for a 490/8xx pair, I actually display the 490 on the screen &#8212; it&#8217;s the value meant for user-display.  But it&#8217;s clickable, and when you click on it, the <em>search</em> that will be executed is actually on the corresonding 8xx, because that&#8217;s the field meant for collocation.</p>
<p>This is assuming there is a corresponding 8xx. If there&#8217;s not, it&#8217;s somewhat simpler. We display the 490, and either it&#8217;s not click-searchable at all, or if it is, it searches an uncontrolled series index of all 490s, it doesn&#8217;t actually try to collocate on a controlled field, cause we don&#8217;t have one.</p>
<p>Does this make sense?  Am I missing something?</p>
<h3>But the problem</h3>
<p>But there&#8217;s still a problem here. A record can theoretically belong to multiple series.  Meaning it could have multiple 490s.  Each of which may or may not have a controlled 8xx corresponding to it.</p>
<p>As far as I can tell, there&#8217;s no way to tell <strong>which</strong> 8xx goes with <strong>which </strong>490. Especially since a 490 may or may not have a corresponding 8xx.</p>
<p>This might not effect very many records, that have multiple series, but it still annoys me to have a known &#8216;bug&#8217;, a known case where things won&#8217;t work right at all.  I&#8217;m not really sure what the heck my code should do if there are multiple 490s.  Am I missing something?</p>
<h3>By the way</h3>
<p>This is one good example of how it&#8217;s somehow difficult or even impossible to get meaningful information out of our AACR2/MARC, despite some people&#8217;s belief to the contrary that it&#8217;s always simple and straightforward.</p>
<p>So&#8230; what the heck should be done with this 440/490/8xx stew?</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/990/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/990/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/990/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/990/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/990/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/990/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=990&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/09/24/a-reasonable-display-for-series-data-in-marc/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>Amazon Windowshop: Serendipitous Browsing Online</title>
		<link>http://bibwild.wordpress.com/2009/09/18/amazon-windowshop-serendipitous-browsing-online/</link>
		<comments>http://bibwild.wordpress.com/2009/09/18/amazon-windowshop-serendipitous-browsing-online/#comments</comments>
		<pubDate>Fri, 18 Sep 2009 18:06:16 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=986</guid>
		<description><![CDATA[Fiacre O&#8217;Duinn alerts us to a kind of interesting interface Amazon provides, which I hadn&#8217;t been aware of before: Amazon Windowshop. 
Fiacre asks if this is what the library catalog should look like.

I wouldn’t want the WHOLE library catalog to look ONLY like that — but I think it could be VERY useful and interesting [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=986&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><a href="http://www.librarybazaar.com/2009/09/17/the-catalogue-of-the-future/">Fiacre O&#8217;Duinn alerts us</a> to a kind of interesting interface Amazon provides, which I hadn&#8217;t been aware of before: <a href="http://www.windowshop.com/">Amazon Windowshop. </a></p>
<p>Fiacre asks if this is what the library catalog should look like.</p>
<div>
<p>I wouldn’t want the WHOLE library catalog to look ONLY like that — but I think it could be VERY useful and interesting to provide a “serendipitous browsing” interface to the catalog (on top of a more traditional type-in-search-get-result list interface) that is along the lines of Amazon windowshop.</p>
<p>Try to replicate the experience of browsing the shelves, but online you get the benefit that you can arrange books in more than one dimension (as amazon windowshop does in two), re-arrange them in different orders (for instance LCC OR DDC OR something else entirely, don’t have to pick just one), and additionally be able to allow unified browsing of a corpus that may be in several different physical locations (including off-site storage) or may be currently checked out but maybe you want to include them in the &#8216;browse&#8217; anyway.</p>
<p>I&#8217;ve been thinking for a while about how to provide such an online serendipitous browse experience, like a physical shelf browse but taking advantage of the unique <a href="http://en.wikipedia.org/wiki/Affordance">affordances</a> offered by the online environment. And I definitely thought (cover) images were a necessary component &#8212; I had been thinking of iTunes coverflow as a model. Amazon Windowshop provides another VERY interesting model to try and steal the best parts of &#8212; whenever I or anyone else can find the time to try and work on it!  Too many cool projects, not enough time. (And replicating Amazon windowshop would take some fancy coding).</div>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/986/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/986/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/986/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/986/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/986/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/986/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/986/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/986/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/986/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/986/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=986&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/09/18/amazon-windowshop-serendipitous-browsing-online/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>Sophisticated item services from Umlaut in Xerxes federated search interface</title>
		<link>http://bibwild.wordpress.com/2009/09/15/umlaut-in-xerxe/</link>
		<comments>http://bibwild.wordpress.com/2009/09/15/umlaut-in-xerxe/#comments</comments>
		<pubDate>Tue, 15 Sep 2009 20:13:29 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=975</guid>
		<description><![CDATA[So, if you try to architect your applications solidly and flexibly, and build in features for integration, and it all works out okay, one of the benefits you get is it&#8217;s pretty easy to combine them.
I&#8217;ve added a feature to the Xerxes federated search tool to add sophisticated item-level information and services that were already [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=975&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>So, if you try to architect your applications solidly and flexibly, and build in features for integration, and it all works out okay, one of the benefits you get is it&#8217;s pretty easy to combine them.</p>
<p>I&#8217;ve added a feature to the <a href="http://code.google.com/p/xerxes-portal/">Xerxes </a>federated search tool to add sophisticated item-level information and services that were already being compiled by our Umlaut installation&#8212; to  Xerxes record-detail pages.</p>
<p>I think this is pretty neat from a sort of &#8217;single business&#8217; perspective of providing consistent services regardless of what tool the user happens to be using.</p>
<p>So now, when you look at an item detail page in Xerxes, you can, right on that page,  see:</p>
<ul>
<li> call numbers and availability</li>
<li>Full text links from SFX, right on the page</li>
<li>Links to &#8220;similar items&#8221; content from Web of Knowledge and Scopus.</li>
<li>links to pre-filled ILL forms, as appropriate.</li>
<li>For monographic content, full text, preview, and &#8217;search inside&#8217; functionality from Amazon, Google, and others.</li>
<li>Other stuff &#8212; whatever happens to be configured in Umlaut, when new stuff is added to Umlaut, it&#8217;ll automatically show up in Xerxes too. (Well, new services of the existing types; if a whole new type/section is added to Umlaut, will take a couple lines of code in Xerxes to add it).</li>
</ul>
<p>This is live in production here now, but you can&#8217;t really see it without a local login. So here&#8217;s some screenshots of Xerxes item detail pages, content from Umlaut circled in red.</p>
<p><a href="http://bibwild.files.wordpress.com/2009/09/book1.png"><img class="alignnone size-medium wp-image-979" title="book" src="http://bibwild.files.wordpress.com/2009/09/book1.png?w=203&#038;h=300" alt="book" width="203" height="300" /></a></p>
<p><a href="http://bibwild.files.wordpress.com/2009/09/article1.png"><img class="alignnone size-medium wp-image-978" title="article" src="http://bibwild.files.wordpress.com/2009/09/article1.png?w=215&#038;h=300" alt="article" width="215" height="300" /></a></p>
<p>It&#8217;s worth noting that this content is inserted on the page by javascript after page load. It can take 1-3 seconds or so to come in (depending on speed Umlaut can do it&#8217;s thing), which you can&#8217;t see in the screenshots. While waiting, you get a spinner and status message. If a user doesn&#8217;t have javascript enabled, this feature won&#8217;t effect their page view at all.</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/975/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=975&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/09/15/umlaut-in-xerxe/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>

		<media:content url="http://bibwild.files.wordpress.com/2009/09/book1.png?w=203" medium="image">
			<media:title type="html">book</media:title>
		</media:content>

		<media:content url="http://bibwild.files.wordpress.com/2009/09/article1.png?w=215" medium="image">
			<media:title type="html">article</media:title>
		</media:content>
	</item>
		<item>
		<title>DLF ils-di dlfexpanded service for Horizon</title>
		<link>http://bibwild.wordpress.com/2009/09/10/dlf-ils-di-dlfexpanded-service-for-horizon/</link>
		<comments>http://bibwild.wordpress.com/2009/09/10/dlf-ils-di-dlfexpanded-service-for-horizon/#comments</comments>
		<pubDate>Thu, 10 Sep 2009 21:00:12 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=972</guid>
		<description><![CDATA[So, I have a servlet (based on initial work from Tod Olson at uchicago, expanded by me) to provide holdings information from Horizon in the DLF ils-di &#8220;dlfexpanded&#8221; format. The servlet code and some documentation is available.
That&#8217;s the short statement. It turns out that you can&#8217;t really just say that without providing some more specifics, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=972&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>So, I have a servlet (based on initial work from Tod Olson at uchicago, expanded by me) to provide holdings information from Horizon in the <a href="http://www.diglib.org/architectures/ilsdi/">DLF ils-di &#8220;dlfexpanded&#8221; format</a>. The <a href="http://code.google.com/p/horizon-holding-info-servlet/">servlet code and some documentation is available</a>.</p>
<p>That&#8217;s the short statement. It turns out that you can&#8217;t really just say that without providing some more specifics, caveats, exceptions, limitations etc. Also it&#8217;s worth adding some interesting observations.</p>
<h3>Motivation</h3>
<p>As we&#8217;ve moving ahead with blacklight, we&#8217;re going to need to have <em>some</em> way to get item holdings information out of Horizon. By &#8220;item holdings information&#8221; I mean &#8220;copy&#8221; information, what items do we have, what are their call numbers, what are their statuses (checked in or out among many others), what are their locations, etc. etc. Everything you&#8217;d need to provide an actual OPAC display telling the users what they need to know about our holdings.</p>
<p><em>A sidenote on terminology: </em>In Horizon there are &#8216;items&#8217;, and sometimes a bib just has &#8216;items&#8217;. But sometimes a big has different sets of items in groups &#8212; this is usually used for serials, or occasionally for multi-volume series.  Horizon confusingly calls this set of items a &#8216;copy&#8217;.   The DLF ils-di report calls it a &#8216;holdingset&#8217;.  I have no idea what your ILS calls it. It&#8217;s a two-level hiearchy, a bib can contain one or more copies/holdingsets which each contains items.  OR a bib can contain one or more items directly, without the intervening copy/holdingset.</p>
<p>And, the way most people are doing this at present (for a variety of reasons) is checking in realtime at point of demand for this info, not trying to index it. So, okay, go with the conventional wisdom. So I need a realtime service to provide this info from Horizon.</p>
<p>But I figure, as long as I&#8217;m doing this, MUCH better to provide the info in some standard format, instead of a custom one. Then, theoretically, the consuming code on the Blacklight end can be written to that standard format, instead of being custom for Horizon.  And my understanding is that the Blacklight team has indeed been thinking/wishing for some standard stuff on the Blacklight end to consume stuff in DLF ils-di format, and/or jangle (which also typically, at the moment, uses the DLF &#8216;dlfexpanded&#8217; format to actually return data in).</p>
<p>So, okay, that makes sense.</p>
<h3>But DLF ils-di format is not a complete spec</h3>
<p>So it turns out once you decide to return data in the <a href="http://www.diglib.org/architectures/ilsdi/schemas/1.1/dlfexpanded.xsd">DLF ils-di &#8220;dlfexpanded&#8221; format</a>, you&#8217;re actually not done deciding what your data is actually going to look like.</p>
<p>The dlfexpanded format is just kind of a coat tree to hang your actual metadata &#8216;coats&#8217; on.  dlfexpanded lets you give a list of itemIDs and say they belong to a bib; it lets you give a list of holdingsets and say which itemIDs belong to them. Good so far. But to actually describe anything else about those items and holdingsets (location, call number, item status, any user-displayable notes, etc), you&#8217;ve got to include additional metadata of your own choosing &#8212; dlfexpanded gives you some hooks that it allows you to hang basically whatever other namespaced (and hopefully specified and standardized) XML you want on.</p>
<p>So figuring out what metadata to actually use to describe everything I wanted about my Items and Copies (aka &#8216;holdingsets&#8217;) took a bit of investigating and thinking.</p>
<h3>simpleavailability</h3>
<p>Sure, I used the dlf:simpleavailability format that dlfexpanded gives you just to say whether something is &#8220;available&#8221; or not (and provide a custom user-displayable string conveying that).</p>
<p>Although I ended up only providing that at the item level. The dls-di report seems to assume the client could ask for &#8216;availability&#8217; at the bib or holdingset level too. But I wasn&#8217;t even sure what the semantics of this should be, and figuring out the code to this without impacting performance (more on performance later) was tricky. So, okay, the client can look at the availability on all items and figure out how to sum them up at the bib or copy level itself, if needed (I&#8217;m not sure I&#8217;ll even need to, for my use cases).</p>
<p>But I want to say a lot more about my Items and Copies than simpleavailability. I want to include enough data that my complete OPAC screen could be replicated by third party software.</p>
<h3>mfhd</h3>
<p>So after hunting around for available &#8217;standard&#8217; options, I settled on good old MFHD &#8212; expressed in marc-xml.   I considered the new fangled &#8220;ISO Holdings&#8221;, but limited public documentation is available, and from looking at the schema that is available, it didn&#8217;t look like ISO Holdings would let me express anything that MFHD didn&#8217;t. Sure, MFHD is kind of a bear for the developer to work with, with all those opaque numeric codes, but oh well, went with the known evil, MFHD.</p>
<p>Except I&#8217;m not really using mfhd as is typical. I use <em>just enough</em> of it to express what I want.  I include kind of a dummy &#8216;leader&#8217; just for the sake of appearances, since there&#8217;s nothing in the leader I actually need. In standard MFHD usage, you would rarely (never?) have an individual MFHD record just for an item, but the dlfexpanded &#8220;coat tree&#8221; gives me hooks to hang MFHDs for individual items, and that makes it a lot more convenient to express and retrieve things unambigously, so why not. So anyway, it&#8217;s MFHD, but I&#8217;m not neccesarily saying any existing MFHD-processing tools will be able to do much with it, I&#8217;m using it so unusually (although not illegally in any way as far as I can tell). Oh well, at least it&#8217;s a standard format.</p>
<p>Interestingly, while MFHD theoretically lets you express serial run statements in a machine readable form&#8230;  A) I don&#8217;t have that info in my ILS anyway, and B) that machine readability in the way mfhd has you express it is a lot more theoretical than practical.  So I&#8217;m not doing that.  If my ILS had the data, I&#8217;d probably express it in the more straightforward <a href="http://www.editeur.org/18/Current-Releases/#Coverage%20statement">ONIX Serial Coverage Statement</a> instead of MFHD.  (Note to ONIX people &#8212; why oh why do you only provide the actual schema in a zip file online? You used to provide it individually. Very inconvenient.)</p>
<h3>But wait, there&#8217;s more</h3>
<p>But to completely express all the data I&#8217;d need to duplicate my OPAC display in external software, mfhd still didn&#8217;t quite do it for me. Mostly, I wanted more internal ILS codes.  mfhd lets me express &#8216;location&#8217; and &#8216;collection&#8217; as user-presentable strings, but I want to reveal my internal non-mutable codes for these too. mfhd doesn&#8217;t let me express the concept of &#8216;item type&#8217; that&#8217;s in my catalog at all!</p>
<p>So after looking around some more for something to do that, I gave up and just created my own very simple XML schema to do it, which I&#8217;m calling <a href="http://code.google.com/p/ils-holdings-schema/">&#8220;ILS holdings schema&#8221;</a> for expressing internal codes and such, in case you want to.</p>
<h3>And one more plug for DAIA</h3>
<p>And as I alluded to <a href="http://bibwild.wordpress.com/2009/09/02/daia-and-ils-complexity/">my last post</a>, I&#8217;m using DAIA too &#8212; at this point solely to expose the URL that can be accessed to issue a &#8216;request&#8217; for the item through HIP.  This is a bit against the spirit of DAIA, since exactly what a &#8216;request&#8217; will do is unclear [recall a checked out item, or only add you to a hold list?  Let you check it out, or only request it to be provided in the special collections reading room?  Deliver it to a circ desk, or actually to your office (as we provide to some people). Who knows!]</p>
<p>And worse, I&#8217;m not able to actually pre-check if &#8216;request&#8217; <em>really</em> is available or not, for reasons discussed in the last post.  Which is really against the spirit of DAIA.</p>
<p>But oh well, it was such a nice little schema for simply revealing a URL for a service, and my OPAC &#8216;request&#8217; feature is a service&#8230; so I used it.</p>
<p>At some later point I hope to go back and make a real nice DAIA response, but it&#8217;ll be a buncha work, which isn&#8217;t required by the specs of the project I&#8217;m working on presently.</p>
<p>Oh, and I only provide DAIA at the item-level too, not at the Copy or Bib level. (I think some people&#8217;s Horizon setups actually do allow Requests at the Copy or Bib level, but not ours, so I couldn&#8217;t quite figure out how it should/would work and didn&#8217;t have time for it).</p>
<h2>Performance Issues</h2>
<p>So I think the servlet is <em>reasonably</em> fast, but the trickwhen you&#8217;re developing an API that&#8217;s going to be used by other software is&#8230; &#8220;reasonable&#8221; gets a lot less forgiving. I mean, let&#8217;s say there&#8217;s a search result &#8216;hit list&#8217; with 20 hits on it &#8212; my software might want to call this API 20 times for one web page!  A 0.2 second response time might be pretty good for a user-facing web app, but not for an API that needs to be called 20 times to deliver one page to the user.</p>
<p>So I might have some speed issues, that theoretically I can optimize to some extent. (Although I&#8217;m not looking forward to it. Java is not my specialty. If I had to do it over again, not sure I would have done this in Java, although it made sense at the time for several reasons. And if I were going to do it in Java, I think I&#8217;d want to use a framework of some kind, not do it with the pretty low-level stuff that JDBC and Servlet APIs alone give you. But that would result in it&#8217;s own trade-offs.)</p>
<p>But perhaps worse than the speed issues are some response size issues. I took a look at the response for a bib I knew would have a lot of items &#8212; JAMA, with dozens of holdingsets and hundreds or more items. The dlfexpanded response was 1.2 megs!  That might be an issue for sending accross the network, loading into memory, and parsing the XML on the client side.</p>
<p>It&#8217;s so large in part because there&#8217;s some redundancy in the multiple metadata formats we use to express everything.  A basic schema-less ad hoc uchicago-created XML response for the same data is only 220k. Which is still pretty big.</p>
<p>So, I provided some extra query parameters (not specified in dlf ils-di of course) to allow the client to limit the data returned, if it doesn&#8217;t really need all of it. The client can choose which metadata payloads it wants for items or copies, instead of taking all of them. And the client can choose NOT to have items included in a response that includes copies, just to include the copy information, and let the client ask for the item info later if it needs it.</p>
<p>We will see how it goes.</p>
<h2>Standard or not? Workable or not?</h2>
<p>So, okay, I&#8217;m providing my info in the DLF ils-di &#8216;dlfexpanded&#8217; format, but how standard is it?  If someone says &#8220;Oh yeah, I have code that can consume dlfexpanded&#8221;, does that mean it will automatically work with my (or anyone elses!) dlfexpanded info?</p>
<p>Doubtful.  You&#8217;ve got your choice of metadata payloads to hang on that &#8216;coat tree&#8217;, and everyone can choose different things. Even once you&#8217;ve chosen, two people providing the same ones may be using them slightly differently (as evidenced by a few choices I had to make here and there with how to use mfhd).</p>
<p>On top of that, for performance related reasons, or to fit &#8216;dlfexpanded&#8217; into the actual use cases I have (which go beyond simple DLF &#8220;getAvailability&#8221;), my dlfexpanded responses sometimes don&#8217;t include everything &#8212; just because there are no &#8216;items&#8217; listed in the response doesn&#8217;t necessarily mean there are no items, they might have been suppressed based on the request parameters for performance. And, those request parameters are non-standard, but I think (at least for my use cases), the client is really going to need to use them to avoid a performance nightmare.</p>
<p>Or, if you asked my API for info on a certain item, you get a dlfexpanded response that <em>only</em> has that item in it, not all the other items belonging to the same bib, which may or may not be misleading or confusing to the consumer.</p>
<p>Meanwhile, I&#8217;ve only written the <em>producer</em> end of things so far, I haven&#8217;t even written the consumer. When I get around to writing the consumer, I&#8217;m probably going to run into even more tricks and problems requiring me to go back and revise, including but not limited to performance stuff.</p>
<p>So we&#8217;ll see. I don&#8217;t blame the DLF ils-di task force for this; they did a great job. But we make the map as we tread the path, there&#8217;s no way to map out everything without actually trying it in practice first, and trying it in a bunch of different use cases and scenarios to abstract out the commonalities.  So, we&#8217;re figuring it out as we go, that&#8217;s the only way to do it, and the ils-di task force wisely recognized that and didn&#8217;t try to map everything out in advance.</p>
<p>Still, it means this stuff is trickier than it might originally seem. The specs, standards, and best practices are not &#8220;done&#8221;, not even close.  We&#8217;ve got to figure out a bunch of stuff.</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/972/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/972/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/972/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/972/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/972/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/972/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/972/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/972/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/972/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/972/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=972&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/09/10/dlf-ils-di-dlfexpanded-service-for-horizon/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>DAIA and ILS complexity</title>
		<link>http://bibwild.wordpress.com/2009/09/02/daia-and-ils-complexity/</link>
		<comments>http://bibwild.wordpress.com/2009/09/02/daia-and-ils-complexity/#comments</comments>
		<pubDate>Wed, 02 Sep 2009 18:21:31 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=970</guid>
		<description><![CDATA[So DAIA is a nice little response format-slash-API specification from Jakob Voss.
It&#8217;s focused on a very specific goal: describing what services are available for a given item, possibly providing URLs to access that service for a given item, telling the user how long they&#8217;ll have to wait to get that service, etc.
Some more specific scenarios [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=970&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>So <a href="http://www.gbv.de/wikis/cls/DAIA_-_Document_Availability_Information_API">DAIA </a>is a nice little response format-slash-API specification from Jakob Voss.</p>
<p>It&#8217;s focused on a very specific goal: describing what services are available for a given item, possibly providing URLs to access that service for a given item, telling the user how long they&#8217;ll have to wait to get that service, etc.</p>
<p>Some more specific scenarios mapped to my library might make things more clear. For a given item and user, that user might be able to:</p>
<ul>
<li>Look at the item in the library. Which they might be able to do immediately (upon finding it in the stacks), or there might be a 1 or 2 business day delay because it&#8217;s in some kind of closed stacks or offsite storage, and they&#8217;re going to have to request it.
<ul>
<li>OR, there might be a longer delay, because the item is currently checked out, and they&#8217;re going to have to wait until it comes back &#8212; or maybe they have &#8216;recall&#8217; privileges, and there&#8217;s still a delay, but shorter!</li>
</ul>
</li>
<li>Check the book out?  Again, maybe they can, or maybe they can&#8217;t at all. If they can, maybe they&#8217;re going to have to first &#8216;recall&#8217; it (if they&#8217;re allowed to), with a longer delay.</li>
<li>Request the book for delivery to a circ desk?  Related to recall/checkout, but in rare cases they might be able to request delivery to a circ desk, but only view it in library! And there are cases where they might be able to check it out, but NOT request delivery.  Or where they can request delivery, but they won&#8217;t get it until the book comes back on it&#8217;s own, they have no &#8216;recall&#8217; privileges.</li>
</ul>
<p>Now, the answers to these questions, once determined, are easily expressible in DAIA, no problem.</p>
<p>The problem is, as the complicated foregoing discussion may have hinted, that <em>determining</em> the answers to these questions from our ILS is enormously complex. All the info is in the ILS somehow. In the end, either the ILS is going to allow a &#8216;request&#8217; or a &#8216;loan&#8217; or a &#8216;recall&#8217;, or it&#8217;s not.  And there&#8217;s info in the ILS to let us predict what&#8217;s going to happen, and estimate how long it&#8217;ll take until the user gets access (as DAIA allows us to express once we&#8217;ve figured it out).  It&#8217;s all there somehow &#8212; but trying to figure out how to actually predict it, oh boy, I get confused really quick. There are <em>dozens</em> of different tables I need to consult in the ILS, and figure out how they interact and which takes priority or overrides which other.  Privileges can be set on item statuses, locations, groups, etc. Borrower statuses, groups, types, etc. And they are not set, in my ILS Horizon, in only one place, but in dozens of different places with different semantics that all interact in ill-defined ways.</p>
<p>Phew.</p>
<p>It seems like something a user would expect, in this day and age, that when they look up a book the listing could actually TELL them if they can check the book out (and how long they&#8217;ll have to wait to get it, if there&#8217;s a recall involved, etc), if they can view it in the library, if they can request it for delivery, etc.  Our ILS is currently incapable of doing that &#8212; to the extent that it even always displays a &#8216;request&#8217; button, and the user has to actually click on it to find out if they actually <em>can</em> make a request or not.  Which is generally the only way a user can find out what services are available, by <em>trying</em> them.  Which depending on the service may or may not be able to be done over the web (can you look at it in the library? Who knows unless you go there and try. Or call a librarian and hope they aren&#8217;t as confused as I am!).  You want to know how long you&#8217;re probably going to have to wait to get it?  Too bad.</p>
<p>At first I optimistically thought I could calculate all this stuff from the ILS, deliver it in DAIA, and then use it in new interfaces to actually tell the users what they&#8217;re going to want to know. DAIA is quite up to it.  But writing code to actually calculate these things &#8212; very non-trivial.  Not so happy with Horizon right now.</p>
<p>Anyone reading this know about the open source ILS&#8217;s?  Would this be easier in any of them?</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/970/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/970/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/970/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/970/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/970/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/970/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=970&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/09/02/daia-and-ils-complexity/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>back at work</title>
		<link>http://bibwild.wordpress.com/2009/09/01/back-at-work/</link>
		<comments>http://bibwild.wordpress.com/2009/09/01/back-at-work/#comments</comments>
		<pubDate>Tue, 01 Sep 2009 13:32:34 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=967</guid>
		<description><![CDATA[I have returned from my leave of absence, and am back at work.
Posted in General       <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=967&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>I have returned from my leave of absence, and am back at work.</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/967/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/967/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/967/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/967/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/967/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/967/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/967/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/967/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/967/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/967/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=967&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/09/01/back-at-work/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>maps, territories, and discretion</title>
		<link>http://bibwild.wordpress.com/2009/08/04/maps-territories-and-discretion/</link>
		<comments>http://bibwild.wordpress.com/2009/08/04/maps-territories-and-discretion/#comments</comments>
		<pubDate>Tue, 04 Aug 2009 14:47:43 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=953</guid>
		<description><![CDATA[Lorcan Dempsey mentions in passing that decisions of which manifestations belong to the same work set is&#8221;discretionary at the edges&#8221;:
A note on &#8216;discretionary&#8217;. We cluster stuff based on aggregate cataloger choices. I like Tim Spalding&#8217;s characterization of the &#8216;cocktail party test&#8217; in a blog entry about works and LibraryThing.
Regarding &#8216;discretionary&#8217;, I think this is exactly [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=953&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><a href="http://orweblog.oclc.org/archives/001992.html">Lorcan Dempsey mentions in passing</a> that decisions of which manifestations belong to the same work set is&#8221;discretionary at the edges&#8221;:</p>
<blockquote><p>A note on &#8216;discretionary&#8217;. We cluster stuff based on aggregate cataloger choices. I like Tim Spalding&#8217;s characterization of the &#8216;cocktail party test&#8217; in a <a href="http://www.librarything.com/blog/2008/05/works-editions-isbns-and-cocktails.php">blog entry</a> about works and LibraryThing.</p></blockquote>
<p>Regarding &#8216;discretionary&#8217;, I think this is exactly right. It&#8217;s important to note that the &#8216;work set&#8217; is a subjective and contextual choice, not some objective piece of data waiting to be discovered. But that doesn&#8217;t mean it&#8217;s useless, it&#8217;s very important because (in Western culture at least?), the concept of &#8216;work&#8217; exists, and is of value to users &#8212; it&#8217;s socially constructed, its got grey edges, reasonable people may disagree in edge cases, but that doesn&#8217;t mean it doesn&#8217;t exist or isn&#8217;t useful!  (For instance, when a patron comes in asking for a copy of Hamlet, try telling her &#8220;Sorry, there&#8217;s no such thing as &#8216;Hamlet&#8217;, please come back when you can tell me a particular edition published in a particular place at a particular year that you want.&#8221;  Ha!)</p>
<p>The <a href="http://archive.ifla.org/VII/s13/frbr/frbr_current3.htm#3.2">FRBR report says</a>: &#8220;The concept of what constitutes a work and where the line of demarcation lies between one work and another may in fact be viewed differently from one culture to another.&#8221;  Quite right.</p>
<p><strong>(This element of &#8216;discretion&#8217; is present to some extent in ALL models of reality &#8212; and our bibliographic description is indeed a model. </strong>And <em>always has been</em>, even if it hasn&#8217;t always been formalized, even if it&#8217;s been based on traditional implicit shared understanding, not spelled out. <strong>The &#8216;map&#8217; is never the &#8216;territory&#8217;, just a useful abstraction/approximation, with certain discretionary choices made as to be useful to a certain community/context).</strong></p>
<p>So traditional cataloging tries to make these work distinctions (to the extent that they are <em>implied</em> in AACR2 choices like &#8216;uniform title&#8217;, and hopefully more explicit in RDA, but I can&#8217;t say for sure) by setting out precise instructions meant to result in choices that match that cultural determination of &#8216;work&#8217;.</p>
<p>LibraryThing tries to do it instead by just relying on members of that culture using their intution, and averaging out everyone&#8217;s choices and relying on them to reach consensus through discussion.</p>
<p>Neither is more &#8216;correct&#8217;, and neither is more &#8216;FRBR&#8217;, just two different approaches to trying to create a collective decision about work sets that is useful to users.  Both are discretionary and subjective.</p>
<p>Sometimes this is hard for those in the library community to understand; we seem to attract people who want there to be a &#8216;right&#8217; and &#8216;wrong&#8217; answer, who want &#8220;but is it REALLY the same work?&#8221; to be based on some kind of objective reality with an absolute discoverable &#8216;correct&#8217; answer.  Sorry, that&#8217;s just not reality, modelling reality is inherently full of discretionary choices. But we can, as we traditionally do in library cataloging, set out rules and guidelines to make it as likely as possible that different people at different places will make the same choices.</p>
<p>But we really ought to explain <em>why</em> those rules and guidelines are what they are, and what the <em>goal</em> is. In order to better allow catalogers to use their professional judgement to make the choices most likely to accomplish that goal.</p>
<p>AACR2 rules that kind of sort of provide guidelines for making &#8216;work set&#8217; decisions, but couch them in terms of simple <em>orthographic</em> decisions for &#8216;uniform title&#8217;, without ever even mentioning that this is <em>really</em> a decision about work sets &#8212; gee, it definitely doesn&#8217;t help us understand what we&#8217;re doing, and try to do it to serve the users as well as possible.  (It also doesn&#8217;t help that &#8216;uniform title&#8217; is meant to express (at least) several other things in addition to &#8216;work set&#8217; &#8212; we really need to record the &#8216;is in work set X&#8217; decision as it&#8217;s own discrete reconstructable data element.) Anyone know if RDA improves on any of this? I still haven&#8217;t had the fortitude to try and make it through the RDA draft.</p>
<h2>Work-centric or manifestation-centric display?</h2>
<p>Lorcan also points out that:</p>
<blockquote><p>Interestingly, Goodreads and LibraryThing seem to default to a work-based view: the entry is at the work level&#8230;   Amazon seems to default to a particular &#8216;manifestation&#8217; or &#8216;expression&#8217;&#8230; Google Books seems to do something similar&#8230;. Worldcat.org is more like Amazon and Google. At the moment, it aims to show the most highly held member of a work set in a result, and then link to other editions from that&#8230;</p></blockquote>
<p>I&#8217;d be interested if Worldcat is considering trying to make a &#8216;work&#8217; view the default &#8216;landing page&#8217; from a search, a bit more like LibraryThing. I suspect this would actually be of more general use than the library legacy practice of always showing individual manifestations as search &#8216;landing pages&#8217;.</p>
<p>Lorcan says: &#8220;There are reasons for taking these various approaches and each service make decisions based on what it is trying to do, and the view it takes of its user interests.&#8221;  Certainly true as far as it goes &#8212; but I&#8217;ve never seen a written out clear analysis of what the reasons for the traditional library manifestation-centered display are, what they are trying to accomplish, what user interests we believe they are meeting.</p>
<p>I suspect that in fact this choice isn&#8217;t based on any actual clearly thought attempt to meet certain user interests &#8212; but instead just because we&#8217;ve always done it that way. Because in the card catalog world it was impractical to do otherwise. And in the online world, it takes a bit more work to do otherwise.  Not because doing it this way actually is necessarily optimal for meeting identified user interests.</p>
<p>Please note that I&#8217;m not saying that we should &#8216;catalog at the work level&#8217;, whatever that would mean. Our cataloging practices certainly still need to describe manifestations, and there need to be different records for different manifestations. (On the other hand, something like subject cataloging probably <em>is</em> best done once at the work-level, not duplicated effort for every manifestation.) But a work-centric <em>display </em>can still be provided &#8212; <strong><em>if</em></strong><em> </em>there is sufficient data recorded to allow software to reconstruct cataloger decisions about work-set groupings!  Current practice makes this difficult.</p>
<p>(And note that&#8217;s why Amazon and Google don&#8217;t have workset-centric displays. They don&#8217;t have the data to do it! Even Google&#8217;s vaunted algorithmic prowess can&#8217;t, apparently, determine work set groupings reliably enough to make a work-centric display. At least not within the resources Google is willing to throw at the problem. LibraryThing can do it because of volunteer human labor!  Library cataloging theoretically relies on such human labor, and we certainly spend an enormous amount of person hours in such labor &#8212; but don&#8217;t actually capture the fruit of that labor in unambiguous enough form to make it easy for the software to take advantage of. Shame on us.)</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/953/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/953/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/953/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/953/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/953/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/953/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/953/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/953/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/953/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/953/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=953&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/08/04/maps-territories-and-discretion/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>on leave</title>
		<link>http://bibwild.wordpress.com/2009/07/31/on-leave/</link>
		<comments>http://bibwild.wordpress.com/2009/07/31/on-leave/#comments</comments>
		<pubDate>Fri, 31 Jul 2009 22:06:46 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=951</guid>
		<description><![CDATA[I will be on a leave of absence from work for the entire month of August, getting some much needed rejuvenation, hopefully coming back to work renergized. heh.
I will have spotty internet access throughout the month of August. If you need to get in touch with me, better off leaving a comment here (which will [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=951&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>I will be on a leave of absence from work for the entire month of August, getting some much needed rejuvenation, hopefully coming back to work renergized. heh.</p>
<p>I will have spotty internet access throughout the month of August. If you need to get in touch with me, better off leaving a comment here (which will end up notifying me at my personal email address, which I&#8217;ll check spottily but occasionally), then sending to my work address (which I probably won&#8217;t check at all). Feel free to leave a comment asking me to get in touch with you (just don&#8217;t get upset if you don&#8217;t hear back for a while!); the email address you enter in your blog comment will be visible to me.</p>
<p>See you all in September if not before then!</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/951/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/951/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/951/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/951/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/951/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/951/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/951/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/951/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/951/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/951/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=951&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/07/31/on-leave/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>exposing holdings in dlf ils-di standard format web service</title>
		<link>http://bibwild.wordpress.com/2009/07/31/exposing-holdings-in-dlf-ils-di-standard-format-web-service/</link>
		<comments>http://bibwild.wordpress.com/2009/07/31/exposing-holdings-in-dlf-ils-di-standard-format-web-service/#comments</comments>
		<pubDate>Fri, 31 Jul 2009 21:03:13 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=947</guid>
		<description><![CDATA[So, as we move toward Blacklight implementation, I needed some way to expose item/holdings details from my Horizon ILS so they could be consumed for display (and/or indexing) in Blacklight.
I figured, as long as I&#8217;m doing this, I might as well do it in some kind of standard (rather than custom ad-hoc) format, so the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=947&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>So, as we move toward Blacklight implementation, I needed some way to expose item/holdings details from my Horizon ILS so they could be consumed for display (and/or indexing) in Blacklight.</p>
<p>I figured, as long as I&#8217;m doing this, I might as well do it in some kind of standard (rather than custom ad-hoc) format, so the consumer on the Blacklight end could be standard-ish. And it could be possibly re-used by others, or re-used by ourselves if we switch ILSs, we&#8217;d just need to write the provider end on the ILS end, and could keep the same consumer.</p>
<p>So looking around for standard formats, the <a href="http://www.diglib.org/architectures/ilsdi/">DLF ILS-di format</a> (<a href="http://www.diglib.org/architectures/ilsdi/schemas/1.1/dlfexpanded.xsd">xsd schema</a>) seemed pretty suitable, designed for just this task.</p>
<p>So, thankfully, Horizon actually keeps all it&#8217;s info in a fairly well normalized rdbms that you can access directly, making this not too hard a task. On top of that. So, those fine folks at uchicago already had a little extension to Horizon to provide the item information in their own custom ad-hoc format, which they kindly shared with me. So I took that, and modified it to produce in ils-di format.</p>
<h3>Metadata formats</h3>
<p>Now, the thing about the ils-di format.  It gives you a sort of skeleton to hang your info on. You can list items. You can list what ils-di calls &#8216;holdingsets&#8217; (and Horizon confusingly calls &#8216;copies&#8217;, and I don&#8217;t know what your ILS calls them &#8212; a group of related items, like all the bound volumes of a particular bib; or the multiple volumes of a multi-volume set).  You can express which items are in which holdingsets.</p>
<p>This is all great, because there wasn&#8217;t a simple standard format to do that in before.  But when you actually want to say something <em>about</em> the holdingsets or items, dls-di just gives you a slot to put some other (hopefully standard) metadata format in.  ( With<em> one</em> exception &#8212; isl-di gives you &#8220;<em>SimpleAvailability</em>&#8221; to describe a human-displayable label, and one of four coded SimpleAvailability statuses to describe item availability/status.  This was wise of them, because there was no good way to provide status from a standard vocabulary without this.)</p>
<p>Now, I think ils-di is exactly right to do things this way. Break the problem into manageable chunks, solve one chunk with a solution meant to do one thing well, and make sure your solution can be &#8216;loosely coupled&#8217; with other solutions meant to solve the other parts. Fine, good show.</p>
<p>But that still leaves me to figure out how to actually describe what I want to describe, using what XML schemas, standardized if possible. (And leaves the community to arrive at a standard set of these extra schemas at a later date, if we want to write software that really is &#8216;plug and play&#8217; with each other. Oh well, that&#8217;s how it goes, better to try some things and define &#8216;best practices&#8217; and standards off of what works well, then to try and &#8217;standardize&#8217; before trying in the wild.).</p>
<h3>All my stuff</h3>
<p>So what&#8217;s all the data elements I have that I want to describe somehow, in these extra metadata packages embedded in dlf-di?</p>
<p>Well, you can see them right here in uchicago&#8217;s custom ad hoc format, what their servlet did out of the box, with this example of a <a href="http://hip-dev.mse.jhu.edu/bib/418855">moderately complex serials record</a>:</p>
<p><a href="http://hip-dev.mse.jhu.edu/items/bib/418855.uchicago">http://hip-dev.mse.jhu.edu/items/bib/418855.uchicago</a></p>
<p>So, okay, where to put it?  Well, bibIDs and itemIDs are already in the dlf-schema itself.  So what else do we have?  <a href="http://www.loc.gov/marc/holdings/echdhome.html">Marc Format for Holdings Data</a> in MarcXML seems likely.  Maybe ISO Holdings?  Maybe NCIP?</p>
<p>I started with MFHD in marcxml, because NCIP confuses me (and everyone else), and ISO Holdings you need to pay a couple hundred bucks to look at the standard (although you can see the <a href="http://www.loc.gov/standards/iso20775/ISOholdings_V1.0.xsd">.xsd schema alone</a> for free).</p>
<p>So in MFHD you can put a lot of stuff actually.  Although it&#8217;s somewhat confusing to look at, since it uses those obscure marc tag codes and such. But you can put in there:</p>
<ul>
<li>user-displayable &#8216;location&#8217; and &#8216;collection&#8217; in tag 852</li>
<li>&#8216;holding&#8217; (ie &#8216;holdingset&#8217; ie &#8216;copy&#8217;) identifier in tag 001.</li>
<li>shelfmark (ie call number/copy information) also in 852.</li>
<li>A coded value of whether that call number is LCC, NLM, Dewey, Sudocs, a couple others, or &#8216;other&#8217; or &#8216;unknown&#8217;. 852 indicator 1.</li>
<li>For &#8216;holdingsets&#8217; user-presentable coverage statements (for main run, indexes, or supplements), in 866-888.
<ul>
<li>( Note, if my ILS actually had machine-understandable coverage statements, which it does not, you theoretically maybe <em>could</em> put them in MFHD, but I&#8217;d much prefer <a href="http://www.editeur.org/onixserials/ONIX_Coverage09.html">ONIX Serial Coverage</a>, which I think does it much more elegantly and clearly. But I don&#8217;t have that data available  anyway.)</li>
</ul>
</li>
<li>I think you can provide an un-coded user-presentable item status/availability string somewhere, but SimpleAvailability takes care of that better so I didn&#8217;t worry about.</li>
</ul>
<p>Meanwhile, dlf:SimpleAvailability is handling my need for both a coded and user-displayable item status/availability string, great, one thing done well. (Although I needed to create a mapping from my 109 internal &#8216;item status&#8217; codes to the four dlf:SimpleAvailability values!).</p>
<p>But that still left me with some things I wanted to include.  Well, MFHD gives me user-displayable labels for location and collection. But I really wanted to include my ILS&#8217;s <em>internal codes</em> for location and collection and item status. Why would I want purely local internal codes? Well, because applications I&#8217;m using to consume this can possibly be configured to make use of them even though they are purely local identifiers (especially if I&#8217;m writing the apps myself!).  I also wanted to include &#8216;item type&#8217; as both an internal code and a user-displayable label, and strangely MFHD has no spot for even user-displayable label for that.  Also similarly wanted to expose my internal system &#8220;call number type&#8221; id, which is not always mappable to a standard type in MFHD like LCC or DDC or whatever.</p>
<p>I looked over what documentation I could find for NCIP, as well as the NCIP xml schema, didn&#8217;t seem to have the fields I needed either. I even looked at the ISO Holdings schema without any documentation (my skills at reading raw XML schemas have improved muchly through this project). Nope, not there.</p>
<h3>So, what?</h3>
<p>Ross Singer had an idea that you could do this purely with DublinCore (including refinements in &#8216;dcterms&#8217;) and RDF. That might be possible, but I just couldn&#8217;t figure out how to do it. But really, I don&#8217;t think there are sufficient elements in dc:terms to cover all of those data elements, although Ross found <a href="http://gist.github.com/152182">some clever ways</a> to try and express a few of them (Ross trying to do a bit MORE than I really needed, since he didn&#8217;t want to depend on the dlf-di schema but I&#8217;m just trying to get some metadata I can embed in dlf-di for now, that&#8217;s my use case).</p>
<p>So I guess there&#8217;s theoretically some way to express your <em>own</em> refinements to dcterms?  But I got lost trying to figure that out.</p>
<p>So one way or another,  I figured I was going to define my own vocabulary. I could do it as an <a href="http://www.w3.org/TR/rdf-schema/">RDF Vocabularly</a> alone, but I got confused trying to think about that, and once you go to trying to express that in RDF-XML&#8230; got confused again.  Or I could do it in a custom XML Schema.  If I&#8217;m going to have to create my own vocabulary anyway, XML Schema just seemed simpler, both to produce and to consume. (And it would be easy for me or someone else to convert this to RDF at a later date, starting from a schema.  <a href="http://www.w3.org/TR/rdf-syntax-grammar/">RDF-XML</a> even lets any defined XML namespace pretty much be RDF out of the box, just add a few RDF attributes here or there!).</p>
<p>So custom schema it was. I created (or am in the middle of creating) an awfully simple XML schema for these elements I needed, mostly internal ILS values, and for each one the schema says you can supply one or more (internal or external) identifiers using a child dc:identifier, a user-displayable label using a child dc:title, and if you like a longer-format user-displayable description. (Didn&#8217;t re-use dc:description for this because I really wanted a couple extra attributes there seemed to be no way to add to a dc:description).</p>
<p><a href="http://gist.github.com/159255">Here it is, work in progress.</a> (Not even sure if this validates yet).</p>
<h3>The (not so) final product</h3>
<p>So here it is, the current version of a dlf ils-di document produced live from my (development box) Horizon, including in it&#8217;s metadata payload MFHD in marcxml, dlf:SimpleAvailability, and my custom as yet un-named schema.</p>
<p>See for example this same <a href="http://hip-dev.mse.jhu.edu/bib/418855">moderately complicated serials record</a>:</p>
<p><a href="http://hip-dev.mse.jhu.edu/items/bib/418855">http://hip-dev.mse.jhu.edu/items/bib/418855</a></p>
<h3>Where to next?</h3>
<p>Well, I&#8217;ve got to finish polishing it off, make sure all the XML validate against the schemas, make sure the new schema I created is really valid, etc.  Polish off a few more things.</p>
<p>Then, I&#8217;d like to put this code (derived from uchicago&#8217;s code, with their permission) on Google Code, so that other Horizon institutions can use it to provide dlf ils-di responses from their catalogs, woo.  (I tried to keep the code as generalizable as possible &#8212; for instance, the mapping from your local item status codes to the four dlf:SimpleAvailability values is configurable in a properties file).</p>
<p>I&#8217;ve also got my eyes on <a href="http://www.gbv.de/wikis/cls/Document_Availability_Information_API">DAIA </a>as another metadata schema to include in the dlf ils-di response eventually.  DAIA is focused on doing what SimpleAvailability does, but with more detail: What services are available, and what&#8217;s the URL access points for that service? I need to figure out how to correctly extend DAIA to include services that aren&#8217;t in DAIA&#8217;s built-in four. (I specifically need the service &#8216;get a photocopy of a portion of this item&#8217;, and &#8216;place an ILS request/hold for pickup at circ desk&#8217;, two services we offer that DAIA doesnt&#8217; specify right now).</p>
<p>And Ross tells me what I&#8217;ve done so far has gotten me a lot of the way to a jangle implementation. Great, that was part of the goal, so apparently it succeeded. I&#8217;ll finish off the rest of jangle when I have a use case that demands it, which could be sooner or later! (And first i&#8217;ll need to understand jangle better!).</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/947/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/947/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/947/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/947/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/947/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/947/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/947/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/947/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/947/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/947/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=947&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/07/31/exposing-holdings-in-dlf-ils-di-standard-format-web-service/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>APIs and vendor lock-in</title>
		<link>http://bibwild.wordpress.com/2009/07/23/apis-and-vendor-lock-in/</link>
		<comments>http://bibwild.wordpress.com/2009/07/23/apis-and-vendor-lock-in/#comments</comments>
		<pubDate>Thu, 23 Jul 2009 14:15:06 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/2009/07/23/apis-and-vendor-lock-in/</guid>
		<description><![CDATA[Eric Lease Morgan asks on code4lib:
I heard someplace recently that APIs are the newest form of vendor
lock-in.  What&#8217;s your take?
My reply (expanded a bit from my listserv post):
Standards-Based
When they are custom vendor-specific APIs and not standards-based APIs, they can definitely function that way. I&#8217;m still not sure if even a vendor-specific API is more [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=940&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Eric Lease Morgan<a href="http://www.mail-archive.com/code4lib@listserv.nd.edu/msg05691.html"> asks on code4lib</a>:</p>
<blockquote cite="EC4CD735-E6E9-45CB-8D7C-BDAE6F2BF0C3@nd.edu"><p>I heard someplace recently that APIs are the newest form of vendor<br />
lock-in.  What&#8217;s your take?</p></blockquote>
<p>My reply (expanded a bit from my listserv post):</p>
<h3>Standards-Based</h3>
<p>When they are custom vendor-specific APIs and not standards-based APIs, they can definitely function that way. I&#8217;m still not sure if even a vendor-specific API is more or less lock-in than NOT having an API.  On the one hand, you will start to have software written against the vendor-specific API, that won&#8217;t work without changing it up if you switch vendors.  But on the other hand, with SFX and Umlaut, for instance, Umlaut does so much more than SFX, and the SFX adapter piece is such a small part, that in that case, for us at least, having SFX with an API and Umlaut on top of it it definitely makes it _easier_ for us to switch link resolvers without disrupting our services built on top of it.</p>
<h3>Which we don&#8217;t do well at</h3>
<p>But really, what you want is standards-based APIs, not vendor-specific APIs. That would give you the best of all worlds. There are a couple challenges that keep us from getting there though. One is that the library community, historically, is, well, pretty AWFUL at writing standards.  We come up with standards that don&#8217;t actually accomplish what they were intended to accomplish, are too complicated for anyone to implement right (on either producer or consumer side), and leave so much wiggle room that someone can claim they support the standard but not in a way that any other software will ever understand.  (NCIP anyone?)</p>
<h3>Outside standards?</h3>
<p>So there are a couple ways to try to get better at this. One is definitely looking outside the library world for standards to use. But unlike code4libbers, I don&#8217;t think (from my experience) that&#8217;s always possible or easy.  We have priority problems that, while they are not entirely foreign to the larger world, aren&#8217;t as high a priority for most of the non-library world, meaning they don&#8217;t yet have robust standards solutions. However, especially when standards are extensible (like XML ones often are), you can sometimes start with a general standard and extend it for the library space.</p>
<h3>Standards based on, not preceeding, practice</h3>
<p>Secondly, instead of creating standards before anyone has actually tried solving the problem the standard is meant to solve (as we often seem to do), the BEST standards are created by generalizing/abstracting from existing best practices. A buncha people try it first, you see what works and what doesn&#8217;t, you see what the actual use cases and needs are, you take the best out of what&#8217;s been done, and you standardize it.   But doing it this way means you need to go through a period of vendor/product specific (eg) APIs before you can get to the standard.  The library world is still immature in developing good software infrastructures, we&#8217;re going to need to through some more pain for a while, no way around it.</p>
<h3>Vendor capabilities?</h3>
<p>But another problem in all of this is that vendors may not have the interest OR the in-house expertise to actually provide standards-based APIs.  The APIs we often get now from vendors, frankly, are kind of kludgey, and do not fill me with confidence that the vendor actually has the proper staff or resources allocated to create good standards-based APIs &#8212; which, definitely, takes more time than creating a kludgey vendor-specific one-off.   Or maybe the vendor actually is dis-interested in this because they want lock-in.  Or maybe it&#8217;s just the case that the quality of your APIs doesn&#8217;t effect your sales at all, so it doesn&#8217;t make (short term at least) business sense to do it well.  (Heck, the _presence_ of an API has only just begun to effect sales, but libraries aren&#8217;t good enough at judging how good it is, that even a crappy API is probably &#8216;good enough&#8217; for sales).</p>
<h3>Open source, community work</h3>
<p>One way out of this is definitely open source. We&#8217;ll work out the best practices and standards ourselves, and then we start insisting that vendors follow them.  The DLF-DI API is perhaps one example of an attempt at this, created from a generalization of the experience of library developers.   But the library developer community is also small, and generally fairly in-experienced. Creating APIs is done best by experienced developers who understand what&#8217;s going to make the API useable or not.</p>
<p>But, anyway, one step at a time. I firmly believe that even vendor-specific kludgey APIs are better than no APIs at all &#8212; we learn how to do better by trying.</p>
<h3>Consuming applications</h3>
<p>It&#8217;s also worth pointing out, as some subsequent commenters on that thread did, that the application consuming an API bears some reponsibility here. As much as possible, you need abstract out the API connector code, so you can easily switch the app to use multiple APIs, so long as they all have more or less the same data/capabilities (something which certainly isn&#8217;t guaranteed, admitted).  This too takes more time, but is do-able. Among the software I work on, Umlaut manages to do it pretty well, Xerxes does not.  This is in part because of the more focused and limited function of a link resolver compared to a federated search engine, made it easier to do with Umlaut.  And I guess half of the SFX API more or less is standards-based: OpenURL. </p>
<p>As a result, even though both SFX and Metalib have vendor-specific APIs, our use of the SFX API, in my opinion, lessens our vendor lock-in, while our use of the Metalib API increases it. </p>
<p>In this case, this was mostly due to factors outside our control. But it also can definitely depend on how well you&#8217;ve architected your client code, to abstract out the API connectors. Sometimes I feel like this is heresy in code4lib with it&#8217;s &#8220;just get it done&#8221; ethos, but <b>good, well-architected code matters.</b></p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/940/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=940&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/07/23/apis-and-vendor-lock-in/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>What librarians do</title>
		<link>http://bibwild.wordpress.com/2009/07/01/what-librarians-do/</link>
		<comments>http://bibwild.wordpress.com/2009/07/01/what-librarians-do/#comments</comments>
		<pubDate>Wed, 01 Jul 2009 15:52:23 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=933</guid>
		<description><![CDATA[So I just gave (or co-gave) a presentation here on Umlaut as deployed here as our Find It service.
One of the most exciting parts to me was that various (non-IT)  librarians in the room, un-prompted, starting throwing out ideas of what it could do in the future. Quite good ideas. I had to resist the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=933&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>So I just gave (or co-gave) a presentation here on Umlaut as deployed here as our Find It service.</p>
<p>One of the most exciting parts to me was that various (non-IT)  librarians in the room, un-prompted, starting throwing out ideas of what it <em>could</em> do in the future. Quite good ideas. I had to resist the techies urge to respond to them with &#8220;Well, yeah, but see, that&#8217;s harder than it might seem to make work like that&#8230;&#8221;, and instead try to be encouraging and positive, because it was <em>great</em> to have such a conversation. We hardly ever have such conversations.</p>
<p>Why? I think becuase usually a non-technical librarian has absolutely no way to put such innovative thoughts into practice.  As Karen Schneider <a href="http://www.librarywebchic.net/wordpress/2007/02/28/hurry-up-please-its-time-karen-schneider-keynote/">talked about</a> in her <a href="http://freerangelibrarian.com/2007/03/03/code4lib-keynote-again/">2007 Code4Lib Keynote</a>, libraries have ended up outsourcing a significant part of their core business to vendors,  in a way that we pay for it, and we get it, and we pretty much take what we get.</p>
<p>My experience made me realize today that one of the (many) negative side effects of this is that librarians have lost the opportunity (and thus been implicitly  &#8216;trained&#8217; not to even bother trying) of doing what librarians should be doing in this era when so many of our services are delivered over the web: Figuring out how to make these services meet our users needs better!</p>
<p>Contrary to popular belief, you can&#8217;t just let your users tell you what your services will be. Sure, of course you need to listen to your users. And if you listen and observe very carefully, you can figure out what your users <em>needs</em> are, some of which they may not even be able to articulate themselves, but others of which they most certainly can.  But you can&#8217;t count on your users to identify the best <strong>solutions </strong>to these needs. That&#8217;s what <em>we&#8217;re</em> for, that&#8217;s why we&#8217;re professionals!</p>
<p>And, to me at least, it&#8217;s one of the most most interesting and rewarding parts of our jobs.</p>
<p>But the outsourcing of much of the libraries business to vendors has taken the opportunity to do that away from most of us &#8212; an IT geek like me in a library that let&#8217;s him get away with it still has some. Most non-IT librarians have had it reinforced that they shouldn&#8217;t even bother. And while you have to be an IT type to <em>implement</em> new online services or features, you shouldn&#8217;t have to be one to be engaged in dreaming up and planning them.</p>
<p>One thing open source can do is return this power to us.   I&#8217;m pretty pleased where Umlaut (and my ability to explain it) is finally at the point where it&#8217;s future potential can be seen enough to encourage non-technical librarians to start suggesting &#8220;Hey, but what if it could do <em>this</em> and <em>that</em> to? Wouldn&#8217;t that be great?&#8221;</p>
<p>And, if I can somehow find the time amongst the way too many really great things that I&#8217;d like to do if I had time, maybe soon it will!</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/933/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/933/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/933/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/933/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/933/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/933/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/933/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/933/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/933/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/933/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=933&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/07/01/what-librarians-do/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>cataloging theory really is useful</title>
		<link>http://bibwild.wordpress.com/2009/06/30/cataloging-theory-really-is-useful/</link>
		<comments>http://bibwild.wordpress.com/2009/06/30/cataloging-theory-really-is-useful/#comments</comments>
		<pubDate>Tue, 30 Jun 2009 15:02:44 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=929</guid>
		<description><![CDATA[As much as I&#8217;m sometimes frustrated by our common inherited legacy cataloging practices, I actually do think the cataloging theory developed by Lubetzky, Svenonius, Cutter, and others is still useful &#8212; sometimes you just need to &#8216;translate&#8217; it to the modern environment.
I&#8217;ve been thinking about how having persistent unique identifiers (bib IDs) for our records [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=929&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>As much as I&#8217;m sometimes frustrated by our common inherited legacy cataloging practices, I actually do think the cataloging theory developed by Lubetzky, Svenonius, Cutter, and others is still useful &#8212; sometimes you just need to &#8216;translate&#8217; it to the modern environment.</p>
<p>I&#8217;ve been thinking about how having persistent unique identifiers (bib IDs) for our records is really important &#8212; but not generally prioritized in some of our legacy cataloging practice. There are a bunch of ways to explain why this is important (and it&#8217;s kind of obvious to the CS-perspective-inclined).</p>
<p>But I realized another way goes back to some language used in my cataloging class.  A cataloging record is called a &#8217;surrogate&#8217; for the physical item described. That&#8217;s exactly what it is, even more so in the digital age:  it allows the physical item to be &#8216;projected&#8217; into the digital environment as a digital object which is a &#8217;surrogate&#8217; for the physical object (or sets of objects, depending on context you consider it in) it represents.</p>
<p>Perhaps this helps explain why a persistent bib ID is important using cataloging theory language.  As a surrogate for the physical object in the digital environment, we want to be able to link to the surrogate in different ways &#8212; from simply bookmarking it, to building more complicated &#8217;semantic&#8217; relationships based upon it.  All of that depends on having a persistent identifier &#8212; a persistent bib ID &#8212; for the surrogate.  Changing the bib ID of the surrogate in the digital environment in unpredictable ways would be analagous to periodically changing where the physical item is physically shelved in unpredictable ways!  The internal unique identifier for the surrogate is essentially it&#8217;s digital &#8220;location&#8221;.</p>
<p>[That's a bit of an oversimplification -- giving the digital surrogate a reliable digital 'location' requires some layering on top of the unique internal ID, to give it a unique persistent URI too. But the <em>pre-requisite </em>for that is a persistent unique internal ID.]</p>
<p>[And, incidentally, for the semantic web geeks reading, this gets at some of my dissatisfaction with this focus on 'real world objects' vs 'documents' or whatever they're currently calling the second class. I don't think it's at all a clear distinction, and can often get confusing right quick, and I think it's probably a mistake to rely on such a confusing distinction for crucial parts of your 'specs'.  A cataloging record is a 'web document', surely, but it's also a surrogate (not JUST a 'description') for a real world object.  Sure, we can split hairs and talk about how to handle that. But the fact that it gets so confusing and abstract and hair-splitting and subject to debate worries me and makes me suspicious of relying on such a distinction for describing how to 'do business' in the sem web.]</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/929/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/929/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/929/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/929/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/929/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/929/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/929/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/929/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/929/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/929/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=929&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/06/30/cataloging-theory-really-is-useful/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
		<item>
		<title>NYU goes live with Umlaut</title>
		<link>http://bibwild.wordpress.com/2009/06/29/nyu-goes-live-with-umlaut/</link>
		<comments>http://bibwild.wordpress.com/2009/06/29/nyu-goes-live-with-umlaut/#comments</comments>
		<pubDate>Mon, 29 Jun 2009 15:21:40 +0000</pubDate>
		<dc:creator>jrochkind</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://bibwild.wordpress.com/?p=926</guid>
		<description><![CDATA[NYU has gone live with Umlaut. I&#8217;m holding my breath hoping that nothing will go wrong with their installation that&#8217;s my fault.  
Hi all,
We&#8217;ve deployed Umlaut to our production Primo environment at NYU.
Umlaut is available through the &#8220;GetIt&#8221; link on a search results page at
http://www.bobcat.nyu.edu and is hosted at http://getit.library.nyu.edu
Thanks,
Scot Dalton
Web Development
Division of Libraries
New [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=926&subd=bibwild&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>NYU has gone live with Umlaut. I&#8217;m holding my breath hoping that nothing will go wrong with their installation that&#8217;s my fault. <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<blockquote><p>Hi all,<br />
We&#8217;ve deployed Umlaut to our production Primo environment at NYU.</p>
<p>Umlaut is available through the &#8220;GetIt&#8221; link on a search results page at<br />
<a href="http://www.bobcat.nyu.edu/">http://www.bobcat.nyu.edu</a> and is hosted at <a href="http://getit.library.nyu.edu/">http://getit.library.nyu.edu</a></p>
<p>Thanks,</p>
<p>Scot Dalton<br />
Web Development<br />
Division of Libraries<br />
New York University</p></blockquote>
<p>It&#8217;s interesting to me that they are using Umlaut to work around an exceptionally poor part of Primo&#8217;s user experience &#8212; the page (or really <strong>pages</strong> in a &#8216;tabbed&#8217;  frameset wrapper) that actually gets the user to accessing the document (physical location/availability or electronic availability etc).</p>
<p>Turns out Umlaut is exceptionally well suited to replace this role in Primo, because Primo already well relies/supports calling out to an  OpenURL receiver, and because Umlaut is designed for this kind of &#8216;known item&#8217; and/or &#8216;last mile&#8217; service.  I think (un-humbly) that the mark of a well-thought-out piece of software is when it can serve well in situations that aren&#8217;t exactly like it was designed for.  A &#8216;known item service provider&#8217; is something we needed all along but didn&#8217;t realize it, and once you have one you can find ways to use it I never thought of.  I expect that more Primo customers will become interested in Umlaut.</p>
<p>And, my understanding is that Summon will also rely on sending out an OpenURL for actual local &#8216;last mile&#8217; access, so I predict that Summon customers will similarly be interested in Umlaut.</p>
<p>I hope anyway!  Thanks very much to Scot from NYU for spearheading the Umlaut deployment there;  I have been very impressed by how quickly Scot was able to get things up and running, with little help from me, including writing some new features and plug-ins to talk to Aleph. Although I&#8217;d like to think that the quality of Umlaut&#8217;s code and documentation gets some credit here, Scot has been a pleasure to work with, and I hope he will continue working on Umlaut.</p>
<p>Somewhat oddly from my point of view, NYU has deployed Umlaut only in the context of their Primo OPAC/discovery layer.  Traditional link resolver use still goes right to SFX.  Personally, I think that our users in most of our libraries already have too many different interfaces to deal with, and I place a priority on consolidating and integrating them. Umlaut&#8217;s goal is to serve this role by providing a &#8216;known item last mile&#8217; interface in as many contexts as possible.  But I understand that politically it can be difficult to make big changes at once, and my understanding is that NYU does eventually plan to target Umlaut for traditional link resolver use too.</p>
Posted in General  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/bibwild.wordpress.com/926/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/bibwild.wordpress.com/926/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/bibwild.wordpress.com/926/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/bibwild.wordpress.com/926/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/bibwild.wordpress.com/926/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/bibwild.wordpress.com/926/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/bibwild.wordpress.com/926/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/bibwild.wordpress.com/926/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/bibwild.wordpress.com/926/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/bibwild.wordpress.com/926/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=bibwild.wordpress.com&blog=835412&post=926&subd=bibwild&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://bibwild.wordpress.com/2009/06/29/nyu-goes-live-with-umlaut/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">jrochkind</media:title>
		</media:content>
	</item>
	</channel>
</rss>