Monday, January 28, 2008

Federated search bake-off

For anyone in the market for a federated search tool, it can be hard to compare products. As Sol Lederman at the Federated Search Blog notes, it's a challenge "because a number of vendors don’t have publicly available demo applications" or because, "where the demos do exist, it’s hard to compare them because they’re not searching the same sources."

Lederman is proposing to vendors that they allow him to set up demo site for their products on his blog in which each product is linked to the same set of publicly available databases (e.g., Medline, ERIC, etc.) He's asking his readers to submit a list of the databases that should be included in the standard set of ten that all products will work with. I think Lederman has got a great idea. Whether any of the vendors actually participate is an open question. Regardless, it's worth submitting your list of databases as comments on Lederman's original post.

Our library at Baruch College is about to launch a federated search tool using Serial Solutions' 360 Search. I am on the committee that was charged with figuring out how to set up the product; I've learned a lot of ugly truths about how search results are retrieved via federated search and how they are aggregated. Much of the literature in the library field that that I've read talks about the relationship between federated search and usability or federated search and information literacy; I'd like to see more that gets into the nuts and bolts of how well these tools actually work and where they are typically hamstrung by the database vendors, who often give the federated search vendors less than optimal gateways into their data. Having a central place where you could test drive different federated search tools would not only reveal how different interfaces work, it might also show how the connectors that federated search companies build for you are not all created equal.

2 Comments:

At 5:41 PM , Anonymous Paul Pival said...

Hey, thanks for the pointer to the new blog- very timely for me. Do you have any info you can share that led you to choose Serials Solutions?

 
At 9:25 PM , Blogger Stephen Francoeur said...

Paul, I'm afraid I wasn't part of the decision process. If you'd like the deets on what it was really like to set it up, let me know. It was surprising to a newbie to fed search like me how many databases we had that wouldn't work well in that environment. It was also surprising how many databases we had don't give results back via Z39.50 or XML but instead as HTML, which means Serials Solutions is forced to do screen scraping to get results.

I'll say here that we've been having ongoing problems with ProQuest databases in Serials Solutions (some records are retrieved that we can't get via the native interface for ProQuest) and for Gale (Serials Solutions would say in search results, "here's a hit from Business and Company Resource Center," but when you click on the link it takes you to a record in General Business File (a Gale database we don't even have).

Most frustrating for us has been our inability so far to figure out why our Ex Libris Aleph 500 catalog won't work in 360 Search. Other libraries have got it working, and the problem appears to be on our end.

Anyway, there's a ton of issues, fixes, and reality checks that I've had to deal with as I've been on the committee setting up our 360 Search implementation. I'm not sure about this, but I get the sense that the problems we've been having are not necessarily unique to 360 Search.

FYI: to throw a bone to Serials Solutions, the clustering of search results is pretty kewl (though you don't want to try to think to hard while trying to disambiguate the subject clusters that get automatically generated).

 

Post a Comment

Links to this post:

Create a Link

<< Home