pazpar2: New Features and Challenges

It has been quiet on the pazpar2 front lately, but I will make amends with this blog post.

Lately I have been working on a Harvester that would harvest into a Local Unified Index (LUI). The LUI has been implemented with Solr.

This means we can implement Integrated Search, which is our name for doing both searching remote targets (meta-searching) and a Local Unified Index (LUI), aka Central Index.

Code4lib 2012

Code4lib 2012, Seattle

I was the lucky winner of the Index Data lottery (no actual lottery took place) to go to Code4lib 2012. I was a (Code4lib) Newbie, so I didn’t really know what to expect, but reading Jakub’s blog about his experiences, it sounded like great fun.

It was also my first time in Seattle, so I did take some extra days on both ends to do some exploring. Arriving on Saturday to sunny and warm weather (15 degrees Celsius warmer than Copenhagen, nice!), Seattle did its best to welcome me.

ID's DPLA Submission

Index Data has submitted a proposal to the Digital Public Library of America (DPLA) beta sprint.

We believe that the DPLA is an important initiative, and we wanted to share our particular perspective on the world of structured data, information retrieval, standards, and open source software. We look forward to following the development of this work in the future.

Position: Scripter/integrator/cataloger

We are looking for a part-time Connectivity/integration specialist – with the possibility of moving to full time over time. Part of our work is to integrate different information systems. This involves a variety of different tasks, including testing Z39.50/SRU targets using our diagnostic tools, building Connectors for websites using our visual integration environment, working with proprietary APIs, and processing data sets in batch mode.

What if it's about the People, Stupid?

Most of the science of Information Retrieval centers around being able to find and rank the right set of documents in response to a given query. We spend much time arguing about technical details like ranking algorithms and the benefits of indexing versus broadcast searching. Every Information Professional I know both deifies and fears Google because they get it right most of the time – enough so that many people tend to assume that whatever pops to the top of a Google search MUST be right, because it’s right there, in the result screen.


Subscribe to Index Data RSS