Jenn Riley sent MARC work identification algorithm specifications released to the Music Library Association mailing list yesterday:
The Indiana University Variations3 digital music library project has performed a series of experiments designed to maximize the information we can map to our work-based metadata model from MARC Bibliographic and Authority records. The biggest strength of our "batch loading" algorithm is better identification of Works that are represented in bibliographic records, especially in cases where one record represents *multiple* work. The output of our batch loading work is a full mapping from MARC to data that conforms to the current Variations work-based metadata model, although we will be updating our specifications to output fully FRBRized data in the future. The full current algorithm and supporting documents are now available at <http://wiki.dlib.indiana.edu/confluence/x/OQBGBQ>. We hope that even if they aren’t fully FRBRized yet they may be of use to others in the community looking to use MARC records for the basis of FRBRized systems.
The work to migrate these batch loading specifications will be done as part of the newly IMLS-funded Variations/FRBR project at Indiana University. More information on the Variations/FRBR project can be found at <http://www.dlib.indiana.edu/projects/vfrbr/>.
Exciting stuff! I asked Jenn if they had any code to release — the programs they’d written to implement the algorithm — and she said not yet, but they would, along with more details about the XML representation of the data. Work is continuing, and this is definitely something to watch.
(Via Stacy Allison-Cassin.)