OR 2014

Developer Challenge. Winners!

Here are the winners for the Dev Challenge for 2014. This page was put together by various volunteers – if there are any errors, let @ptsefton know. Also – please send us your presentations.

The main Sponsor was the Digital Library Federation, represented at the conference by Rachel Frick.

The judging team was: Rachel Frick (chair) Peter Sefton, Andrew Woods,
Sarah Shreeves,Sebastien Francios, Balviar Notay, Andrea Schweer

1. Archive Space + Fedora + Trello

Fedora 4 Prize Winner ($250 Amazon card)

Chris Fitzpatrick (Lyrasis) and Leila Sterman(Montana State University)

Team 1 created a link between Fedora 4.0 and Archive Space enabling a task-based workflow between multiple people and tools. Including adding an item to the online task/list management system, Trello.

Archive Space does not have the functionality of a repository. This working plugin pushes a file to a repository. The jruby app is in Archive Space with Fedora 4 running alongside. [Fedora Prize]

2. Cut/Paste

Winner (3 Amazon cards, $200 each, $600 total) Julie Allinson, Joonas Kesäniemi & Keith Maull

End users have redundant tasks they need to perform like populating web forms with data from articles, papers, etc.

Team 2 has enabled abstracted information from a doc to be added to a form by automating the extraction of PDF data to map directly to a submission form. This improves the accuracy of extracted data and repopulates web forms.

Grobid for PDF extraction. Proof of concept for a DSpace eThesis.

3. Video characterisation comparative analysis viewer @kvanmalssen

Winner / People’s Choice ( 2 $300 Amazon gift cards, $600)

Kara Van Malssen and Cameron Green

Video characterization analysis viewer

Outputs from analysis tools such as Exiftool and MediaInfo were off by as much as two minutes. Team 3’s solution successfully compares the output from different video analysis tools to give users the ability to do a comparative analysis in order to determine which tool is most accurate for digital preservation. You are only as good as your tool set. See the documentation.

4. The Big Elephant @LuytenBram

Best Air Guitar – Deck of Cards

Bram Luyten

50% of your repository traffic is driven by Google. Do you get the most visibility from your Google interaction? Not getting the maximum exposure of your resources from Google or Google Scholar is what Team 4 set out to solve.


This tool gives you an analysis of how robust your robot.text file is on the web and sends you an email detailed report suggesting where you can improve your script.

5. Data Site-ORCID refresh @work_rob_work

Robert Peters – Laura Paglione

When a third party does not have the resource anymore Data Site-ORCID refreshes it–documented and deployed. Developed in Ruby.

6. Fill my list – FML

Winner ($200 Gift cards, $1000 total)

Adam Field, Claire Knowles, Kim Shepherd, Jiadi Yao, Jared Watts

IDs are useful, you can’t trust people to add verified information to IRs so we wanted to automate using authority services.

Team 6 developed a repositories and authorities broker for classes of data from various sources: Optimize it once for the whole community.

Created an abstract form for each class of data with a plugin for data for an explicit repository platforms.

See the Code, with links to example sites that can find IDs for both people and equipment..

7. Finding Alternative SHERPA/RoMEO Colours @PTMillington

Peter Millington

Everyone uses them and thinks they are out of date. Having the accepted version at time of publication is desirable but rating the stage that the publication is in would be helpful. This system gathers opinions on where the publication is in process from a wide range of interested people and allows them to assign Green, Blue or Yellow indicators.

Presentation here. (Demo coming soon)

8. Docker & Repositories – @_cb_

Chris Beer

Docker is a production-ready lightweight virtual machine. The cool thing is that after you create a docker image it will download and launch a VM immediately. This is much faster than spinning up a Fedora VM, for example.

Makes an immediate instance and gives developers the ability to see how it works in 20 seconds, instant gratification.

9. ORCID in Sufia – @no_reply

Tom Johnson

Quick intro to what it would take to get something running of Sufia. There is no UI so it’s hard to demo right now. Would need another day to see that the Orcid is pulled down.

10. MODSBridge – @eosadler

Winner – Best hack of the dev challenge itself; developing people using software development techniques, creative approach to people challenge. best use of social media to augment team. ( $100 each, $300 total)

Becky Yoose, Bess Sadler, Linda Newman

There is a skills gap between people needed to edit MODS records and those with the ability to perform this task. Many people have a metadata background, but do not have technical skills. The Idea for this training project came from code4lib.

What would the curriculum tool look like if you wanted to give them skills they need to do this? The idea is to bring the philosophy of testing and development into data maintenance.

Like ‘Metadata carpentry’. code is available for lessons for 1 and 2.

Posted in Uncategorized