MoMA on GitHub

The Museum of Modern Art has followed in the footsteps of Tate and Cooper Hewitt and published its collections data on GitHub.


As I’m currently in the final phase of my PhD, I have to dedicate more time to writing and less to doing. Even so, I can’t let MoMA’s datasets go by unnoticed.

The above screenshot is from a timeline tool I developed for visually analysing large cultural collections. I imported the MoMA dataset and visualised the object records along their production dates. We can see the timeframe the collection spans, with the earliest pieces from the late 1700s and – obviously – a focus on twentieth-century and contemporary items.


The block shape around 1820 and the rectangular spike at 1900 represent large numbers of items that have the same, or very similar, production dates. Such anomalies can stand for series of items in the collection, they can be traces of curatorial decisions in cataloguing, or they could be mistakes in dating.
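As a rough illustration of how such spikes could be flagged outside the timeline tool, here is a minimal Python sketch. It is not the method my tool uses. It assumes each record is a dictionary with a free-text date field (the field name "Date" and the helper names are assumptions for illustration), takes the first four-digit year it finds, and reports years whose record count is far above the median yearly count.

```python
import re
from collections import Counter
from statistics import median

# Matches a four-digit year between 1500 and 2099 inside free text.
YEAR = re.compile(r"\b(1[5-9]\d{2}|20\d{2})\b")


def first_year(date_text):
    """Return the first four-digit year in a free-text date, or None."""
    match = YEAR.search(date_text or "")
    return int(match.group(1)) if match else None


def year_counts(records, field="Date"):
    """Count records per production year; `field` is an assumed column name."""
    years = (first_year(record.get(field)) for record in records)
    return Counter(year for year in years if year is not None)


def spikes(counts, factor=5):
    """Return the years whose count exceeds `factor` times the median count."""
    if not counts:
        return []
    threshold = factor * median(counts.values())
    return sorted(year for year, n in counts.items() if n > threshold)
```

Calling `spikes(year_counts(records))` on the imported records returns candidate years to inspect by hand. The factor of 5 is an arbitrary starting point, and records without a recognisable year are simply skipped.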

I inspected a few records in the 1900 spike and encountered a few photographs, which gave me the idea that the spike could represent a larger series of photographs – this would explain the high production output in a short timeframe. The tool allows me to colour records according to a field value, so I gave it a try and coloured all photographs in green:


Coding da Vinci

As part of the project Coding da Vinci, which will take place in Berlin next weekend, I will lead a workshop on practical approaches to visualising cultural data. Coding da Vinci aims to encourage new uses for cultural datasets by bringing designers and developers together with cultural institutions.


Registration is still open for the inaugural event on 25th/26th of April 2015, which will be followed by a ten-week period for the participants to work on their projects, culminating in a final event with presentations on the 5th of July.

Find more information and details on how to register on the organiser’s website.

Media Archaeology as Artistic Practice

I will be presenting at this workshop, taking place on February 1st 2015 at the House of World Cultures in Berlin. The event forms part of the CTM festival and transmediale and is open to the public.


The House of World Cultures in Berlin. Image by Avda

The organisers Shintaro Miyazaki and Jamie Allen write:

Media archaeology is an academic method, but also an artistic practice and material inquiry. Playful, ironic aesthetics and critically historical approaches to media cultures and their technologies is gaining increased attention. We live in an archive of the media technological storage and of regurgitation of bygone times — such a situation requires artistic reactions and interventions.

In this context I will present my research and will focus on two recent projects on mining and visualising Wikipedia article revisions. A Wikipedia article, as commonly accessed through a browser, only represents the most recent version of that article. Underneath the surface are often thousands of earlier revisions of the same article. Wikipedia is not only an encyclopaedia, but also a history of an encyclopaedia and a reflection of changing knowledge, beliefs, concerns and social issues. Through my work I try to expose these hidden layers and mine the cultural archive of Wikipedia.
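For readers who want to look beneath the surface themselves, the revision history can be requested through the public MediaWiki API. The following is a minimal sketch, not the code behind my projects: it only builds the request URL and unpacks a response that has already been decoded from JSON. The function names are my own for this illustration, and the choice of `rvprop` fields is an assumption about what one might want to retrieve.

```python
from urllib.parse import urlencode

API = "https://en.wikipedia.org/w/api.php"


def revisions_url(title, limit=50, continue_from=None):
    """Build a MediaWiki API URL that lists revisions of one article."""
    params = {
        "action": "query",
        "prop": "revisions",
        "titles": title,
        "rvprop": "ids|timestamp|user|comment",  # metadata only, no text
        "rvlimit": limit,
        "rvdir": "newer",  # oldest revision first
        "format": "json",
        "formatversion": 2,
    }
    if continue_from:
        # Token from a previous response, used to fetch the next batch.
        params["rvcontinue"] = continue_from
    return API + "?" + urlencode(params)


def parse_revisions(payload):
    """Return (revisions, next_token) from a decoded API response."""
    pages = payload.get("query", {}).get("pages", [])
    revisions = [rev for page in pages for rev in page.get("revisions", [])]
    next_token = payload.get("continue", {}).get("rvcontinue")
    return revisions, next_token
```

Fetching the URL, decoding the JSON and calling `parse_revisions` repeatedly until the token comes back as `None` walks through the whole history of an article, one batch at a time.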

Backstory: 13 years of HIV/AIDS on Wikipedia

For the 25th Day With(out) Art, an interactive timeline I created will occupy the website of the ICA Philadelphia on World AIDS Day, 1 December 2014. Visitors to the site will be able to explore the history of HIV and AIDS as captured by more than 8,000 revisions of the HIV/AIDS Wikipedia article.


The visualisation is based on BackStory, a tool I designed during the Beautiful Data workshop at Harvard metaLab. For Day With(out) Art, a campaign organised by Visual AIDS, I have expanded it into an interactive timeline, which lets users explore the revision history of the HIV/AIDS Wikipedia article.

In the words of Becky Huff Hunter from the ICA Philadelphia:

BackStory: 13 years of HIV/AIDS on Wikipedia is an online visualization tool which allows viewers to explore a subjective, contested, and constantly expanding history of HIV/AIDS, through a chronology of revisions to Wikipedia articles on this topic.

When you read the article about HIV/AIDS on Wikipedia, what you see is just the latest version of a document that has undergone 13 years of collaborative writing and editing. This revision history, which is exposed through this visualisation, reflects the changing views and discourses around HIV/AIDS.

The visualisation is based on more than 8,000 versions of the HIV/AIDS Wikipedia article, which have been curated around three chosen keywords: Condoms, Viral Load and Safe Sex.

Visit www.icaphila.org on December 1st to see the project live.

See also the announcement on the ICA’s website.

BackStory is an online visualisation tool for exploring the history of Wikipedia articles. It lets you access and navigate through their past revisions.

The Search Is Over! – Day 1

This is a long-overdue blog post about a workshop I co-organised on the topic of exploring cultural collections with visualisation: The Search is Over! It was conceived by Marian Dörk, Mitchell Whitelaw and Stephen Drucker, and took place during DL2014, 11-12 September 2014 at the City University in London.


The line-up of speakers promised two exciting days, and it was matched by an engaged and enterprising group of participants. In this post I summarise the two keynotes of the first day, given by Lizzy Jongma (Rijksmuseum) and Aaron Straup Cope (Cooper Hewitt).
