Bit-rot and Digital History
John Naughton has an interesting comment on a subject in which I have a professional interest. As a student of contemporary history, I worry about digital archiving.
At first, researching and writing about the first British government of the internet age was a thrill and a liberation. The Blair premiership was the first to have so many of its primary historical documents online. Command papers, Hansard, select committee evidence; not to mention huge quantities of contemporary newspaper and BBC reporting.
So it was a shock to bump into the limit of the information utopia – something that happened quite abruptly on 27 June 2007. The No 10 website was rebuilt overnight, and searches for familiar material brought up an Error 404 File Not Found. Tony Blair had become a digital non-person.
Of course, the Brown supremacy was not quite engaged in Stalinist obliteration of the past. The entire No 10 website of the preceding years had been copied and stored in a series of snapshots by The National Archives. It became harder to find things, although The National Archives has worked on improving the search functions since.
As time passed, it also became harder to find other older documents, as the growing power of Google’s algorithms could not quite keep up with the spread of broken links and defunct websites.
Anyway, Naughton says:
The longer I’ve been around, the more concerned I become about long-term data loss — in the archival sense. What are the chances that the digital record of our current period will still be accessible in 300 years’ time? The honest answer is that we don’t know. And my guess is that it definitely won’t be available unless we take pretty rigorous steps to ensure it. Otherwise it’s posterity be damned.
It’s a big mistake to think about this as a technical problem — to regard it as a matter of bit-rot, digital media and formats. If anything, the technical aspects are the trivial aspects of the problem. The really hard questions are institutional: how can we ensure that there are organisations in place in 300 years that will be capable of taking responsibility for keeping the archive intact, safe and accessible?
I’m not quite as gloomy as he, having a high opinion of Google’s ability to innovate, and of its institutional integrity. But it is undoubtedly an important question.