Transfer Metrics Needed Tracking (TMNT)archives digital media transfer TMNT
This (fiscal) year, I’m taking some next steps using the data produced by my digital media survey at MIT. Now that we have a record of what media exist in the collections to the best of our knowledge, we can begin transferring the data off of them at a larger scale and into preservation and access storage. But wait… could I use this work as a representative sample to better plan for the future? Enter the Transfer Metrics Needed Tracking (TMNT) project.
What is it?
This is a project to record some basic numbers about the time and success rate of data transfer from digital media over the next year. This is similar to what other institutions have done, such as the British Library’s Flashback project.
For this project I’ve imported the media survey data into Airtable and linked it up with another base where we’ll use an interface to record the following info:
- Setup time
- Transfer time and attempts
- Transfer method
- Overall success/failure
- Extraction time and attempts (if not performed in the transfer process)
- Packaging time and attempts (if not performed in the transfer process)
- Data size
- Computer used
The information we gather will be enhanced in the final data set by contextual information we already know from the survey: format types and media manufacturer information. The details around actually processing the files will be recorded with the rest of our processing metrics.
Why are you doing this??
Really this is for planning purposes. We need to know about how long these projects might take in order to allocate enough time and person power to get the work done while balancing many other priorities. The project isn’t a way to establish benchmarks that need to be met in some sort of Taylorist archival transfer process nightmare scenario.
With a decent amount of a digital media backlog in existing collections, there will be many projects to transfer the data to a more stable medium over the coming years. At the same time, we’ll have new collections coming in. With a better sense of how much work we have in store, we can also make more informed decisions on whether we have the capacity to accept collections with substantial amounts of digital media or respectfully decline.
My hope is that this information will be useful to others as well, so I’ll to share my results out at the end and possibly along the way. Maybe somewhere else will do a similar project and we can compare numbers? In any event, grab a slice of pizza and check back for updates.