Cory Snavely (University of Michigan, Hathitrust) gave a brief overview of Hathitrust, a repository of digital content shared by many of the Big Ten schools and a few other partners.
Brad McLean (Duraspace) reported on DuraCloud and results from the initial pilot partners. (ICPSR is part of the current pilot, but was not a member of the original, smaller pilot program.) He noted theseconcerns about using the cloud for digital preservation:
- Some services (such as Amazon's S3) have limits on the size of objects (files)
- Bandwidth limits on a per-server basis can impede function and performance
- Large files are troublesome
- Performance across the cloud can vary widely
- (File) naming matters; some storage services limit the type of characters in a name
Matt Schulz (MetaArchive) updated us on the MetaArchive, including a current partnership with Chronopolis.
David Minor (San Diego Supercomputer Center) updated us on the Chronopolis project. David noted that SDSC is reimplementing its data center, and described three levels of storage in its future architecture:
- High-performance storage for scratch content
- Traditional filesystem storage
- Archival storage
I'll write-up my notes from Day Two early next week.