Technology at ICPSR: Designing Storage Architectures for Digital Preservation

Wednesday, October 13, 2010

Designing Storage Architectures for Digital Preservation - Day Two, Part Two

The final session of the conference featured six speakers.

Jimmy Lin (University of Maryland) is spending some time at Twitter, and described their technology stack: hardware, HDFS, Hadoop, and pig, which he described as the "perl/python of big data."
Mike Smorul (University of Maryland) gave an overview of their "time machine for the web" and the challenges of managing a web archive
John Johnson (Pacific Northwest National Laboratory) proposed that the scientific process has changed in that data produced by computation is now one of the drivers for creating and testing new theories
Leslie Johnston (Library of Congress) spoke briefly about an IBM emerging technology called "big sheets"
Dave Fellinger (DataDirect Networks) urged the audience to "don't be afraid to count machine cycles" when analyzing storage systems for bottlenecks that increase service latency
Kevin Kambach (Oracle) finished the session with industry notes about large data

The day then concluded with two final talks. One was from Subodh Kulkarni (Imation) who gave an overview of storage technology from magnetic tape to hard disk, and the other was from David Rosenthal (LOCKSS) who gave an abbreviated version of his iPres talk, "How Green is Digital Preservation?" David mentioned a very interesting, large-scale, low-power computing and storage platform being produced by a company called Seamicro.

Technology at ICPSR

Wednesday, October 13, 2010

Designing Storage Architectures for Digital Preservation - Day Two, Part Two

No comments:

Post a Comment