- Jimmy Lin (University of Maryland) is spending some time at Twitter, and described their technology stack: hardware, HDFS, Hadoop, and pig, which he described as the "perl/python of big data."
- Mike Smorul (University of Maryland) gave an overview of their "time machine for the web" and the challenges of managing a web archive
- John Johnson (Pacific Northwest National Laboratory) proposed that the scientific process has changed in that data produced by computation is now one of the drivers for creating and testing new theories
- Leslie Johnston (Library of Congress) spoke briefly about an IBM emerging technology called "big sheets"
- Dave Fellinger (DataDirect Networks) urged the audience to "don't be afraid to count machine cycles" when analyzing storage systems for bottlenecks that increase service latency
- Kevin Kambach (Oracle) finished the session with industry notes about large data
Wednesday, October 13, 2010
Designing Storage Architectures for Digital Preservation - Day Two, Part Two
The final session of the conference featured six speakers.