Tuesday, March 20, 2012

February 2012 deposits at ICPSR

Wow.

# of files# of depositsFile format
151F 0x07 video/h264
33application/msaccess
2311application/msword
5511application/msword application/msword
156application/octet-stream
5818application/pdf
21application/vnd.wordperfect
11application/x-dosexec
22application/x-empty
11application/x-rar
54application/x-sas
22application/x-shellscript
8412application/x-spss
32application/x-stata
55application/x-zip
11image/jpeg
11image/tiff
22message/rfc8220117bit
39659710text/html
195text/plain; charset=iso-8859-1
33text/plain; charset=unknown
297821text/plain; charset=us-ascii
11text/plain; charset=utf-8
202text/x-c++; charset=us-ascii
136text/x-c; charset=us-ascii
33text/x-mail; charset=us-ascii
94text/xml

So we have some of the usual suspects in the usual quantities.  And the usual miscues (e.g. C++ source code) due to entries in the magic database that we should really address.

And we have some new funkiness with magic or file with the strange video MIME type and the "double" MS-Word MIME type.

A few types we see but not often:  WordPerfect, Access, and some shell scripts (purportedly).  Nice.

But nearly 400k HTML files?  Wow.  Double wow.

I'm not sure what that is, but it seems like something big.

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.