Wednesday, August 3, 2011

July 2011 deposits at ICPSR

Another month, another deposit summary:


# of files   # of deposits   File format
3            2               application/msaccess
51016                        application/msword
317          2               application/octet-stream
52822                        application/pdf
2217                         application/vnd.ms-excel
1            1               application/vnd.wordperfect
18           1               application/x-123
115          2               application/x-dbase
3            1               application/x-empty
3            3               application/x-sas
21212                        application/x-spss
5            3               application/x-stata
5            5               application/x-zip
1            1               audio/mpeg
1            1               image/jpeg
10           1               message/rfc8220117bit
2            2               text/html
5            2               text/plain; charset=unknown
232          9               text/plain; charset=us-ascii
1            1               text/rtf
1            1               text/x-c++; charset=us-ascii
50           2               text/xml
12           2               video/unknown

The usual suspects appear in the usual volumes: lots of SPSS, PDF, and MS Word. There seems to be a lot of dBase in this month's report; that is unusual and worth investigating. The service that generates the MIME types works pretty well most of the time, but is not 100% error-free. On that point, I suspect that the purported video files and C++ source code are actually something else.

The two deposits with many "unknown" (application/octet-stream) files are worth a look too. They may be in some esoteric format that we do not see very often.
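
For readers curious how a summary like this one could be tallied, here is a minimal sketch in Python. It is not the actual ICPSR reporting code. It assumes each file is available as a (deposit ID, MIME type) pair, and the `summarize` function and the sample records are hypothetical, made up for illustration.

```python
from collections import defaultdict


def summarize(records):
    """Tally files and distinct deposits per MIME type.

    `records` is an iterable of (deposit_id, mime_type) pairs, one per
    deposited file. This input shape is an assumption for illustration.
    Returns a list of (mime_type, n_files, n_deposits), sorted by MIME type.
    """
    file_counts = defaultdict(int)
    deposit_ids = defaultdict(set)
    for deposit_id, mime_type in records:
        file_counts[mime_type] += 1
        deposit_ids[mime_type].add(deposit_id)
    return sorted(
        (mime_type, file_counts[mime_type], len(deposit_ids[mime_type]))
        for mime_type in file_counts
    )


# Hypothetical sample records, not real deposit data.
sample = [
    ("dep-1", "application/pdf"),
    ("dep-1", "application/pdf"),
    ("dep-2", "application/pdf"),
    ("dep-2", "application/x-spss"),
]

for mime_type, n_files, n_deposits in summarize(sample):
    print(f"{n_files}\t{n_deposits}\t{mime_type}")
```

Counting distinct deposit IDs separately from files is what makes rows like the octet-stream one stand out: many files concentrated in very few deposits.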
