Wednesday, September 8, 2010

A snapshot in time: deposited files in August 2010

A typical month (for the summer) of deposits at ICPSR by file format type:


# files  File format                       # of deposits
      3  application/msaccess                          2
      3  application/msoffice                          2
    159  application/msword                           29
    359  application/pdf                              30
     13  application/vnd.ms-excel                      6
     24  application/vnd.ms-powerpoint                 2
     31  application/x-sas                             5
     88  application/x-spss                           27
      2  application/x-stata                           2
      1  application/x-zip                             1
      1  audio/mpeg                                    1
      1  image/png                                     1
      5  message/rfc822                                2
     30  text/html                                     2
      1  text/plain; charset=iso-8859-1                1
     24  text/plain; charset=unknown                   4
    423  text/plain; charset=us-ascii                 18
      5  text/rtf                                      4
      8  text/x-c; charset=us-ascii                    2
      1  text/x-mail; charset=us-ascii                 1
      5  text/xml                                      2

Lots of plain text and PDF. Plenty of files in the typical stat packages (SAS, SPSS, Stata). A handful of Microsoft Office formats.

Our main tool for automagically calculating MIME types is the Unix file utility, so the files it identified as C program text (text/x-c) are almost certainly just plain text, or maybe setup files.
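
For the curious, here is a rough sketch of how a table like this could be produced. It is not our production code. It assumes the file utility is on the PATH and that each file is already paired with the deposit it arrived in; the names mime_of and tally, and the sample deposit IDs, are made up for illustration.

```python
import subprocess
from collections import defaultdict


def mime_of(path):
    """Ask the Unix file utility for a MIME type plus charset,
    e.g. 'text/plain; charset=us-ascii'. Assumes `file` is installed."""
    result = subprocess.run(
        ["file", "--brief", "--mime", path],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


def tally(records):
    """records: iterable of (deposit_id, mime) pairs.
    Returns {mime: (file_count, deposit_count)}."""
    files = defaultdict(int)
    deposits = defaultdict(set)
    for deposit_id, mime in records:
        files[mime] += 1                 # one more file of this type
        deposits[mime].add(deposit_id)   # distinct deposits containing it
    return {mime: (files[mime], len(deposits[mime])) for mime in files}


# Hypothetical sample; in practice each pair would come from
# (deposit_id, mime_of(path)) for every file in the deposit.
sample = [
    ("dep-1", "application/pdf"),
    ("dep-1", "application/pdf"),
    ("dep-2", "application/pdf"),
    ("dep-2", "text/x-c; charset=us-ascii"),
]
summary = tally(sample)
for mime in sorted(summary):
    n_files, n_deposits = summary[mime]
    print(f"{n_files:>7}  {mime:<32}  {n_deposits:>13}")
```

Keeping a set of deposit IDs per MIME type is what separates the two numeric columns: a deposit with a hundred PDFs adds a hundred to the file count but only one to the deposit count.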
