# of files | # of deposits | File format |
2 | 1 | application/msaccess |
23 | 1 | application/msoffice |
165 | 23 | application/msword |
698 | 4 | application/octet-stream |
266 | 28 | application/pdf |
144 | 11 | application/vnd.ms-excel |
1 | 1 | application/vnd.ms-powerpoint |
14 | 2 | application/vnd.wordperfect |
141 | 1 | application/x-123 |
4 | 1 | application/x-arc011lzw |
25 | 1 | application/x-dbase |
23 | 1 | application/x-dosexec |
1 | 1 | application/x-empty |
1 | 1 | application/x-rar |
19 | 5 | application/x-sas |
1307 | 21 | application/x-spss |
10 | 4 | application/x-stata |
3 | 3 | application/x-zip |
20 | 9 | message/rfc8220117bit |
8 | 7 | text/html |
12 | 6 | text/plain; charset=iso-8859-1 |
10 | 6 | text/plain; charset=unknown |
4386 | 46 | text/plain; charset=us-ascii |
2 | 1 | text/plain; charset=utf-8 |
11 | 4 | text/rtf |
7 | 2 | text/x-c++; charset=us-ascii |
1 | 1 | text/x-c; charset=us-ascii |
1 | 1 | text/x-mail; charset=us-ascii |
1 | 1 | text/xml |
153 | 2 | video/unknown |
In addition to the usual suspects like plain ASCII, SAS, SPSS, MS Word, PDF, we also have some of the usual problems, such as files being reported by the automated checker as containing C or C++ source code, when the truth is that they are likely text/plain instead.
One interesting data point is the pair of deposits that contain video files, and lots of them. Upon further review these appear to be vintage SPSS files for the IBM PC. Here's a string that appears in all of the files:
SPSS/PC+ System File Written by Data Entry II
and here is another one:
PCSPSS SYSTEM FILE. IBM PC DOS, SPSS/PC+ V3.0
From a timestamp located nearby, it looks like these files were from 1994. Or maybe they were moved from a mainframe to a PC in 1994?
And there are a few others on the list above that would benefit from some human scrutiny as well.
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.