We are using deposit #15868 as an example to illustrate the Fedora Objects we will use to store social science research data and documentation that has been deposited at ICPSR. We chose this deposit since each of the files are readily available on a public web site and pose no risk of disclosure. Each of the images below is also a hyperlink to the corresponding object in our public Fedora Commons repository.
In this case the survey data are in a format produced by the statistical analysis software called SAS, and our file format identification software has assigned it the MIME type of application/x-sas. This content goes into its own Datastream (last one on the left), and we Fedora calculates a message digest to fingerprint the file (Fixity).
We note the original name of the file in the DC Datastream along with the identity of the depositor, the origin of the file, and the identity of the organization that created the file (Provenance).
We capture its relationship to the higher-level deposit transaction via a relationship in the RELS-EXT Datastream, and we also later capture what role this file plays in the data curation lifecycle at ICPSR (Context).
Not captured or shown at this level are the terms to which the deposit agreed when transferring this content to ICPSR (Access Rights). We will store those in the aggregate-level object. Typically the depositor grants ICPSR non-exclusive rights to reproduce and publish the content, but this is not exclusively true.
Likewise, we capture similar information for the other three files in the deposit:
In the next blog post on this topic, I'll publish a description of the aggregate object to which these four assert an isPartOf relationship. Once we have that object as well, we can begin talking about producing an OAIS Submission Information Package (SIP) for each object.