Difference between revisions of "Workflow:Digital archiving workflow (high-level)"

From COPTR
Jump to navigation Jump to search
(Information updated to v0.5 of the workflow)
Line 16: Line 16:
 
; START
 
; START
 
: A request for deposit of digital materials (by transfer, donation, or purchase) is submitted to the University Archives.<br/>
 
: A request for deposit of digital materials (by transfer, donation, or purchase) is submitted to the University Archives.<br/>
 +
 
; PRE-ACQUISITION APPRAISAL
 
; PRE-ACQUISITION APPRAISAL
: Processes for evaluating whether a deposit request will be accepted by the University Archives.  
+
: Processes for evaluating whether a deposit request will be accepted by the University Archives.
# Prepare a records survey and/or pre-accession assessment of the proposed deposit.
+
# Check the deposit request against the [https://www.gla.ac.uk/myglasgow/archivespecialcollections/collectionsdevelopmentpolicy/ Archives & Special Collections collection development policy] - does the request align with the core collecting areas?
# Evaluate the results of the records survey against the Appraisal & Retention policy, which:
+
# Prepare a records survey and/or pre-accession assessment of the proposed deposit.
#* addresses issues pertaining to selection and long-term retention of digital objects
+
# Evaluate the results of the records survey against the Appraisal & Retention policy, which:
#* extends the [https://www.gla.ac.uk/myglasgow/archivespecialcollections/collectionsdevelopmentpolicy/ Archives & Special Collections collection development policy]
+
#* addresses issues pertaining to selection and long-term retention of digital objects
#* Ensures that retention decisions are balanced between value and capacity to preserve for the long-term; and
+
#* extends the collections development policy
#* provides clarity to avoid assumptions over digital storage costs and availability.
+
#* ensures that retention decisions are balanced between value and capacity to preserve for the long-term; and
# Decide whether the requested deposit aligns with policy:
+
#* provides clarity to avoid assumptions over digital storage costs and availability.
#* If not, re-evaluate acquisition and/or reject deposit.
+
# Decide whether the requested deposit aligns with policies:
#* If yes, proceed to Acquisition.
+
#* If not, re-evaluate acquisition and/or reject deposit.
 +
#* If yes, proceed to Acquisition.
 +
 
 
; ACQUISITION
 
; ACQUISITION
 
: Processes for acquiring digital materials by transfer, donation, or purchase.
 
: Processes for acquiring digital materials by transfer, donation, or purchase.
# Follow methodology in the Space data and information transfer systems — Producer-archive interface — Methodology abstract standard (PAIMAS) ISO 20652:2006 standard. The standard "identifies, defines and provides structure to the relationships and interactions between an information producer and an archive. It defines the methodology for the structure of actions that are required from the initial time of contact between the producer and the archive until the objects of information are received and validated by the archive." ([https://www.iso.org/standard/39577.html ISO]). For more information, see [https://www.dpconline.org/handbook/organisational-activities/acquisition-and-appraisal Acquisition and appraisal, Digital Preservation Handbook (DPC)].
+
# Follow the methodology in the Space data and information transfer systems — Producer-archive interface — Methodology abstract standard (PAIMAS) ISO 20652:2006 standard. The standard "identifies, defines and provides structure to the relationships and interactions between an information producer and an archive. It defines the methodology for the structure of actions that are required from the initial time of contact between the producer and the archive until the objects of information are received and validated by the archive." ([https://www.iso.org/standard/39577.html ISO]). For more information, see [https://www.dpconline.org/handbook/organisational-activities/acquisition-and-appraisal Acquisition and appraisal, Digital Preservation Handbook (DPC)].
# Check digital materials for viruses. See [https://www.nationalarchives.gov.uk/archives-sector/projects-and-programmes/plugged-in-powered-up/digital-preservation-workflows/1-select-and-transfer/ Select and transfer] section 1.3 for a reasonable process. Depending on the results of virus checks:
+
# Follow the Accepted file formats/media procedure, which:
#* if virus is found, quarantine and attempt removal; and/or request clean versions from source. If all these fail, prepare a report documenting actions and re-evaluate acquisition.
+
#* Specifies decisions on file formats and/or storage media that the University Archives will accept.  
#* if virus-free, proceed to Accessioning.
+
#* Aligns with preservation planning decisions for format normalisation; and capability to access storage media (esp. legacy media, e.g. floppy or zip disks).
; ACCESSIONING
+
#* For a summary table of options, see the Acquisition workflow section in [https://www.dpconline.org/handbook/organisational-activities/acquisition-and-appraisal Acquisition and appraisal, Digital Preservation Handbook (DPC)].
:Process of formally accepting material into the University Archives, which enables intellectual control over the digital materials.
+
# For acquisitions deposited in physical storage media:
# Follow standard accessioning practices as per non-digital acquisitions.
+
#* Place all incoming items in quarantine area on arrival, inspect for pest infestation and mould; and follow handling and moving procedures.
# Follow the Accepted file formats/media procedure, which:
+
#* Create physical conservation and preservation report, documenting all actions on the acquired media.
#* Specifies decisions on file formats and/or storage media that the University Archives will accept.  
+
#* Proceed to Transfer.
#* Aligns with preservation planning decisions for format normalisation; and capability to access storage media (esp. legacy media, e.g. floppy or zip disks).
+
# For acquisitions deposited digitally (e.g. file transfer):
#* For a summary table of options, see the Acquisition workflow section in [https://www.dpconline.org/handbook/organisational-activities/acquisition-and-appraisal Acquisition and appraisal, Digital Preservation Handbook (DPC)].
+
#* Proceed to Transfer.
# Proceed to Transfer.
+
 
; TRANSFER
 
; TRANSFER
 
: Processes for transferring digital materials to the University Archives.
 
: Processes for transferring digital materials to the University Archives.
# Choose a method for transferring files:
+
# Choose a method for transferring files:
#* Copy files from source media. OR
+
#* Copy files from source media. OR
#* Create a disk image from source storage media. OR
+
#* Create a disk image from source storage media. OR
#* Request that digital materials are submitted as a BagIt container.
+
#* Request that digital materials are submitted as a BagIt container.
# In all cases, generate checksums to verify data integrity during transmission and/or storage:
+
# Check digital materials for viruses. See [https://www.nationalarchives.gov.uk/archives-sector/projects-and-programmes/plugged-in-powered-up/digital-preservation-workflows/1-select-and-transfer/ Select and transfer workflow in the TNA guide] (section 1.3) for a reasonable process. Depending on the results of virus checks:
#* For digital acquisitions <i>in situ</i>, it might be appropriate to first store files in a temporary location for virus and/or integrity checks, before transferring to process store. Not applicable to all scenarios.
+
#* if virus is found, quarantine and attempt removal; and/or request clean versions from source. If all these fail, prepare a report documenting actions and re-evaluate acquisition.
#* Transfer digital materials to Process store (e.g. network drive).
+
#* if virus-free, proceed with transfer.
# Use tools to identify file types and validate file formats (e.g. DROID, JHOVE), then proceed to Appraisal
+
# Generate checksums to verify data integrity during transmission and/or storage:
 +
#* For digital acquisitions <i>in situ</i>, it might be appropriate to first store files in a temporary location for virus and/or integrity checks, before transferring to process store. Not applicable to all scenarios.
 +
#* Transfer digital materials to Process store (e.g. network drive).
 +
# Use tools to identify file types and validate file formats (e.g. DROID, JHOVE), then proceed to Accessioning.
 +
 
 +
; ACCESSIONING
 +
:Process of formally registering deposit into the University Archives, which enables intellectual control over the digital materials.
 +
# Generate a unique accession number, based on the University Archives' archival processing guidelines.
 +
# Compare the file manifests generated during Acquisition and Transfer to make sure that the transfer includes everything that was deposited by the source.
 +
# List the accession number into the University's Collections Management System for cataloguing. Cataloguing processes include decisions over the system of arrangement and level of description that will be used for the deposited materials; definition of access and reproduction conditions; and documentation via descriptive metadata.
 +
# Proceed to Appraisal.
 +
 
 
; APPRAISAL
 
; APPRAISAL
: Processes for selecting digital materials from an acquisition, which are deemed as appropriate for lo-term preservation and archiving.
+
: Processes for selecting digital materials from a transfer, which are deemed as appropriate for long-term preservation and archiving.
 +
# Perform appraisal actions that have been deemed necessary (if any), including at a minimum:
 +
#* Selecting which files from a transfer should be added to a SIP and how these files should be arranged within the SIP.
 +
#* Removing duplicates, e.g. by comparing checksums. See [https://www.nationalarchives.gov.uk/archives-sector/projects-and-programmes/plugged-in-powered-up/digital-preservation-workflows/2-ingest/ Ingest workflow in the TNA guide] (section 2.5) for a reasonable process.
 +
# Complete legal checks, including:
 +
#* Checks against GDPR principles and possible exemptions for archiving in the public interest. The National Archives provide key information in their [https://www.nationalarchives.gov.uk/archives-sector/legislation/archives-data-protection-law-uk/gdpr-faqs/ Archives and GDPR: frequently asked questions] page.
 +
#* Checks for Personal Identifiable Information and other restricted/confidential information included in the deposited files that should not be made puclicly accessible.
 +
# Proceed to Ingest.
 +
 +
; INGEST
 +
: Processes for generating SIPs from tranferred digital objects, normalising for preservation and/or access; packaging into AIPs for archival storage; and/or DIPs for access systems.
 +
# Add any descriptive metadata (created in Cataloguing) and rights metadata (created during Appraisal) in the SIP.
 +
# Choose a normalisation strategy for file formats (based on preservation planning).
 +
# Archival storage solutions should follow the [https://www.dpconline.org/handbook/organisational-activities/storage Principles for using IT storage systems for digital preservation] defined in the DPC Handbook.
 +
# Periodic monitoring should include:
 +
#* File integrity checks by monitoring checksum changes.
 +
#* Format obsolescence checks, by monitoring changes to preservation planning and normalisation strategies.
 +
 
  
 
==Purpose, Context and Content==
 
==Purpose, Context and Content==
Line 63: Line 94:
  
 
==Evaluation/Review==
 
==Evaluation/Review==
The workflow has been successful in guiding tests of the University Archives' current digital preservation capability; and evaluating the suitability of our current setup for production-level digital preservation. Currently in version 0.3, it is continuously amended until reaching version 1.0 after which it will be reviewed on an annual basis.  
+
The workflow has been successful in guiding tests of the University Archives' current digital preservation capability; and evaluating the suitability of our current setup for production-level digital preservation. Currently in version 0.5, it is continuously amended until reaching version 1.0 after which it will be reviewed on an annual basis.  
  
 
==Further Information==
 
==Further Information==

Revision as of 16:06, 1 March 2022

Digital archiving workflow (high-level)
Status:Experimental
Tools:
Input:Request for deposit of digital materials by transfer, donation, or purchase to the University Archives.
Output:AIPs stored in Archival storage and/or DIPs uploaded to an Access system; or a Re-evaluation process in cases where digital materials are not deemed appropriate for acquisition.
Organisation:Archives & Special Collections (ASC), University of Glasgow

Workflow Description

Digital archiving workflow (high level, overview) produced by Archives and Special Collections at the University of Glasgow.

START
A request for deposit of digital materials (by transfer, donation, or purchase) is submitted to the University Archives.
PRE-ACQUISITION APPRAISAL
Processes for evaluating whether a deposit request will be accepted by the University Archives.

# Check the deposit request against the Archives & Special Collections collection development policy - does the request align with the core collecting areas? # Prepare a records survey and/or pre-accession assessment of the proposed deposit. # Evaluate the results of the records survey against the Appraisal & Retention policy, which: #* addresses issues pertaining to selection and long-term retention of digital objects #* extends the collections development policy #* ensures that retention decisions are balanced between value and capacity to preserve for the long-term; and #* provides clarity to avoid assumptions over digital storage costs and availability. # Decide whether the requested deposit aligns with policies: #* If not, re-evaluate acquisition and/or reject deposit. #* If yes, proceed to Acquisition.

ACQUISITION
Processes for acquiring digital materials by transfer, donation, or purchase.

# Follow the methodology in the Space data and information transfer systems — Producer-archive interface — Methodology abstract standard (PAIMAS) ISO 20652:2006 standard. The standard "identifies, defines and provides structure to the relationships and interactions between an information producer and an archive. It defines the methodology for the structure of actions that are required from the initial time of contact between the producer and the archive until the objects of information are received and validated by the archive." (ISO). For more information, see Acquisition and appraisal, Digital Preservation Handbook (DPC). # Follow the Accepted file formats/media procedure, which: #* Specifies decisions on file formats and/or storage media that the University Archives will accept. #* Aligns with preservation planning decisions for format normalisation; and capability to access storage media (esp. legacy media, e.g. floppy or zip disks). #* For a summary table of options, see the Acquisition workflow section in Acquisition and appraisal, Digital Preservation Handbook (DPC). # For acquisitions deposited in physical storage media: #* Place all incoming items in quarantine area on arrival, inspect for pest infestation and mould; and follow handling and moving procedures. #* Create physical conservation and preservation report, documenting all actions on the acquired media. #* Proceed to Transfer. # For acquisitions deposited digitally (e.g. file transfer): #* Proceed to Transfer.

TRANSFER
Processes for transferring digital materials to the University Archives.

# Choose a method for transferring files: #* Copy files from source media. OR #* Create a disk image from source storage media. OR #* Request that digital materials are submitted as a BagIt container. # Check digital materials for viruses. See Select and transfer workflow in the TNA guide (section 1.3) for a reasonable process. Depending on the results of virus checks: #* if virus is found, quarantine and attempt removal; and/or request clean versions from source. If all these fail, prepare a report documenting actions and re-evaluate acquisition. #* if virus-free, proceed with transfer. # Generate checksums to verify data integrity during transmission and/or storage: #* For digital acquisitions in situ, it might be appropriate to first store files in a temporary location for virus and/or integrity checks, before transferring to process store. Not applicable to all scenarios. #* Transfer digital materials to Process store (e.g. network drive). # Use tools to identify file types and validate file formats (e.g. DROID, JHOVE), then proceed to Accessioning.

ACCESSIONING
Process of formally registering deposit into the University Archives, which enables intellectual control over the digital materials.

# Generate a unique accession number, based on the University Archives' archival processing guidelines. # Compare the file manifests generated during Acquisition and Transfer to make sure that the transfer includes everything that was deposited by the source. # List the accession number into the University's Collections Management System for cataloguing. Cataloguing processes include decisions over the system of arrangement and level of description that will be used for the deposited materials; definition of access and reproduction conditions; and documentation via descriptive metadata. # Proceed to Appraisal.

APPRAISAL
Processes for selecting digital materials from a transfer, which are deemed as appropriate for long-term preservation and archiving.

# Perform appraisal actions that have been deemed necessary (if any), including at a minimum: #* Selecting which files from a transfer should be added to a SIP and how these files should be arranged within the SIP. #* Removing duplicates, e.g. by comparing checksums. See Ingest workflow in the TNA guide (section 2.5) for a reasonable process. # Complete legal checks, including: #* Checks against GDPR principles and possible exemptions for archiving in the public interest. The National Archives provide key information in their Archives and GDPR: frequently asked questions page. #* Checks for Personal Identifiable Information and other restricted/confidential information included in the deposited files that should not be made puclicly accessible. # Proceed to Ingest.

INGEST
Processes for generating SIPs from tranferred digital objects, normalising for preservation and/or access; packaging into AIPs for archival storage; and/or DIPs for access systems.

# Add any descriptive metadata (created in Cataloguing) and rights metadata (created during Appraisal) in the SIP. # Choose a normalisation strategy for file formats (based on preservation planning). # Archival storage solutions should follow the Principles for using IT storage systems for digital preservation defined in the DPC Handbook. # Periodic monitoring should include: #* File integrity checks by monitoring checksum changes. #* Format obsolescence checks, by monitoring changes to preservation planning and normalisation strategies.


Purpose, Context and Content

The workflow aims to formalise digital archiving activities, and incorporate digital preservation processing requirements. It has been designed as an experimental tool for testing production-level digital preservation processing at the University of Glasgow Archives. Production-level here describes in vivo test conditions in terms of:

  • data volume per transfer and ingest
  • file format types and variability
  • acquisition and accessioning procedures
  • currently available technical infrastructure, including computing, local/shared storage and network speed/capacity on campus.


Evaluation/Review

The workflow has been successful in guiding tests of the University Archives' current digital preservation capability; and evaluating the suitability of our current setup for production-level digital preservation. Currently in version 0.5, it is continuously amended until reaching version 1.0 after which it will be reviewed on an annual basis.

Further Information