Difference between revisions of "Workflow:Creating a SIP from content downloaded from OneDrive (or other Cloud based source)"

From COPTR
Jump to navigation Jump to search
 
(6 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
{{Infobox COW
 
{{Infobox COW
 
|status=Draft
 
|status=Draft
|tools=DROID (Digital Record Object Identification), TeraCopy
+
|tools=CSV Validator, DROID (Digital Record Object Identification), TeraCopy
 
|input=Digital records intended for deposit with the archives for long term preservation,
 
|input=Digital records intended for deposit with the archives for long term preservation,
 
|output=A SIP ready for ingest into the preservation system.
 
|output=A SIP ready for ingest into the preservation system.
 +
|organisation=Modern Records Centre, University of Warwick
 +
|organisationurl=https://warwick.ac.uk/services/library/mrc/
 
}}
 
}}
 
==Workflow Description==
 
==Workflow Description==
Line 12: Line 14:
  
 
<!-- Describe your workflow here with an overview of the different steps or processes involved-->
 
<!-- Describe your workflow here with an overview of the different steps or processes involved-->
# Create folder for SIP with metadata subfolder
+
# Create folder location for files with metadata subfolder
 
# Download content from OneDrive
 
# Download content from OneDrive
 
# Run DROID report on downloaded zip folder
 
# Run DROID report on downloaded zip folder
# Export DROID report to SIP folder metadata subfolder
+
# Export DROID report to metadata subfolder as "initial" report
# Move zip folder to SIP folder using Teracopy
+
# Move zip folder to folder using Teracopy
 
# Save Teracopy report  
 
# Save Teracopy report  
# Copy Teracopy report to SIP folder metadata subfolder
+
# Copy Teracopy report to metadata subfolder
# Extract zip folder to SIP folder
+
# Extract zip folder to folder
# Copy extracted files to another folder in a different location as working copy
+
# Copy extracted files and metadata to another folder in a different location as working copy
# Check for duplicates using ...
+
# Use CSV Validator and the DROID report to check for duplicates
 +
# Using working copy check for duplicates using ...
 
# Appraisal: delete duplicate and other files not selected for preservation
 
# Appraisal: delete duplicate and other files not selected for preservation
 +
# When appraisal is complete run a second DROID report and save in metadata folder as "final" report
 +
# Create a manifest of the original file names from the DROID report, save as a text file and add to the metadata folder
 +
 +
[[User:Rsm01|Rsm01]] ([[User talk:Rsm01|talk]]) 14:15, 28 April 2021 (UTC)
  
 
==Purpose, Context and Content==
 
==Purpose, Context and Content==
 
<!-- Describe what your workflow is for - i.e. what it is designed to achieve, what the organisational context of the workflow is, and what content it is designed to work with -->
 
<!-- Describe what your workflow is for - i.e. what it is designed to achieve, what the organisational context of the workflow is, and what content it is designed to work with -->
 +
The workflow is a step by step guide to capturing digital deposits from internal (and possibly external) sources, capturing metadata using basic tools, deduplicating and preparing the material as a SIP for ingest into the preservation system.
  
 
==Evaluation/Review==
 
==Evaluation/Review==
Line 36: Line 44:
  
 
<!-- Note that your workflow will be marked with a CC3.0 licence -->
 
<!-- Note that your workflow will be marked with a CC3.0 licence -->
 +
I wrote a blog post about the deduplication process here: https://anoldhanddigital.wordpress.com/tag/checksums/

Latest revision as of 14:15, 28 April 2021

Creating a SIP from content downloaded from OneDrive (or other Cloud based source)
Status:Draft
Tools:
Input:Digital records intended for deposit with the archives for long term preservation,
Output:A SIP ready for ingest into the preservation system.
Organisation:Modern Records Centre, University of Warwick

Workflow Description[edit]

Textual description

  1. Create folder location for files with metadata subfolder
  2. Download content from OneDrive
  3. Run DROID report on downloaded zip folder
  4. Export DROID report to metadata subfolder as "initial" report
  5. Move zip folder to folder using Teracopy
  6. Save Teracopy report
  7. Copy Teracopy report to metadata subfolder
  8. Extract zip folder to folder
  9. Copy extracted files and metadata to another folder in a different location as working copy
  10. Use CSV Validator and the DROID report to check for duplicates
  11. Using working copy check for duplicates using ...
  12. Appraisal: delete duplicate and other files not selected for preservation
  13. When appraisal is complete run a second DROID report and save in metadata folder as "final" report
  14. Create a manifest of the original file names from the DROID report, save as a text file and add to the metadata folder

Rsm01 (talk) 14:15, 28 April 2021 (UTC)

Purpose, Context and Content[edit]

The workflow is a step by step guide to capturing digital deposits from internal (and possibly external) sources, capturing metadata using basic tools, deduplicating and preparing the material as a SIP for ingest into the preservation system.

Evaluation/Review[edit]

Further Information[edit]

I wrote a blog post about the deduplication process here: https://anoldhanddigital.wordpress.com/tag/checksums/