Difference between revisions of "Workflow:Creating a SIP from content downloaded from OneDrive (or other Cloud based source)"

From COPTR
Jump to navigation Jump to search
Line 39: Line 39:
  
 
<!-- Note that your workflow will be marked with a CC3.0 licence -->
 
<!-- Note that your workflow will be marked with a CC3.0 licence -->
 +
I wrote a blog post about the deduplication process here: https://anoldhanddigital.wordpress.com/tag/checksums/

Revision as of 13:56, 28 April 2021

Creating a SIP from content downloaded from OneDrive (or other Cloud based source)
Status:Draft
Tools:
Input:Digital records intended for deposit with the archives for long term preservation,
Output:A SIP ready for ingest into the preservation system.

Workflow Description

Textual description

  1. Create folder location for files with metadata subfolder
  2. Download content from OneDrive
  3. Run DROID report on downloaded zip folder
  4. Export DROID report to metadata subfolder as "initial" report
  5. Move zip folder to folder using Teracopy
  6. Save Teracopy report
  7. Copy Teracopy report to metadata subfolder
  8. Extract zip folder to folder
  9. Copy extracted files and metadata to another folder in a different location as working copy
  10. Use CSV Validator and the DROID report to check for duplicates
  11. Using working copy check for duplicates using ...
  12. Appraisal: delete duplicate and other files not selected for preservation
  13. When appraisal is complete run a second DROID report and save in metadata folder as "final" report

Purpose, Context and Content

The workflow is a step by step guide to capturing digital deposits from internal (and possibly external) sources, capturing metadata using basic tools, deduplicating and preparing the material as a SIP for ingest into the preservation system.

Evaluation/Review

Further Information

I wrote a blog post about the deduplication process here: https://anoldhanddigital.wordpress.com/tag/checksums/