Store

From COPTR
Jump to navigation Jump to search
Lifecycle stage definition: Functions that support the DCC Lifecycle stage defined as "Store the data in a secure manner adhering to relevant standards."
Lifecycle order: 6

Functions within this lifecycle stage

FuntionDefinition
Active Data StorageTools that support the storage, management, and ultimately the preservation, of evolving research data.
BackupTools that support the backing up of digital data to another storage location, typically in a scheduled manner.
File ManagementTools that support general file management activities such as viewing or renaming
FixityTools that support the verification of file fixity, typically through the generation and validation of checksum based manifests.
Managing Active Research DataTools that enable researchers to manage data from its point of creation, facilitating its productive use in the present, but also establishing the support structures necessary to ensure its future survival.
Persistent IdentificationTools that support the unique and persistent identification of files or intellectual entities.
StorageTools that support the storage of digital resources, possibly in multiple locations to avoid loss of data due to hardware or other failures.

Tools for this lifecycle stage

ToolFunctionPurpose
7-ZipRendering
Transfer
Fixity
7-Zip is a file archiver with a high compression ratio, and encryption and fixity check capabilities
ACE (Audit Control Environment)FixityThe Auditing Control Environment is a mature set of software designed to help libraries and archives prove their holdings are intact and trustworthy.
ADIGRESStorage
File Management
Access
ADIGRES is a powerful cross-platform Document Management System written in Java.
ARK plugin for OmekaPersistent IdentificationGenerating and resolving ARK identifiers for resources in Omeka
AVP FixityFixityFixity monitoring for digital collections
Amazon CloudStorage
Service
Amazon Cloud is an internet-based storage location designed to hold files indefinitely.
ArchiFiltreFile Management
Appraisal
Overview of folder trees with fine diagrams
Archive::BagItFixity
File Copy
BagIt API for Perl
Ark servicePersistent IdentificationARK identifiers generator in python
AudacityRedaction
Personal Archiving
File Management
The open source audio editor
AvccFile ManagementAVCC is an open source application that enables collaborative, efficient item-level cataloging of audiovisual collections.
BAT: BnfArcToolsFile ManagementBAT is a Perl package for processing Internet Archive ARC, DAT and CDX file format.
BIL (BagIt Library)Fixity
File Copy
BagIt Library is a Java software library that supports the creation, manipulation and validation of bags.
BagIt Transfer UtilitiesFixity
File Copy
BagIt transfer Utilities are a collection of tools developed for the purpose of validation and transfer of bags.
BaggerFixity
Transfer
GUI application to facilitate the creation and verification of BagIt bags.
BitCuratorFile Management
Fixity
Metadata Extraction
Metadata Processing
Quality Assurance
Validation
Workflow
The BitCurator Environment is an Ubuntu Linux distribution geared to the needs of archivists and librarians. It includes a suite of open source digital forensics and data analysis tools to help collecting institutions process born-digital materials.
Bulk Rename UtilityFile ManagementBulk Rename Utility is a free file renaming software for Windows. Bulk Rename Utility allows you to easily rename files and entire folders based upon extremely flexible criteria.
CASTOR (CERN Advanced STORage manager)StorageCASTOR, which stands for the CERN Advanced STORage manager, is a hierarchical storage management (HSM) system developed at CERN used to store physics production files and user files.
CRunchWorkflow and Lab Notebook Management
Managing Active Research Data
cRunch provides an infrastructure for exploratory data analysis with the statistical programming language and environment R
CarboniteStorage
Backup
Service
an online backup service that automatically backs up documents, e-mails, music, photos, and settings. Info gathered early March 2013.
Checksum (by Corz)FixityFast hashing tool using a GUI interface
ChronopolisStorage
Backup
Service
"Chronopolis digital preservation network provides services for the long-term preservation and curation of America's digital holdings"
Cksum Unix commandFixitycksum computes a cyclic redundancy check (CRC) checksum for each given file, or standard input if none are given
CyberChefEncryption Detection
File Management
Decryption
Metadata Extraction
Personal Archiving
Binary & Hexidecimal Editing
Discovery
A forensic tool with workflow capabilities to analyse files and containers
D-Net Software KitPlanning
Data Management Planning
Managing Active Research Data
Software Kit creates a network of repositories that share the infrastructure services necessary to process and provide access to digital content.
DART (Digital Archivist's Resource Tool)Storage
File Management
Fixity
Transfer
Provides both a GUI and a command-line interface for packaging files and uploading them to remote repositories.
DCape (ingest only)Preservation System
Storage
"The goal of the DCAPE project is to build a distributed production preservation environment that meets the needs of archival repositories for trusted archival preservation services." (Note: This is a work in progress, see notes for more information)
DIMAGMetadata Extraction
Preservation System
Access
File Format Migration
Storage
Web Crawl
Workflow
A software suite supporting archives with preservation of digital information for eternity
DMPToolPlanning
Data Management Planning
Managing Active Research Data
DMPTool is an online service to enable researchers to create data management plans now required by many funding agencies, and to receive tailored institutional guidance to help them in the process.
DMPonlinePlanning
Data Management Planning
Managing Active Research Data
Service
DMPonline is the DCC's data management planning tool.
Data VaultBackup
Storage
Managing Active Research Data
A storage broker and front end for archiving research data that is no longer active but that does not have a need for open publication
DataCitePersistent Identification
Managing Active Research Data
Citation and Impact Tracking
DataCite works with data centres to assign persistent identifiers to datasets using the Digital Object Identifier (DOI) infrastructure.
DataFlowManaging Active Research Data
Storage
DataFlow is a two-stage data management infrastructure that is designed to allow researchers to work with, annotate, publish, and permanently store research data.
DataStageActive Data Storage
Managing Active Research Data
DataStage is a flexible data storage system that provides controlled access, secure backup, and the ability to transfer selected files to a more permanent archiving facility.
DataverseActive Data Storage
Preservation System
Managing Active Research Data
Storage
The Dataverse is an open source web application to share, preserve, cite, explore and analyze research data.
DcflddFile Management
Forensic
File Copy
dcfldd is an enhanced version of GNU dd with features useful for forensics and security.
Directory List & PrintMetadata Extraction
File Management
A universal metadata extractor
DiscImageChefMetadata Extraction
Backup
Disk Imaging
Media dump software and disc image manager
DiskViewFile ManagementDiskView shows you a graphical map of your disk, allowing you to determine where a file is located or, by clicking on a cluster, seeing which file occupies it.
DropboxStorage
Backup
Service
Dropbox is a free service that lets you bring all your photos, docs, and videos anywhere. This means that any file you save to your Dropbox will automatically save to all your computers, phones and even the Dropbox website. Dropbox also makes it super easy to share with others, whether you're a student or professional, parent or grandparent. Even if you accidentally spill a latte on your laptop, have no fear! You can relax knowing that Dropbox always has you covered, and none of your stuff will ever be lost.
EZARKPersistent IdentificationARK identifiers management tool and sub-publishers registry
EZIDPersistent IdentificationEZID (easy-eye-dee) makes it easy to create and manage unique, persistent identifiers.
EmailchemyDe-Duplication
File Format Migration
File Management
File Recovery
Converts proprietary emails to standard portable formats
ExactFileFixityMaking sure that what you hash is what you get
Explore2fsFile ManagementExplore2fs is a GUI explorer tool for accessing ext2 and ext3 filesystems.
FCIVFixity
Transfer
Generates and compares MD5 values stored in an XML file.
File Analyzer and Metadata Harvester V2File Management
Fixity
Metadata Extraction
Metadata Processing
Quality Assurance
Validation
Workflow
The File Analyzer is a general purpose desktop (and command line) tool designed to automate simple, file-based operations. The File Analyzer assembles a toolkit of tasks a user can perform. The tasks that have been written into the File Analyzer code base have been optimized for use by libraries, archives, and other cultural heritage institutions.
FileVerifier++Fixity
De-Duplication
Windows utility for verifying file contents
FixiFixityFixi is a command-line utility that indexes, verifies, and updates checksum information for collections of files.
Free Video Cutter JoinerRedaction
Personal Archiving
File Management
It cuts and joins video streams without altering the codecs.
FreeCommanderFile ManagementSplit-screen file manager with desirable extras
FslintDe-Duplication
File Management
Set of utilities to find and clean various forms of lint on a filesystem, such as duplicate files, empty directories, and bad file names.
Glacier (Amazon)Backup
Storage
Amazon Glacier is a secure, durable, and extremely low-cost cloud storage service for data archiving and long-term backup.
Google CloudStorageGoogle Cloud Storage allows users to store, access, and manage their data.
GreensPersistent IdentificationARK identifiers minter and resolver
HopplaPreservation System
Storage
Hoppla is an archiving solution that combines back-up and fully automated migration services for data collections in small office environments.
IRODS (integrated Rule Oriented Data Systems)StorageiRODS software was designed to allow curators utilising heterogeneous storage and computing facilities to define policies without being concerned with the technical detail of how the system implements those policies and without having to respond to changes in technical infrastructure.
Java library implementing PairtreeFile Management
De-Duplication
The PAIRTREE LIBRARY is a software library that supports the mapping between identifiers and filepaths according to the Pairtree Specification.
KeplerWorkflow and Lab Notebook Management
Managing Active Research Data
Kepler is a scientific workflow modelling and management system that enables users, regardless of programming experience, to set up data analysis pipelines.
LOCKSS (Lots of Copies Keep Stuff Safe)Storage
Preservation System
Access
LOCKSS software allows libraries to create preserved digital collections out of materials that would otherwise be accessible only through a licensed academic subscription.
LabTroveWorkflow and Lab Notebook Management
Managing Active Research Data
LabTrove is a blogging platform specifically designed for use in a research environment.
Legacy LockerStorageLegacy Locker is a safe, secure repository for your vital digital property that lets you grant access to online assets for friends and loved ones in the event of loss, death, or disability.
MailStore HomeMetadata Extraction
Metadata Processing
File Management
Discovery
Unifies your private emails into one searchable, platform-independent repository
Md5deep and hashdeepFixitymd5deep is a set of programs to compute MD5, SHA-1, SHA-256, Tiger, or Whirlpool message digests on an arbitrary number of files. hashdeep is a program to compute, match, and audit hashsets.
Md5sum Unix commandFixitymd5sum computes a 128-bit checksum (or fingerprint or message-digest) for each specified file.
Md5summerFixityMD5summer is an application for Microsoft Windows 9x, NT, ME, 2000 and XP which generates and verifies md5 checksums.
Minimum Preservation ToolFixity
Preservation System
The Minimum Preservation Tool (MPT) can be used to create an interim preservation storage environment for files awaiting preservation in a longer term repository solution. It supports checksum generation, fixity checking, and replication across two or more storage nodes.
MyExperimentWorkflow and Lab Notebook Management
Managing Active Research Data
Academic Social Networking
Workflow
myExperiment is an online social networking service aimed at scientific researchers; the site fosters collaboration by allowing members to share scientific workflows, experiment plans, and other digital objects.
NARA File Analyzer and Metadata HarvesterFixity
Metadata Extraction
File Format Identification
NARA File Analyzer and Metadata Harvester allows a user to analyze the contents of a file system or external drive and generates statistics about the contents of the contained directories.
NOID (Ruby)Persistent IdentificationA version of NOID in Ruby
Nice Opaque Identifiers (NOID)Persistent IdentificationIdentifiers management tool to generate, bind and resolve different kinds of identifiers
NoidsPersistent IdentificationIdentifiers management tool
Omeka Identity pluginPersistent IdentificationPlugin for Omeka to assign ARK identifiers
PackageHandlerFile Management
Metadata Processing
Validation
Personal Archiving
Appraisal
View, create, edit, and validate Swiss archival packages
Proofpoint Enterprise ArchiveDiscovery
Service
Storage
Proofpoint Enterprise Archive is a SaaS email archiving solution that addresses three key challenges eDiscovery, regulatory compliance and email storage management without the headaches of managing archiving in-house.
Python checkm packageFixityThis is a Python implementation of the checkm specification.
RE (Rename Expert)Metadata Processing
File Management
Controlled renaming of file collections
RackSpaceStorage
Service
RackSpace provices cloud based services to businesses of all sizes through the world.
ReACT (Resource Audit and Comparison Tool)File Management
Quality Assurance
A file audit and comparison tool using Microsoft Excel and VBA.
ReDBoxMetadata Processing
Managing Active Research Data
ReDBox and Mint are two complimentary applications designed to create, store, and provide access to research metadata.
ReNamerFile ManagementReNamer is a very powerful and flexible file renaming tool.
Remove Empty DirectoriesFile ManagementRemoves empty directories
RhashFixityRHash (Recursive Hasher) is a console utility for computing and verifying hash sums of files.
SAFE Archive Audit SystemFixityPolicy-based replication and Auditing of LOCKSS networks.
SRB (The DICE Storage Resource Broker)StorageThe DICE Storage Resource Broker (SRB) supports shared collections that can be distributed across multiple organizations and heterogeneous storage systems.
SSDeepFixity
De-Duplication
Recursive piecewise hashing tool
TavernaWorkflow
Workflow and Lab Notebook Management
Managing Active Research Data
Taverna is a scientific workflow management system designed to assemble, run, document and share sequences sequences of web services and scripts.
TeraCopyTransfer
File Copy
File Management
Performs file copying, whilst also logging and verifying accuracy and completeness by using checksums
The RenameFile ManagementBulk renaming of files
The aDORe FederationStorageThe aDORe Federation is a federated repository framework and reference implementation which aims to address many of the scalability issues experienced by large scale digital object repositories.
TreeMetadata Processing
File Management
Appraisal
Tree displays the directory structure of a path or of the disk in a drive graphically.
TreeSizeFile Management
Appraisal
Manage disk space and scan your hard disks.
UKWA Access APIPersistent IdentificationWeb archives access API
VRenamerFile Management
Metadata Processing
vRenamer is a cross platform tool for batch renaming files
Warc-proxyRendering
File Management
Warc-proxy is a simple tool to view WARC content in Firefox
WarcManagerWeb Crawl
File Management
The WARC Manager is a web-based UI for managing and querying collections of web crawl data.
WebCiteWeb Snapshot
Persistent Identification
WebCite is an on-demand web archiving service that takes snapshots of Internet-accessible digital objects at the behest of users, storing the data on their own servers and assigning unique identifiers to those instances of the material.
WinMergeDe-Duplication
File Management
A visual tool for differencing and merging of file collections, images and texts.
XArchManaging Active Research DataXArch is an archive management system that allows one to create, populate, and query archives of multiple database versions.