Difference between revisions of "Create or Receive (Acquire)"

From COPTR
Jump to navigation Jump to search
(Created page with "{{Infobox stage |definition=Functions that support the DCC Lifecycle Stage defined as "Create data including administrative, descriptive, structural and technical metadata. Pr...")
(No difference)

Revision as of 10:13, 20 April 2021

Lifecycle stage definition: Functions that support the DCC Lifecycle Stage defined as "Create data including administrative, descriptive, structural and technical metadata. Preservation metadata may also be added at the time of creation. Receive data, in accordance with documented collecting policies, from data creators, other archives, repositories or data centres, and if required assign appropriate metadata."
Lifecycle order: Missing order

Functions within this lifecycle stage

FuntionDefinition
AppraisalTools that enable the assessment of content against in order to decide on it's relevance or appropriateness for preservation
Data capture and DepositTools that enable the capture and deposit of data.
Disk ImagingTools that enable the capture, viewing or extraction of contents of a disk image (which is a computer file containing the contents and structure of a disk volume or an entire data storage device, such as a hard drive or floppy disk).
File CopyTools that support the copying of files from one storage location to another, typically with facilities to verify the completeness of the copy and enable resumption of copying after an interruption.
OCRTools that support the generation of text from bitmap images, otherwise known as Optical Character Recognition
TransferTools that support transfer of packaged digital resources from one organization to another.
Web CaptureTools that support the capture of data from the world wide web, by "crawling" links between resources or other approaches.
Workflow and Lab Notebook ManagementTools that support the capture and management of research data as well as the details of the research activities which generated them.

Tools for this lifecycle stage

ToolFunctionPurpose
7-ZipFixity
Rendering
Transfer
7-Zip is a file archiver with a high compression ratio, and encryption and fixity check capabilities
AFF Open Source Computer Forensics SoftwareDisk ImagingTools for the creation of disk images, used in conjunction with the AFF open and extensible file format to store disk images and associated metadata.
ANTS (Archives Network Transfer System)TransferANTS runs on a Windows desktop and is designed to package digital records with contextual metadata and transfer them to an institutional archives.
Aaru Data Preservation SuiteBackup
Disk Imaging
Metadata Extraction
Media dump software and disc image manager
ApplesauceDisk ImagingStandalone disk image analysis/repair/conversion tool.
ArchiFiltreFile Management
Appraisal
Overview of folder trees with fine diagrams
Archifiltre-MailsAnnotation
Data capture and Deposit
Metadata Processing
Transfer
Appraisal
Archifiltre-Mails connects to email containers and visualizes their content, helping you in exploring and adding metadata.
Archive-ItService
Web Capture
Archive-It is the leading web archiving service for collecting and accessing cultural heritage on the web. It is a service provided by the Internet Archive.
Archive::BagItFixity
File Copy
BagIt API for Perl
ArchiveBoxPersonal Archiving
Web Capture
ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data.
ArchiveFacebookWeb CaptureArchiveFacebook is a Firefox extension which allows individuals to save and manage Facebook web content.
ArtivityData capture and Deposit
Workflow and Lab Notebook Management
A tool for capturing contextual data produced during the creative process of artists and designers while working on a computer.
Autopsy Digital ForensicsContent Profiling
De-Duplication
Disk Imaging
Forensic
Appraisal
Open source, free digital forensics tool
BIL (BagIt Library)Fixity
File Copy
BagIt Library is a Java software library that supports the creation, manipulation and validation of bags.
BagIt Transfer UtilitiesFixity
File Copy
BagIt transfer Utilities are a collection of tools developed for the purpose of validation and transfer of bags.
BaggerFixity
Transfer
GUI application to facilitate the creation and verification of BagIt bags.
BrozzlerWeb CaptureFrom GitHub (https://github.com/internetarchive/brozzler):

Brozzler is a distributed web crawler that uses a real browser (Chrome or Chromium) to fetch pages and embedded URLs and to extract links.

Brozzler is designed to work in conjunction with warcprox for web archiving.
BrunnhildeContent Profiling
Metadata Extraction
Appraisal
Siegfried-based characterization of directories and disk images
CDRDAO (CDR Disk At Once)Disk ImagingCdrdao records audio or data CD-Rs in disk-at-once (DAO) mode based on a textual description of the CD contents.
CINCHWeb CaptureCINCH (Capture INgest and CHecksum Tool) facilitates batch downloading and ingest of Internet-accessible documents and/or images to a central repository.
CND3DActive Data Storage
Data capture and Deposit
Managing Active Research Data
Storage
Store, Preserve and publish 3D objects produced in Digital Humanities Research
CRunchManaging Active Research Data
Workflow and Lab Notebook Management
cRunch provides an infrastructure for exploratory data analysis with the statistical programming language and environment R
CloneCDDisk ImagingCloneCD is the perfect tool to make backup copies of your music and data CDs, regardless of copy protection.
ContextMinerMetadata Processing
Web Capture
ContextMiner is a framework to collect, analyze, and present the contextual information along with the data.
Cp Unix commandFile Copycp copies files (or, optionally, directories). Part of GNU coreutils.
CryptcatFile CopyCryptcat is a lightweight version of netcat with integrated transport encryption capabilities.
Curate.UsWeb CaptureWith a simple click of the mouse, you can create visually compelling clips and quotes of web content that are easily embedded in blog posts, email, forums, and websites.
CzkawkaDe-Duplication
File Management
Appraisal
Czkawka is a simple, fast and free app to remove unnecessary files from your computer.
DART (Digital Archivist's Resource Tool)Fixity
File Management
Storage
Transfer
Provides both a GUI and a command-line interface for packaging files and uploading them to remote repositories.
DArcMailAccess
Data capture and Deposit
Appraisal
Processing and access to email accounts
DIMAG IngestListMetadata Extraction
Transfer
Accompanies ingest process from donor to archive, logs process steps.
Dc3dd for computer forensicsDisk Imaging
Forensic
dc3dd is a patched version of GNU dd with a number of features useful for computer forensics.
DcflddFile Copy
File Management
Forensic
dcfldd is an enhanced version of GNU dd with features useful for forensics and security.
Dd Unix commandFile CopyThis page gives information on using the dd Unix command.
Dd rescueDisk Imaging
File Recovery
dd_rescue is suitable for rescuing data from a medium with errors, i.
DeepArcFile Format Migration
Web Capture
Intended for preserving web sites from the back-end, this is a database-to-XML curation tool.
DiPS (Digital Preservation Solution)Secure Deletion
Access
Active Data Storage
Validation
File Format Identification
File Format Migration
File Management
Metadata Extraction
Preservation System
Storage
Transfer
Workflow
Service
DiPS (OAIS compliant Digital Preservation Solution)
Disk2FDIBackup
Disk Imaging
Forensic
Personal Archiving
Disk2FDI is a professional disk imaging software designed to create binary images of floppy disks to the Formatted Disk Image (FDI) file format, as well as sector-based standard formats.
DiskFormatIDDisk Imaging
File Format Identification
Identify floppy disk formats from kryoflux stream files
DisktypeDisk Imaging
Metadata Extraction
Tool for detecting the content format of a disk or disk image. It knows about common file systems, partition tables, and boot codes.
Docuteam packerFixity
Data capture and Deposit
File Management
Metadata Processing
Appraisal
Creates and edits SIPs
DocworksOCR
Quality Assurance
Workflow
Document digitization workflow software
Double CommanderFixity
De-Duplication
File Copy
File Management
Batch Rename
Open source file manager with two panels side by side
DriveImage XMLDisk ImagingDriveImage XML is an easy to use and reliable program for imaging and backing up partitions and logical drives.
Duke Data AccessionerValidation
File Copy
File Format Identification
Metadata Extraction
Transfer
Data Accessioner provides a graphical user interface to aid in migrating data from physical media to a dedicated file server, documenting the process and using MD5 checksums to identify any errors introduced in transfer.
EPADDAccess
Content Profiling
Metadata Extraction
Metadata Processing
Appraisal
ePADD is a software package developed by Stanford University's Special Collections & University Archives that supports archival processes around the appraisal, ingest, processing, discovery, and delivery of email archives.
Easy CD-DA ExtractorDisk Imaging
File Format Migration
Metadata Extraction
Easy CD-DA Extractor is CD Ripper, Music Converter, Audio Converter, Metadata Editor, and CD/DVD burning software.
Exact Audio CopyDisk Imaging
File Format Migration
Metadata Extraction
Exact Audio Copy is an audio grabber for audio CDs using standard CD and DVD-ROM drives on Windows only.
ExactlyTransferPacks data in BagIt bags and transfers them to/from remote location via FTP, SFTP
FC5025Data capture and DepositDevice Side Data's FC5025 USB 5.25" floppy controller plugs into any computer's USB port and enables you to attach a 5.25" floppy drive.
FCIVFixity
Transfer
Generates and compares MD5 values stored in an XML file.
Find It! Keep It!Web CaptureFind It! Keep It! is a tool to save and organise web content.
FreeCommanderFixity
De-Duplication
File Copy
File Management
Split-screen file manager with desirable extras
GImageReaderOCRA customisable GUI for Tesseract
GNU WgetWeb CaptureNon-interactive network downloader
GetDriveInfo2Disk ImagingGetDriveInfo2 is a Win32 program that examines the optical and removable media drives currently mounted on a computer, and returns information about those devices (in the case of optical devices it also returns information about the any media currently mounted in the device).
GoobiOCR
Planning
Quality Assurance
Workflow
Workflow Management Tool
GreaseweazleDisk ImagingTools for accessing a floppy drive at the raw flux level.
HTTrackWeb CaptureHTTrack is a website copying utility.
HeritrixWeb CaptureHeritrix is an open-source web crawler, allowing users to target websites they wish to include in a collection and to harvest an instance of each site.
Heritrix plug-in for rich media captureWeb CaptureThe Rich Media Capture module (RMC), developed in the LiWA (Living Web Archives) project, is designed to enhance the capturing capabilities of the crawler, with regards to different multimedia content types.
HxC Floppy Emulator toolkitAppraisal
Rendering
Forensic
Provides comprehensive support for working with floppy disk images—importing and converting multiple formats, analyzing streams, creating custom layouts, and performing low-level disk inspections.
IMAGEDisk ImagingIMAGE is a DOS application capable of generating either highly compressed or "flat" images for forensic analysis.
IMacrosQuality Assurance
Web Capture
iMacros makes it easy to test web-based applications.
IsoBusterDisk ImagingRecover data from CD, DVD, BD, HDD, Flash drive, USB stick, media card, SD and SSD.
KeplerManaging Active Research Data
Workflow and Lab Notebook Management
Kepler is a scientific workflow modelling and management system that enables users, regardless of programming experience, to set up data analysis pipelines.
Khtml2pngWeb Capturekhtml2png is a command line program to create screenshots of webpages.
KrakenOCROpen Source turn-key OCR system forked from ocropus
KryoFluxDisk ImagingFloppy disk controller software that accompanies a KryoFlux drive
LabTroveManaging Active Research Data
Workflow and Lab Notebook Management
LabTrove is a blogging platform specifically designed for use in a research environment.
Library (xklb)File Management
Quality Assurance
Web Capture
Media indexing multi-tool with more than 70 CLI subcommands
Limb ProcessingMetadata Processing
OCR
Software for processing, enhancing and converting cultural heritage into digital cultural heritage
MailExtractTransferExtract Emails from many kinds of Mailbox formats
MakeStaticSiteWeb Capture
Personal Archiving
Data capture and Deposit
Managing Active Research Data
Rendering
MakeStaticSite is a command-line tool for generating and deploying refined static versions of both existing websites — typically dynamic, i.e. depend on server-side scripting and databases — and sites archived by the Wayback Machine.
MetaproductsWeb CaptureMetaproducts offers several commercial capture and off-line browsing tools.
Micr'OlonysAccess
Backup
File Recovery
Storage
Transfer
Passive Data Storage
Micr’Olonys is the software solution for long-term passive digital archiving on film and paper.
MyExperimentAcademic Social Networking
Managing Active Research Data
Workflow
Workflow and Lab Notebook Management
myExperiment is an online social networking service aimed at scientific researchers; the site fosters collaboration by allowing members to share scientific workflows, experiment plans, and other digital objects.
NetarchiveSuiteWeb CaptureNetarchiveSuite is a web archiving software package designed to plan, schedule and run web harvests of parts of the Internet.
NumaHOPOCR
Quality Assurance
Platform for digitization projects management
NutchWAXWeb CaptureNutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search.
OSFMountDisk Imaging
Forensic
disk image file mounting
Optical-media-checkDisk ImagingCollates information into a CSV from log files for a batch optical media rip
Package HandlerValidation
File Management
Metadata Processing
Personal Archiving
Appraisal
View, create, edit, and validate Swiss archival packages
PageVaultWeb CapturepageVault supports the archiving of all unique responses generated by a web server.
ParanoiaDisk Imaging"Use your CDROM drive to read audio tracks.... and have it actually work right!"
Pearl Crescent Page SaverWeb CapturePearl Crescent Page Saver is an extension for Mozilla Firefox that lets you capture images of web pages, including Flash content.
PhotoRescueDisk Imaging
File Recovery
PhotoRescue is a picture and data recovery solution for digital film - sd cards, compact flash, memory sticks, microdrive, etc.
Power ISODisk ImagingPowerISO is a powerful CD/DVD image file processing tool, which allows you to open, extract, create, edit, compress, encrypt, split and convert ISO files, and mount these files with internal virtual drive.
QPxToolDisk ImagingWith QPxTool you can measure the quality of CDs and DVDs.
RARC (ARC replicator)Web CapturerARC is a distributed system that enables Internet users to provide storage space from their computers to replicate small parts of the archived data stored in the central repository of the Web archive.
RATOMDiscovery
Metadata Extraction
Appraisal
Review, Appraisal, and Triage of Mail (RATOM) is software to assist archives and other collecting organizations with email analysis, selection, and appraisal tasks
RODA-InFixity
Transfer
Appraisal
The tool creates SIPs from files and folders available on the local file system.
RocflFile Copy
Storage
rocfl is a command line utility for interacting with OCFL repositories on the local filesystem or in S3.
SPARQLing Unicorn QGIS PluginData capture and Deposit
Discovery
File Format Migration
Web Capture
Plugin for QGIS. Fetches data from Wikidata and other Linked Data SPARQL endpoints and adds a new layer in a QGIS project. Just insert a SPARQL query for Geo-Items and get a new vector layer into QGIS.
SafeBackDisk ImagingSafeBack is used to create mirror-image (bit-stream) backup files of hard disks or to make a mirror-image copy of an entire hard disk drive or partition.
SafeMoverFixity
Data capture and Deposit
Transfer
Python tool to support the overtly "safe" copying of files from one location to another. Uses fixity, and OS file system metadata.
Screen-scraperData capture and Deposit
Web Capture
screen-scraper is a tool for extracting data from websites.
SiteStoryWeb CaptureSiteStory is a transactional web archive. It archives resources of a web server it is associated with.
SnagitData capture and DepositSnagit is screen capture software to create interesting training documents, collaborative design work, IT bug reports, and more.
Spadix softwareWeb CaptureSpadix Software can download websites from a starting URL, search engine results or web dirs, and is able to follow external links.
... further results