Semantic search

Tool | Function | Purpose
IsoBuster | Disk Imaging | Recover data from CD, DVD, BD, HDD, Flash drive, USB stick, media card, SD and SSD.
Kepler | Workflow and Lab Notebook Management; Managing Active Research Data | Kepler is a scientific workflow modelling and management system that enables users, regardless of programming experience, to set up data analysis pipelines.
Khtml2png | Web Capture | khtml2png is a command line program to create screenshots of webpages.
Kraken | OCR | Open Source turn-key OCR system forked from ocropus
KryoFlux | Disk Imaging | Floppy disk controller software that accompanies a KryoFlux drive
LabTrove | Workflow and Lab Notebook Management; Managing Active Research Data | LabTrove is a blogging platform specifically designed for use in a research environment.
Library (xklb) | File Management; Quality Assurance; Web Capture | Media indexing multi-tool
Limb Processing | Metadata Processing; OCR | Software for processing, enhancing and converting cultural heritage into digital cultural heritage
Lunas | Storage; File Copy | A syncing cli tool that can handle more than two directories locally and remotely
MailExtract | Transfer | Extract Emails from many kinds of Mailbox formats
Metaproducts | Web Capture | Metaproducts offers several commercial capture and off-line browsing tools.
Micr'Olonys | Storage; Access; Passive Data Storage; Backup; Transfer; File Recovery | Micr’Olonys is the software solution for long-term passive digital archiving on film and paper.
MyExperiment | Workflow and Lab Notebook Management; Managing Active Research Data; Academic Social Networking; Workflow | myExperiment is an online social networking service aimed at scientific researchers; the site fosters collaboration by allowing members to share scientific workflows, experiment plans, and other digital objects.
NetarchiveSuite | Web Capture | NetarchiveSuite is a web archiving software package designed to plan, schedule and run web harvests of parts of the Internet.
NumaHOP | Quality Assurance; OCR | Platform for digitization projects management
NutchWAX | Web Capture | NutchWAX is software for indexing ARC files (archived Web sites gathered using Heritrix) for full text search.
OSFMount | Disk Imaging; Forensic | disk image file mounting
Optical-media-check | Disk Imaging | Collates information into a CSV from log files for a batch optical media rip
Package Handler | File Management; Metadata Processing; Personal Archiving; Validation; Appraisal | View, create, edit, and validate Swiss archival packages
PageVault | Web Capture | pageVault supports the archiving of all unique responses generated by a web server.
Paranoia | Disk Imaging | "Use your CDROM drive to read audio tracks.... and have it actually work right!"
Pearl Crescent Page Saver | Web Capture | Pearl Crescent Page Saver is an extension for Mozilla Firefox that lets you capture images of web pages, including Flash content.
PhotoRescue | Disk Imaging; File Recovery | PhotoRescue is a picture and data recovery solution for digital film - sd cards, compact flash, memory sticks, microdrive, etc.
Power ISO | Disk Imaging | PowerISO is a powerful CD/DVD image file processing tool, which allows you to open, extract, create, edit, compress, encrypt, split and convert ISO files, and mount these files with internal virtual drive.
QPxTool | Disk Imaging | With QPxTool you can measure the quality of CDs and DVDs.
RARC (ARC replicator) | Web Capture | rARC is a distributed system that enables Internet users to provide storage space from their computers to replicate small parts of the archived data stored in the central repository of the Web archive.
RATOM | Appraisal; Discovery; Metadata Extraction | Review, Appraisal, and Triage of Mail (RATOM) is software to assist archives and other collecting organizations with email analysis, selection, and appraisal tasks
RODA-In | Transfer; Fixity; Appraisal | The tool creates SIPs from files and folders available on the local file system.
Rocfl | File Copy; Storage | rocfl is a command line utility for interacting with OCFL repositories on the local filesystem or in S3.
SPARQLing Unicorn QGIS Plugin | Data capture and Deposit; Discovery; Web Capture; File Format Migration | Plugin for QGIS. Fetches data from Wikidata and other Linked Data SPARQL endpoints and adds a new layer in a QGIS project. Just insert a SPARQL query for Geo-Items and get a new vector layer into QGIS.
SafeBack | Disk Imaging | SafeBack is used to create mirror-image (bit-stream) backup files of hard disks or to make a mirror-image copy of an entire hard disk drive or partition.
SafeMover | Data capture and Deposit; Transfer; Fixity | Python tool to support the overtly "safe" copying of files from one location to another. Uses fixity, and OS file system metadata.
Screen-scraper | Data capture and Deposit; Web Capture | screen-scraper is a tool for extracting data from websites.
SiteStory | Web Capture | SiteStory is a transactional web archive. It archives resources of a web server it is associated with.
Snagit | Data capture and Deposit | Snagit is screen capture software to create interesting training documents, collaborative design work, IT bug reports, and more.
Spadix software | Web Capture | Spadix Software can download websites from a starting URL, search engine results or web dirs, and is able to follow external links.
Storytracker | Web Capture | Tools for tracking stories on news homepages
Sumfolder1 | Appraisal; De-Duplication; Fixity | sumfolder1 is a utility for use within the archival and digital preservation community to generate checksums for file system directories, and to generate an overall "collection" checksum for a given set of files. The utility may be used in support of de-duplication at a directory/folder level.
TOMES (Transforming Online Mail with Embedded Semantics) | File Format Migration; Content Profiling; Metadata Processing; Data capture and Deposit | A package of open source tools for handling the preservation of government email records
Tabula | Data capture and Deposit | Extract tabular data from PDF files
Taverna | Workflow; Workflow and Lab Notebook Management; Managing Active Research Data | Taverna is a scientific workflow management system designed to assemble, run, document and share sequences of web services and scripts.
Teleport | Web Capture | Teleport is a web crawling tool that enables offline browsing
TeraCopy | Transfer; File Copy; File Management | Performs file copying, whilst also logging and verifying accuracy and completeness by using checksums
Tesseract-ocr | OCR | Open source OCR engine, accepting uncompressed TIFF files as input
The DeDuplicator (Heritrix add-on module) | De-Duplication; Web Capture | The DeDuplicator is an add-on module for Heritrix to reduce the amount of duplicate data collected in a series of snapshot crawls.
Tree | Metadata Processing; File Management; Appraisal | Tree displays the directory structure of a path or of the disk in a drive graphically.
TreeSize | File Management; Appraisal | Manage disk space and scan your hard disks.
TubeKit | Web Capture | TubeKit is a toolkit for creating YouTube crawlers.
Tufts Submission-Agreement Builder Tool | Planning; Data capture and Deposit | SABT is a web-based tool that guides records creators and records managers through the process of creating submission agreements, both for single transfers and for standing submissions.
UKWA GSuite Add-On | Appraisal; Validation | GSuite functions for people working with web archives. The functions use the Memento API (specifically the TimeGate) to look up whether a given archive holds a given URL. It currently supports checks against:

  • UK Web Archive
  • UK Government Web Archive
  • Internet Archive
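
The TimeGate lookup mentioned for the UKWA GSuite Add-On can be illustrated with a short Python sketch. This is not the add-on's own code; it only shows the general Memento pattern (RFC 7089), in which a client sends an Accept-Datetime header to a TimeGate and reads the redirect it gets back. The function names and the TimeGate prefix https://timegate.example.org/ are hypothetical; the real prefix for each archive has to be taken from that archive's documentation. Treating a 302 redirect as "held" and a 404 as "not held" is a simplifying assumption.

```python
from datetime import datetime, timezone
from email.utils import format_datetime
import urllib.error
import urllib.request


def build_timegate_request(timegate_base, url, when):
    """Build a HEAD request asking a TimeGate for a capture of `url` near `when`.

    `timegate_base` is an assumption: each archive publishes its own TimeGate
    prefix, and the target URL is appended to it. `when` must be a
    timezone-aware datetime in UTC.
    """
    request = urllib.request.Request(timegate_base + url, method="HEAD")
    # Memento datetime negotiation uses an HTTP-style date in GMT.
    request.add_header("Accept-Datetime", format_datetime(when, usegmt=True))
    return request


def interpret_timegate_response(status, headers):
    """Return True if held, False if not held, None if the answer is unclear."""
    header_names = {name.lower() for name in headers.keys()}
    if status == 302 and "location" in header_names:
        return True   # redirect to a memento: the archive holds the URL
    if status == 404:
        return False  # the TimeGate knows no memento for this URL
    return None       # errors and other response patterns are inconclusive


class _NoRedirect(urllib.request.HTTPRedirectHandler):
    """Stop urllib from following the TimeGate redirect so it can be inspected."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        return None


def archive_holds(timegate_base, url, when):
    """Perform the lookup over the network and interpret the result."""
    opener = urllib.request.build_opener(_NoRedirect)
    request = build_timegate_request(timegate_base, url, when)
    try:
        with opener.open(request, timeout=30) as response:
            return interpret_timegate_response(response.status, response.headers)
    except urllib.error.HTTPError as error:
        # With redirects disabled, 302 and 404 both surface as HTTPError.
        return interpret_timegate_response(error.code, error.headers)
```

A call such as archive_holds("https://timegate.example.org/", "http://example.com/", datetime(2015, 1, 1, tzinfo=timezone.utc)) would then run one check per archive. Keeping the request building and response interpretation separate from the network call makes both easy to verify without contacting any archive.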