Workflow:Browsertrix-crawler Workflow

From COPTR
Revision as of 16:35, 9 December 2021 by MichaelTobinUKGWA (talk | contribs) (UKGWA workflow for capturing with Browsertrix-crawler)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search
Browsertrix-crawler Workflow
Status:Experimental
Tools:
Input:Website
Output:WARC file
Organisation:UK Government Web Archive

Workflow Description

Flowchart workflow for capturing a website with Browsertrix-crawler


The workflow involves the decision to capture a website with Browsertrix-crawler. It shows the iterative process of crawling a page with Browsertrix, QAing the results in Conifer and recrawling with adjusted settings.

Purpose, Context and Content

Evaluation/Review

Further Information