Editing Brozzler
Jump to navigation
Jump to search
Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.
The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
Latest revision | Your text | ||
Line 1: | Line 1: | ||
{{Infobox tool | {{Infobox tool | ||
− | |||
|purpose=From GitHub (https://github.com/internetarchive/brozzler): | |purpose=From GitHub (https://github.com/internetarchive/brozzler): | ||
Brozzler is a distributed web crawler that uses a real browser (Chrome or Chromium) to fetch pages and embedded URLs and to extract links. | Brozzler is a distributed web crawler that uses a real browser (Chrome or Chromium) to fetch pages and embedded URLs and to extract links. | ||
Line 11: | Line 10: | ||
{{Infobox tool details}} | {{Infobox tool details}} | ||
== Description == | == Description == | ||
− | Brozzler is a distributed browser based web crawler. It was built by the Internet Archive. | + | <!-- Brozzler is a distributed browser based web crawler. It was built by the Internet Archive. |
From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000343186-What-is-Brozzler- | From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000343186-What-is-Brozzler- | ||
Line 22: | Line 21: | ||
Link to GitHub rep: | Link to GitHub rep: | ||
− | https://github.com/internetarchive/brozzler | + | https://github.com/internetarchive/brozzler --> |
== User Experiences == | == User Experiences == | ||
− | From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000351986-How-and-when-to-use-Brozzler | + | <!-- From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000351986-How-and-when-to-use-Brozzler |
When to Use Brozzler: | When to Use Brozzler: | ||
− | "If dynamic elements on a page were not captured in a Standard crawl or you're seeing a number of error pages when viewing results in Wayback, running a test crawl using Brozzler is a good next step". | + | "If dynamic elements on a page were not captured in a Standard crawl or you're seeing a number of error pages when viewing results in Wayback, running a test crawl using Brozzler is a good next step". --> |