<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-GB">
	<id>https://coptr.digipres.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Eve+wrightnrs</id>
	<title>COPTR - User contributions [en-gb]</title>
	<link rel="self" type="application/atom+xml" href="https://coptr.digipres.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Eve+wrightnrs"/>
	<link rel="alternate" type="text/html" href="https://coptr.digipres.org/Special:Contributions/Eve_wrightnrs"/>
	<updated>2026-05-25T09:10:04Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.35.14</generator>
	<entry>
		<id>https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5533</id>
		<title>Brozzler</title>
		<link rel="alternate" type="text/html" href="https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5533"/>
		<updated>2021-12-09T16:16:27Z</updated>

		<summary type="html">&lt;p&gt;Eve wrightnrs: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox tool&lt;br /&gt;
|image=Brozzler-icon (1).png&lt;br /&gt;
|purpose=From GitHub (https://github.com/internetarchive/brozzler):&lt;br /&gt;
Brozzler is a distributed web crawler that uses a real browser (Chrome or Chromium) to fetch pages and embedded URLs and to extract links. &lt;br /&gt;
&lt;br /&gt;
Brozzler is designed to work in conjunction with warcprox for web archiving.&lt;br /&gt;
|homepage=https://github.com/internetarchive/brozzler&lt;br /&gt;
|function=Web Capture&lt;br /&gt;
|content=Web&lt;br /&gt;
}}&lt;br /&gt;
{{Infobox tool details}}&lt;br /&gt;
== Description ==&lt;br /&gt;
Brozzler is a distributed, browser-based web crawler. It was built by the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000343186-What-is-Brozzler-&lt;br /&gt;
&lt;br /&gt;
Brozzler is our newest crawling technology, built at the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
Brozzler differs from Archive-It's &amp;quot;Standard&amp;quot; crawling technology (Heritrix and Umbra) in its reliance on an actual web browser to interact with web content before that content is indexed and archived into WARC files. Instead of following hyperlinks and downloading files, Brozzler records interactions between servers and web browsers as they occur, more closely resembling how a human user would experience the web. It also uses youtube-dl to enhance media capture capabilities. (As of January 2020, both Brozzler and Standard crawls use youtube-dl.)&lt;br /&gt;
&lt;br /&gt;
For more information on how this process works, and the related open-source tools on which it relies, you can review Brozzler’s code and technical documentation in its GitHub repository.&lt;br /&gt;
&lt;br /&gt;
Link to GitHub repository:&lt;br /&gt;
https://github.com/internetarchive/brozzler&lt;br /&gt;
&lt;br /&gt;
== User Experiences ==&lt;br /&gt;
From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000351986-How-and-when-to-use-Brozzler&lt;br /&gt;
&lt;br /&gt;
When to Use Brozzler:&lt;br /&gt;
&lt;br /&gt;
&amp;quot;If dynamic elements on a page were not captured in a Standard crawl or you're seeing a number of error pages when viewing results in Wayback, running a test crawl using Brozzler is a good next step&amp;quot;.&lt;/div&gt;</summary>
		<author><name>Eve wrightnrs</name></author>
	</entry>
	<entry>
		<id>https://coptr.digipres.org/index.php?title=File:Brozzler-icon_(1).png&amp;diff=5532</id>
		<title>File:Brozzler-icon (1).png</title>
		<link rel="alternate" type="text/html" href="https://coptr.digipres.org/index.php?title=File:Brozzler-icon_(1).png&amp;diff=5532"/>
		<updated>2021-12-09T16:15:36Z</updated>

		<summary type="html">&lt;p&gt;Eve wrightnrs: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Eve wrightnrs</name></author>
	</entry>
	<entry>
		<id>https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5531</id>
		<title>Brozzler</title>
		<link rel="alternate" type="text/html" href="https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5531"/>
		<updated>2021-12-09T16:12:52Z</updated>

		<summary type="html">&lt;p&gt;Eve wrightnrs: /* User Experiences */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox tool&lt;br /&gt;
|purpose=From GitHub (https://github.com/internetarchive/brozzler):&lt;br /&gt;
Brozzler is a distributed web crawler that uses a real browser (Chrome or Chromium) to fetch pages and embedded URLs and to extract links. &lt;br /&gt;
&lt;br /&gt;
Brozzler is designed to work in conjunction with warcprox for web archiving.&lt;br /&gt;
|homepage=https://github.com/internetarchive/brozzler&lt;br /&gt;
|function=Web Capture&lt;br /&gt;
|content=Web&lt;br /&gt;
}}&lt;br /&gt;
{{Infobox tool details}}&lt;br /&gt;
== Description ==&lt;br /&gt;
Brozzler is a distributed, browser-based web crawler. It was built by the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000343186-What-is-Brozzler-&lt;br /&gt;
&lt;br /&gt;
Brozzler is our newest crawling technology, built at the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
Brozzler differs from Archive-It's &amp;quot;Standard&amp;quot; crawling technology (Heritrix and Umbra) in its reliance on an actual web browser to interact with web content before that content is indexed and archived into WARC files. Instead of following hyperlinks and downloading files, Brozzler records interactions between servers and web browsers as they occur, more closely resembling how a human user would experience the web. It also uses youtube-dl to enhance media capture capabilities. (As of January 2020, both Brozzler and Standard crawls use youtube-dl.)&lt;br /&gt;
&lt;br /&gt;
For more information on how this process works, and the related open-source tools on which it relies, you can review Brozzler’s code and technical documentation in its GitHub repository.&lt;br /&gt;
&lt;br /&gt;
Link to GitHub repository:&lt;br /&gt;
https://github.com/internetarchive/brozzler&lt;br /&gt;
&lt;br /&gt;
== User Experiences ==&lt;br /&gt;
From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000351986-How-and-when-to-use-Brozzler&lt;br /&gt;
&lt;br /&gt;
When to Use Brozzler:&lt;br /&gt;
&lt;br /&gt;
&amp;quot;If dynamic elements on a page were not captured in a Standard crawl or you're seeing a number of error pages when viewing results in Wayback, running a test crawl using Brozzler is a good next step&amp;quot;.&lt;/div&gt;</summary>
		<author><name>Eve wrightnrs</name></author>
	</entry>
	<entry>
		<id>https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5530</id>
		<title>Brozzler</title>
		<link rel="alternate" type="text/html" href="https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5530"/>
		<updated>2021-12-09T16:12:35Z</updated>

		<summary type="html">&lt;p&gt;Eve wrightnrs: /* Description */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox tool&lt;br /&gt;
|purpose=From GitHub (https://github.com/internetarchive/brozzler):&lt;br /&gt;
Brozzler is a distributed web crawler that uses a real browser (Chrome or Chromium) to fetch pages and embedded URLs and to extract links. &lt;br /&gt;
&lt;br /&gt;
Brozzler is designed to work in conjunction with warcprox for web archiving.&lt;br /&gt;
|homepage=https://github.com/internetarchive/brozzler&lt;br /&gt;
|function=Web Capture&lt;br /&gt;
|content=Web&lt;br /&gt;
}}&lt;br /&gt;
{{Infobox tool details}}&lt;br /&gt;
== Description ==&lt;br /&gt;
Brozzler is a distributed, browser-based web crawler. It was built by the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000343186-What-is-Brozzler-&lt;br /&gt;
&lt;br /&gt;
Brozzler is our newest crawling technology, built at the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
Brozzler differs from Archive-It's &amp;quot;Standard&amp;quot; crawling technology (Heritrix and Umbra) in its reliance on an actual web browser to interact with web content before that content is indexed and archived into WARC files. Instead of following hyperlinks and downloading files, Brozzler records interactions between servers and web browsers as they occur, more closely resembling how a human user would experience the web. It also uses youtube-dl to enhance media capture capabilities. (As of January 2020, both Brozzler and Standard crawls use youtube-dl.)&lt;br /&gt;
&lt;br /&gt;
For more information on how this process works, and the related open-source tools on which it relies, you can review Brozzler’s code and technical documentation in its GitHub repository.&lt;br /&gt;
&lt;br /&gt;
Link to GitHub repository:&lt;br /&gt;
https://github.com/internetarchive/brozzler&lt;br /&gt;
&lt;br /&gt;
== User Experiences ==&lt;br /&gt;
&amp;lt;!-- From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000351986-How-and-when-to-use-Brozzler&lt;br /&gt;
&lt;br /&gt;
When to Use Brozzler:&lt;br /&gt;
&lt;br /&gt;
&amp;quot;If dynamic elements on a page were not captured in a Standard crawl or you're seeing a number of error pages when viewing results in Wayback, running a test crawl using Brozzler is a good next step&amp;quot;. --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Eve wrightnrs</name></author>
	</entry>
	<entry>
		<id>https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5528</id>
		<title>Brozzler</title>
		<link rel="alternate" type="text/html" href="https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5528"/>
		<updated>2021-12-09T16:10:27Z</updated>

		<summary type="html">&lt;p&gt;Eve wrightnrs: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox tool&lt;br /&gt;
|purpose=From GitHub (https://github.com/internetarchive/brozzler):&lt;br /&gt;
Brozzler is a distributed web crawler that uses a real browser (Chrome or Chromium) to fetch pages and embedded URLs and to extract links. &lt;br /&gt;
&lt;br /&gt;
Brozzler is designed to work in conjunction with warcprox for web archiving.&lt;br /&gt;
|homepage=https://github.com/internetarchive/brozzler&lt;br /&gt;
|function=Web Capture&lt;br /&gt;
|content=Web&lt;br /&gt;
}}&lt;br /&gt;
{{Infobox tool details}}&lt;br /&gt;
== Description ==&lt;br /&gt;
&amp;lt;!-- Brozzler is a distributed, browser-based web crawler. It was built by the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000343186-What-is-Brozzler-&lt;br /&gt;
&lt;br /&gt;
Brozzler is our newest crawling technology, built at the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
Brozzler differs from Archive-It's &amp;quot;Standard&amp;quot; crawling technology (Heritrix and Umbra) in its reliance on an actual web browser to interact with web content before that content is indexed and archived into WARC files. Instead of following hyperlinks and downloading files, Brozzler records interactions between servers and web browsers as they occur, more closely resembling how a human user would experience the web. It also uses youtube-dl to enhance media capture capabilities. (As of January 2020, both Brozzler and Standard crawls use youtube-dl.)&lt;br /&gt;
&lt;br /&gt;
For more information on how this process works, and the related open-source tools on which it relies, you can review Brozzler’s code and technical documentation in its GitHub repository.&lt;br /&gt;
&lt;br /&gt;
Link to GitHub repository:&lt;br /&gt;
https://github.com/internetarchive/brozzler --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== User Experiences ==&lt;br /&gt;
&amp;lt;!-- From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000351986-How-and-when-to-use-Brozzler&lt;br /&gt;
&lt;br /&gt;
When to Use Brozzler:&lt;br /&gt;
&lt;br /&gt;
&amp;quot;If dynamic elements on a page were not captured in a Standard crawl or you're seeing a number of error pages when viewing results in Wayback, running a test crawl using Brozzler is a good next step&amp;quot;. --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Eve wrightnrs</name></author>
	</entry>
	<entry>
		<id>https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5525</id>
		<title>Brozzler</title>
		<link rel="alternate" type="text/html" href="https://coptr.digipres.org/index.php?title=Brozzler&amp;diff=5525"/>
		<updated>2021-12-09T16:03:16Z</updated>

		<summary type="html">&lt;p&gt;Eve wrightnrs: Created page with &amp;quot;{{Infobox tool |purpose=From GitHub (https://github.com/internetarchive/brozzler):  Brozzler is a distributed web crawler that uses a real browser (Chrome or Chromium) to fetc...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox tool&lt;br /&gt;
|purpose=From GitHub (https://github.com/internetarchive/brozzler):&lt;br /&gt;
Brozzler is a distributed web crawler that uses a real browser (Chrome or Chromium) to fetch pages and embedded URLs and to extract links. &lt;br /&gt;
&lt;br /&gt;
Brozzler is designed to work in conjunction with warcprox for web archiving.&lt;br /&gt;
|homepage=https://github.com/internetarchive/brozzler&lt;br /&gt;
|function=Web Capture&lt;br /&gt;
|content=Web&lt;br /&gt;
}}&lt;br /&gt;
{{Infobox tool details}}&lt;br /&gt;
== Description ==&lt;br /&gt;
&amp;lt;!-- Brozzler is a distributed, browser-based web crawler. It was built by the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000343186-What-is-Brozzler-&lt;br /&gt;
&lt;br /&gt;
Brozzler is our newest crawling technology, built at the Internet Archive.&lt;br /&gt;
&lt;br /&gt;
Brozzler differs from Archive-It's &amp;quot;Standard&amp;quot; crawling technology (Heritrix and Umbra) in its reliance on an actual web browser to interact with web content before that content is indexed and archived into WARC files. Instead of following hyperlinks and downloading files, Brozzler records interactions between servers and web browsers as they occur, more closely resembling how a human user would experience the web. It also uses youtube-dl to enhance media capture capabilities. (As of January 2020, both Brozzler and Standard crawls use youtube-dl.)&lt;br /&gt;
&lt;br /&gt;
For more information on how this process works, and the related open-source tools on which it relies, you can review Brozzler’s code and technical documentation in its GitHub repository.&lt;br /&gt;
&lt;br /&gt;
Link to GitHub repository:&lt;br /&gt;
https://github.com/internetarchive/brozzler&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== User Experiences ==&lt;br /&gt;
&amp;lt;!-- From Internet Archive: https://support.archive-it.org/hc/en-us/articles/360000351986-How-and-when-to-use-Brozzler&lt;br /&gt;
&lt;br /&gt;
When to Use Brozzler:&lt;br /&gt;
&lt;br /&gt;
&amp;quot;If dynamic elements on a page were not captured in a Standard crawl or you're seeing a number of error pages when viewing results in Wayback, running a test crawl using Brozzler is a good next step&amp;quot;. --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Eve wrightnrs</name></author>
	</entry>
</feed>