Difference between revisions of "User:COPTR Bot"
Jump to navigation
Jump to search
(3 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
− | The COPTR Bot is a robot account used for importing and checking COPTR content. The source code used is [https://github.com/ | + | The COPTR Bot is a robot account used for importing and checking COPTR content. The source code used is [https://github.com/digipres/coptr here on GitHub]. |
== Phase 1 - Upload == | == Phase 1 - Upload == | ||
At first, COPTR Bot has been used to populate the Wiki based on an Excel spreadsheet that contained the manually normalised and de-duplicated content of the original source registries. | At first, COPTR Bot has been used to populate the Wiki based on an Excel spreadsheet that contained the manually normalised and de-duplicated content of the original source registries. | ||
− | == Phase 2 - Sentinel == | + | == Phase 2 - Projections == |
+ | The next goal is to use the structured COPTR data to project summary tables, similar to the [http://digitalpowrr.niu.edu/ POWRR project] [http://digitalpowrr.niu.edu/tool-grid/ Tool Grid]. | ||
+ | |||
+ | == Phase ? - Ideas... == | ||
+ | |||
+ | === Sentinel === | ||
The next phase should be to periodically scan pages and flag or fix known issues. These may include: | The next phase should be to periodically scan pages and flag or fix known issues. These may include: | ||
* Converting old-style H1-level pages down to H2. | * Converting old-style H1-level pages down to H2. |
Latest revision as of 19:39, 26 September 2014
The COPTR Bot is a robot account used for importing and checking COPTR content. The source code used is here on GitHub.
Phase 1 - Upload[edit]
At first, COPTR Bot has been used to populate the Wiki based on an Excel spreadsheet that contained the manually normalised and de-duplicated content of the original source registries.
Phase 2 - Projections[edit]
The next goal is to use the structured COPTR data to project summary tables, similar to the POWRR project Tool Grid.
Phase ? - Ideas...[edit]
Sentinel[edit]
The next phase should be to periodically scan pages and flag or fix known issues. These may include:
- Converting old-style H1-level pages down to H2.
- Updating changes to category names.
- Checking if the Description is just the same text as the Purpose and flagging the entries to be fixed (using a category as the flag).
c.f. http://coptr.digipres.org/Talk:Main_Page
And other ideas:
- Checking the page structure is as expected, and the infobox fields are filled.
- Checking links are valid.
- Extracting links to ensure they are in web archives.
- Adding inline references to web archive holdings.