User:COPTR Bot
Latest revision as of 19:39, 26 September 2014
The COPTR Bot is a robot account used for importing and checking COPTR content. The source code is available on GitHub at https://github.com/digipres/coptr.
Phase 1 - Upload
Initially, COPTR Bot was used to populate the wiki from an Excel spreadsheet containing the manually normalised and de-duplicated content of the original source registries.
Phase 2 - Projections
The next goal is to use the structured COPTR data to project summary tables, similar to the POWRR project (http://digitalpowrr.niu.edu/) Tool Grid (http://digitalpowrr.niu.edu/tool-grid/).
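As a rough illustration of what such a projection could look like, the sketch below renders a tools-by-function grid as MediaWiki table markup. It is not taken from the bot's source code: the record layout (a "name" string and a "functions" set per tool) and the function name project_tool_grid are assumptions made for this example only.

```python
def project_tool_grid(tools, columns):
    """Render a tools-by-function grid as MediaWiki table markup.

    `tools` is a list of dicts with hypothetical keys "name" and
    "functions"; `columns` is the ordered list of functions to show.
    """
    lines = ['{| class="wikitable"', "! Tool !! " + " !! ".join(columns)]
    # One row per tool, sorted case-insensitively by name.
    for tool in sorted(tools, key=lambda t: t["name"].lower()):
        cells = ["X" if col in tool["functions"] else "" for col in columns]
        lines.append("|-")
        lines.append("| [[" + tool["name"] + "]] || " + " || ".join(cells))
    lines.append("|}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up example records, not real COPTR data.
    example = [
        {"name": "Tool B", "functions": {"Ingest"}},
        {"name": "Tool A", "functions": {"Ingest", "Storage"}},
    ]
    print(project_tool_grid(example, ["Ingest", "Storage"]))
```

Keeping the projection a pure function from records to wikitext would let the same data feed several different summary tables.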
Phase ? - Ideas...
Sentinel
The next phase should be to periodically scan pages and flag or fix known issues. These may include:
- Converting old-style pages that use H1-level headings down to H2.
- Updating pages to reflect changes to category names.
- Checking if the Description is just the same text as the Purpose and flagging the entries to be fixed (using a category as the flag).
cf. http://coptr.digipres.org/Talk:Main_Page
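Two of the checks listed above can be sketched as small text transformations. This is a minimal sketch, not the bot's actual code. It assumes that "converting H1 down to H2" means demoting every heading by one level on pages that contain a level-1 heading, that the Description and Purpose texts have already been extracted from the page by some other step, and that the flag category is called "Description needs fixing" (a hypothetical name).

```python
import re

# Matches a wikitext heading line such as "= Title =" or "== Title ==".
HEADING = re.compile(r"^(=+)[ \t]*(.+?)[ \t]*\1[ \t]*$", re.MULTILINE)


def demote_h1_pages(wikitext):
    """Demote all headings by one level if the page uses any H1 heading."""
    levels = [len(m.group(1)) for m in HEADING.finditer(wikitext)]
    if 1 not in levels:
        return wikitext  # already H2-based, leave untouched

    def bump(match):
        marks = "=" * min(len(match.group(1)) + 1, 6)  # wikitext stops at H6
        return f"{marks} {match.group(2)} {marks}"

    return HEADING.sub(bump, wikitext)


def normalise(text):
    """Collapse whitespace and case so trivial differences are ignored."""
    return re.sub(r"\s+", " ", text).strip().lower()


def flag_duplicate_description(wikitext, description, purpose,
                               category="Description needs fixing"):
    """Append a flag category when Description merely repeats Purpose."""
    tag = f"[[Category:{category}]]"  # hypothetical category name
    same = normalise(description) and normalise(description) == normalise(purpose)
    if same and tag not in wikitext:
        return wikitext.rstrip("\n") + "\n" + tag + "\n"
    return wikitext
```

Both functions return the page text unchanged when there is nothing to fix, so a periodic scan would only need to save pages whose text actually differs.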
And other ideas:
- Checking the page structure is as expected, and the infobox fields are filled.
- Checking links are valid.
- Extracting links to ensure they are in web archives.
- Adding inline references to web archive holdings.
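The link-related ideas could start from something like the sketch below, which pulls external links out of wikitext and prepares a lookup against a web archive. The choice of the Internet Archive's Wayback Machine availability endpoint is an assumption made for illustration, as the page does not say which web archives are meant. The function names are hypothetical, and no network request is made here: closest_snapshot only interprets a response that has already been fetched and decoded from JSON.

```python
import re
from urllib.parse import urlencode

# External URLs in wikitext, stopping at whitespace and link/template syntax.
EXTERNAL_LINK = re.compile(r"https?://[^\s\[\]<>\"|}]+")


def extract_external_links(wikitext):
    """Return the distinct external URLs in a page, in order of appearance."""
    seen, links = set(), []
    for url in EXTERNAL_LINK.findall(wikitext):
        url = url.rstrip(".,;:)")  # drop trailing sentence punctuation
        if url not in seen:
            seen.add(url)
            links.append(url)
    return links


def availability_query(url):
    """Build a Wayback Machine availability lookup URL (assumed archive)."""
    return "https://archive.org/wayback/available?" + urlencode({"url": url})


def closest_snapshot(response):
    """Return the archived copy's URL from a decoded response, or None."""
    closest = response.get("archived_snapshots", {}).get("closest") or {}
    return closest.get("url") if closest.get("available") else None
```

Links for which closest_snapshot returns None would be the candidates to submit for archiving, while the others could be given an inline reference to the archived copy.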