Difference between revisions of "User:COPTR Bot"

From COPTR
Jump to navigation Jump to search
 
(5 intermediate revisions by the same user not shown)
Line 1: Line 1:
The COPTR Bot is a robot account used for importing and checking COPTR content. The source code used is [https://github.com/openplanets/coptr here on GitHub].
+
The COPTR Bot is a robot account used for importing and checking COPTR content. The source code used is [https://github.com/digipres/coptr here on GitHub].
  
 
== Phase 1 - Upload ==
 
== Phase 1 - Upload ==
 
At first, COPTR Bot has been used to populate the Wiki based on an Excel spreadsheet that contained the manually normalised and de-duplicated content of the original source registries.
 
At first, COPTR Bot has been used to populate the Wiki based on an Excel spreadsheet that contained the manually normalised and de-duplicated content of the original source registries.
  
== Phase 2 - Sentinel ==
+
== Phase 2 - Projections ==
 +
The next goal is to use the structured COPTR data to project summary tables, similar to the [http://digitalpowrr.niu.edu/ POWRR project] [http://digitalpowrr.niu.edu/tool-grid/ Tool Grid].
 +
 
 +
== Phase ? - Ideas... ==
 +
 
 +
=== Sentinel ===
 
The next phase should be to periodically scan pages and flag or fix known issues. These may include:
 
The next phase should be to periodically scan pages and flag or fix known issues. These may include:
 
* Converting old-style H1-level pages down to H2.
 
* Converting old-style H1-level pages down to H2.
 
* Updating changes to category names.
 
* Updating changes to category names.
 +
* Checking if the Description is just the same text as the Purpose and flagging the entries to be fixed (using a category as the flag).
 +
 +
c.f. http://coptr.digipres.org/Talk:Main_Page
  
 
And other ideas:
 
And other ideas:
 +
* Checking the page structure is as expected, and the infobox fields are filled.
 
* Checking links are valid.
 
* Checking links are valid.
 
* Extracting links to ensure they are in web archives.
 
* Extracting links to ensure they are in web archives.
 
* Adding inline references to web archive holdings.
 
* Adding inline references to web archive holdings.

Latest revision as of 19:39, 26 September 2014

The COPTR Bot is a robot account used for importing and checking COPTR content. The source code used is here on GitHub.

Phase 1 - Upload[edit]

At first, COPTR Bot has been used to populate the Wiki based on an Excel spreadsheet that contained the manually normalised and de-duplicated content of the original source registries.

Phase 2 - Projections[edit]

The next goal is to use the structured COPTR data to project summary tables, similar to the POWRR project Tool Grid.

Phase ? - Ideas...[edit]

Sentinel[edit]

The next phase should be to periodically scan pages and flag or fix known issues. These may include:

  • Converting old-style H1-level pages down to H2.
  • Updating changes to category names.
  • Checking if the Description is just the same text as the Purpose and flagging the entries to be fixed (using a category as the flag).

c.f. http://coptr.digipres.org/Talk:Main_Page

And other ideas:

  • Checking the page structure is as expected, and the infobox fields are filled.
  • Checking links are valid.
  • Extracting links to ensure they are in web archives.
  • Adding inline references to web archive holdings.