Difference between revisions of "KOST-Val"

From COPTR
Jump to navigation Jump to search
 
(5 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
{{Infobox tool
 
{{Infobox tool
 
|image=KOST-Val.JPG
 
|image=KOST-Val.JPG
|purpose=KOST-Val is an open source validator for different file formats (TIFF, SIARD, PDF/A, JP2, JPEG) and Submission Information Package (SIP).
+
|purpose=KOST-Val is an open source validator for different file formats and Submission Information Package (SIP).
 
|homepage=http://kost-ceco.ch/cms/index.php?kost_val_de
 
|homepage=http://kost-ceco.ch/cms/index.php?kost_val_de
 
|license=[http://www.gnu.org/licenses/quick-guide-gplv3.html GNU General Public License 3+]
 
|license=[http://www.gnu.org/licenses/quick-guide-gplv3.html GNU General Public License 3+]
|platforms=should run under Java 1.6 on Windows
+
|platforms= Windows
|formats_in={{Format|TIFF}}, {{Format|SIARD}}, {{Format|PDF}}/A, {{Format|JP2}}, {{Format|JPEG}} and SIP
+
|function=Quality Assurance, Validation
|function=Validation, Quality Assurance, Preservation System
+
|content=Database, Document, Image
|content=Database, Image, Document
 
 
}}
 
}}
{{Infobox tool details}}
+
 
 
<!-- Use the structure provided in this template, do not change it! -->
 
<!-- Use the structure provided in this template, do not change it! -->
 
= Description =
 
= Description =
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
 
<!-- Describe the what the tool does, focusing on it's digital preservation value. Keep it factual. -->
The KOST-Val application is used to validate {{Format|TIFF}}, {{Format|SIARD}}, {{Format|PDF}}/A, {{Format|JP2}}, {{Format|JPEG}} files and Submission Information Package (SIP).
+
The KOST-Val application is used to validate different file formats and Submission Information Package (SIP).
 
    
 
    
 
KOST-Val supersedes the format validation tools [[SIARD-VAL]], [[TIFF-Val]] and SIP-Val by KOST-CECO.
 
KOST-Val supersedes the format validation tools [[SIARD-VAL]], [[TIFF-Val]] and SIP-Val by KOST-CECO.
Line 21: Line 20:
 
KOST-Val complies with the following requirements.
 
KOST-Val complies with the following requirements.
  
* '''TIFF validation:''' KOST-Val reads a TIFF file and uses [[JHOVE (Harvard Object Validation Environment)| JHOVE]] to validate the structure, the content, and [[ExifTool|ExifTool]] to validate the key properties such as compression, colour space, and multipage. These properties can be configured.  
+
https://raw.githubusercontent.com/KOST-CECO/KOST-Val/master/KOST-Val_FuntionalPrincipleFormatValidation.JPG
* '''SIARD validation:''' KOST-Val reads a SIARD (eCH-0165 v1 and v2-2017) file and validates the structure and the content.
 
* '''PDF/A validation:''' KOST-Val reads a PDF or PDF/A file (ISO 19005-1 and 19005-2) and uses [[3-Heights(TM) PDF Validator|3-Heights™ PDF/A Validator]] by PDF-Tools or [[PdfaPilot|pdfaPilot]] by callas to validate the structure and the content of the PDF file. KOST-Val organises the different error messages into main categories such as fonts, graphics, and metadata. KOST-Val supplies only a limited  version from 3-Heights™ PDF/A Validator by PDF-Tools and pdfaPilot by callas. Module J extracts (with [[IText|iText]]) and validates the JPEG and JP2 images contained in the PDF file (depending on the configuration). It is also possible to configure whether the JBIG2 compression is accepted or not.
 
* '''JP2 validation:''' KOST-Val reads a JP2 file (ISO 15444) and uses [[Jpylyzer]] to validate the structure and the content.
 
* '''JPEG validation:''' KOST-Val reads a JPEG file (ISO 10918-1) and uses [[Bad Peggy]] to validate the structure and the content.
 
* '''SIP validation:''' KOST-Val reads an SIP (eCH-0160 v1 and v1.1 ) and validates the mandatory requirements of the SIP specification. The validated requirements are organised into groups such as folder structure, schema validation, and checksum validation. At the outset, a file format validation is performed.  
 
  
 
The results (including information on inconsistencies and errors) are output for every step and written into a validation log.
 
The results (including information on inconsistencies and errors) are output for every step and written into a validation log.
The validation steps are executed sequentially. Whenever possible the validation shall continue after an error has been detected in order to reduce the number of correction cycles.  
+
The validation steps are executed sequentially. Whenever possible the validation shall continue after an error has been detected in order to reduce the number of correction cycles.
 
 
[[File:KOST-Val_FuntionalPrincipleFormatValidation.JPG|800px]]
 
  
=== Third-party applications ===
 
KOST-Val uses unmodified components of other manufacturers by embedding them directly into the source code. Users of KOST-Val are requested to adhere to these components ‘terms of licence.
 
 
* The TIFF validation module uses [[JHOVE (Harvard Object Validation Environment)| JHOVE]] and [[ExifTool|ExifTool]] and evaluates its output further.
 
* For the PDF/A validation module [[PdfaPilot|pdfaPilot]] or [[3-Heights(TM) PDF Validator|3-Heights™ PDF/A Validator]] are used.
 
* The JP2 validation module uses [[Jpylyzer]] and translates the failed tests into appropriate error messages (DE/FR/EN).
 
* The JPEG validation module uses [[Bad Peggy]] and evaluates the error message "Not a JPEG file" further.
 
* To extract the JPEG and JP2 images from PDF/A the [[IText|iText library]] is used. 
 
* For the file format identification [[DROID_(Digital_Record_Object_Identification)|DROID]] is used. For performance and granularity reasons an own SignatureFile is used instead of the official PRONOM registry.
 
 
 
 
=== Read Me & Download ===
 
=== Read Me & Download ===
The KOST-Val application is used to validate TIFF, SIARD, PDF/A, JP2, JPEG files and Submission Information Package (SIP).
+
https://raw.githubusercontent.com/KOST-CECO/KOST-Val/master/KOST-Val_README.md
 
 
KOST-Val, Copyright (C) 2012-2017 Claire Roethlisberger (KOST-CECO), Christian Eugster, Olivier Debenath, Peter Schneider (Staatsarchiv Aargau), Markus Hahn (coderslagoon), Daniel Ludin (BEDAG AG)
 
  
 
This program comes with ABSOLUTELY NO WARRANTY. This is free software, and you are welcome to redistribute it under certain conditions; see GPL-3.0_COPYING.txt for details.
 
This program comes with ABSOLUTELY NO WARRANTY. This is free software, and you are welcome to redistribute it under certain conditions; see GPL-3.0_COPYING.txt for details.
  
You can download KOST-Val under http://github.com/KOST-CECO/KOST-Val/releases. For installation instructions please check the [http://github.com/KOST-CECO/KOST-Val/releases manual (DE/FR/EN)].
+
You can download KOST-Val under http://github.com/KOST-CECO/KOST-Val/releases. For installation instructions please check the manual.
 
 
 
    
 
    
=== SIARD format ===
 
SIARD stands for Software Independent Archiving of Relational Databases. Originally the Swiss Federal Archives (SFA) have developed the SIARD format as a sustainable solution for the archiving of relations databases.
 
 
In early 2013 SIARD format has been adopted as an eCH Standard (eCH-0165: SIARD format specification https://www.ech.ch/ech/ech-0165).
 
 
    
 
    
eCH is the Swiss organization for standardization in the field of e-government. eCH Standards define guidelines for recurring applications and their results, as for example format definitions or procedural standards. The aim of those standards is to unify and thus facilitate the electronic collaboration between authorities as well as between authorities and organizations, educational and research institutions, firms and private organizations.
 
 
 
=== Future ===
 
=== Future ===
 
See http://github.com/KOST-CECO/KOST-Val/issues  
 
See http://github.com/KOST-CECO/KOST-Val/issues  

Latest revision as of 10:40, 4 April 2024



KOST-Val
KOST-Val is an open source validator for different file formats and Submission Information Package (SIP).
Homepage:http://kost-ceco.ch/cms/index.php?kost_val_de
License:GNU General Public License 3+
Platforms:Windows
Function:Quality Assurance,Validation
Content type:Database,Document,Image


Description[edit]

The KOST-Val application is used to validate different file formats and Submission Information Package (SIP).

KOST-Val supersedes the format validation tools SIARD-VAL, TIFF-Val and SIP-Val by KOST-CECO.


Functional Principle[edit]

KOST-Val complies with the following requirements.

https://raw.githubusercontent.com/KOST-CECO/KOST-Val/master/KOST-Val_FuntionalPrincipleFormatValidation.JPG

The results (including information on inconsistencies and errors) are output for every step and written into a validation log. The validation steps are executed sequentially. Whenever possible the validation shall continue after an error has been detected in order to reduce the number of correction cycles.

Read Me & Download[edit]

https://raw.githubusercontent.com/KOST-CECO/KOST-Val/master/KOST-Val_README.md

This program comes with ABSOLUTELY NO WARRANTY. This is free software, and you are welcome to redistribute it under certain conditions; see GPL-3.0_COPYING.txt for details.

You can download KOST-Val under http://github.com/KOST-CECO/KOST-Val/releases. For installation instructions please check the manual.


Future[edit]

See http://github.com/KOST-CECO/KOST-Val/issues


Feedback & Issues[edit]

Feedback about KOST-Val is very welcome at http://github.com/KOST-CECO/KOST-Val/issues or kost-val[at]kost-ceco.ch

.

User Experiences[edit]

  • ZBW:
    • KOST-Val v1.6.0
    • The tool is very easy to install and to handle.
    • The output in xml-Format (open in a browser to have a table) is easy to understand
    • Running the JPEG-Module against almost 2,400 JPEGs has only lasted 7 minutes
    • The tool recognises fake-JPEGs (jpeg-extension, but no jpeg inside) and issues with jpegs and can differentiate easily between these two cases.

.

Development Activity[edit]

All development activity is visible on GitHub: http://github.com/KOST-CECO/KOST-Val/commits


Release Feed[edit]

Below the last 3 release feeds:

2024-09-05 09:50:54
[tag:github.com,2008:Repository/10448958/v1.1.2 KOST-Tools v1.1.2]
by Chlara
2024-07-30 06:34:04
[tag:github.com,2008:Repository/10448958/KOST-Tools_v1.1.2 KOST-Tools v1.1.2 (alpha-Release)]
by Chlara
2024-05-06 09:19:31
[tag:github.com,2008:Repository/10448958/KOST-Tools_v1.1.1 KOST-Tools v1.1.1]
by Chlara


Activity Feed[edit]

Below the last 5 commits:

2024-07-16 06:13:50
[tag:github.com,2008:Grit::Commit/83ba678dea01746074e9695868104794a1d66208 Add files via upload]
by Chlara https://github.com/Chlara
2024-02-29 06:38:22
[tag:github.com,2008:Grit::Commit/a1c6d75a2bd0c99f1bda6a7cdf20c6639f0d274c Merge pull request #90 from KOST-CECO/v1.1.0]
by KostCeco https://github.com/KostCeco
2024-02-29 06:15:29
[tag:github.com,2008:Grit::Commit/0f6873a4d691ea6ec7fb929d44b8fa6877d4f7c3 KOST-Tools v1.1.0.0 Teil 17]
by Chlara https://github.com/Chlara
2024-02-26 08:55:19
[tag:github.com,2008:Grit::Commit/981deaa62c8e7af97e8055ea07e65117301f54e7 KOST-Tools v1.1.0.0 Teil 16]
by Chlara https://github.com/Chlara
2024-02-26 08:21:13
[tag:github.com,2008:Grit::Commit/c3180d66ea69dc47cea85437cf6a3b62f2718078 KOST-Tools v1.1.0.0 Teil 15]
by Chlara https://github.com/Chlara