Revision as of 08:10, 19 June 2023 by Giodiani (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

GUI application to facilitate the creation and verification of BagIt bags.
License:Open License, but relies on several Apache based and GNU licensed components.
Appears in COW:Digital archiving workflow (high-level), Ingest and AIP creation

Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt6677721baf2d31_29643281


The BagIt specification is a hierarchical file packaging format for the creation of standardised digital containers called 'bags,' which are used for storing and transferring digital content. Derived from work by the Library of Congress and the California Digital Library, a bag consists of a ‘payload’ - the digital content - and ‘tags' - metadata files to document the storage and transfer of the bag. There are a number of Bagit-specific tools to ease bag creation, including the BagIt Library, a Java-based software library to support the creation, manipulation, and validation of bags. For those less comfortable with command-line interface, the Bagger application provides a graphical user interface to the BagIt Library.


The United States Library of Congress, and the National Digital Information Infrastructure and Preservation Program (NDIIPP)

Licensing and cost[edit]

BSD License - free.  The BagIt Library is public domain.

Development activity[edit]

Bagger 2.1.2 was released in February 2012. BagIt Library 4.1 was released in January 2012. The Library of Congress website implies ongoing development of the Transfer Utilities. 

Platform and interoperability[edit]

Both the BagIt Library and Bagger require Java 6.

Functional notes[edit]

Bags contain at minimum three elements: a ‘payload’ and at least two ‘tags.’ The payload consists of the content being preserved. The first tag is a manifest itemising the files making up the content along with their checksums; the second is a bagit.txt file identifying the container as a bag and giving the version of the specification used and the character encoding of the tags.  The specification additionally allows for several optional tags.  

Documentation and user support[edit]

Documentation is extremely sparse, primarily consisting of README files detailing release notes. The BagIt specification itself can be found through the Library of Congress website. It appears that the main user support consists of a mailing list hosted by Sourceforge; however, the list archive only shows 11 posts for 2011.


The BagIt Library uses a command-line interface, while Bagger provides a graphical user interface. No installation is required; the tools can simply be downloaded and run, although it may not be immediately clear to users how to do so.

Expertise required[edit]

BagIt is designed to create a common language for users exchanging digital materials, essentially negating the need for expertise about others’ protocols. However, for configuration, familiarity with one’s own repository’s technical protocols is essential.

Standards compliance[edit]

The BagIt specification is an Internet Engineering Task Force (IETF) internet draft.

Influence and take-up[edit]

The BagIt specification has become widely accepted in the preservation community, and is used by the Library of Congress, Chronopolis, and The Stanford Digital Repository, among others. The Transfer Utilities have been downloaded nearly 4000 times from Sourceforge.

User Experiences[edit]

Development Activity[edit]

Activity Feed[edit]

Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:

2023-01-09 21:16:26
[,2008:Grit::Commit/bc8e34fd669d10aa41748161e657299d03bcd51e Merge pull request #72 from LibraryOfCongress/update-readme-roadmap]
by acdha
2023-01-09 21:12:21
[,2008:Grit::Commit/a22e45d018b607bfb7c09415f8ef8fc8ca74b985 Update information about the future of Bagger]
by acdha
2022-09-27 21:10:53
[,2008:Grit::Commit/dff49de4e74f075b7451aed574b053b6b02fda83 Merge pull request #71 from LibraryOfCongress/add-rfq-information]
by acdha
2022-09-27 20:10:09
[,2008:Grit::Commit/8373c1858c990f08a20872261c88677dd1898704 Add RFQ announcement]
by acdha
2018-04-30 14:00:15
[,2008:Grit::Commit/d3856aa3f048740d996dda494cc9cf1d47533a70 updated issue template for github]
by johnscancella
2018-04-18 13:35:21
[,2008:Grit::Commit/ecd742ed0f6e81e4f3875e2f8826dbf6416cb225 include new profiles when copying default profiles]
by johnscancella
2018-04-16 15:41:51
[,2008:Grit::Commit/41c07101d44d16755d05009d2fc9c67140b760fe Merge pull request #52 from StateArchivesOfNorthCarolina/master]
by johnscancella

Release Feed[edit]

2018-04-18 13:36:07
[,2008:Repository/28666273/v2.8.1 Release 2.8.1]
by johnscancella
2018-04-16 15:48:19
[,2008:Repository/28666273/v2.8 Release 2.8]
by johnscancella
2017-11-06 12:27:41
[,2008:Repository/28666273/2.7.7 Release 2.7.7]
by johnscancella