EPADD

From COPTR
Jump to navigation Jump to search


EPADD
ePADD is a software package developed by Stanford University's Special Collections & University Archives that supports archival processes around the appraisal, ingest, processing, discovery, and delivery of email archives.
Homepage:https://library.stanford.edu/projects/epadd
License:Apache 2.0
Platforms:Java
Function:Metadata Processing,Metadata Extraction,Preservation System
Content type:Email





Description[edit]

From the project Github page: "ePADD is a software package developed by Stanford University's Special Collections & University Archives that supports archival processes around the appraisal, ingest, processing, discovery, and delivery of email archives.

The software is comprised of four modules:

  • Appraisal: Allows donors, dealers, and curators to easily gather and review email archives prior to transferring those files to an archival repository.
  • Processing: Provides archivists with the means to arrange and describe email archives.
  • Discovery: Provides the tools for repositories to remotely share a redacted view of their email archives with users through a web server discovery environment. (Note that this module is downloaded separately).
  • Delivery: Enables archival repositories to provide moderated full-text access to unrestricted email archives within a reading room environment."

From the Project page:

ePADD Technical Information

ePADD is written in Java and Javascript and powered by Apache Tomcat (v7.0) using Java EE Servlet API (v3.x) and Java Mail (v1.4.2). Text and metadata extraction, indexing and retrieval is performed by Apache Lucene (v4.7) and Apache Tika (v1.8). Charting and visualization is supported using the D3-based reusable chart library (v0.4.10). Oracle's Java Application Bundler and Launch4J are used for packaging on Mac and Windows platforms respectively. Other Java libraries from Apache (Lang, commons, CLI, IO, logging, etc.) are also used. JSON formatting is performed with the libraries org.json and Gson.

ePADD has implemented its own natural language processing (NLP) toolkit which is used for named entity extraction, disambiguation and other tasks. This toolkit supplants the Apache OpenNLP used in earlier beta versions of the ePADD software. We continue to use Muse as an internal library within ePADD. However, the Apache OpenNLP proved insufficient for our needs (at least for name recognition), and after various rounds of customization, we built our own named entity recognizer. This toolkit uses external datasets such as Wikipedia/DBpedia, Freebase, Geonames, OCLC FAST and LC Subject Headings/LC Name Authority File.

The project is developed with IDEs like IntelliJ Idea and Eclipse, built with Apache Maven, Ant, and custom shell scripts, and tracked using Git for source control and issue tracking. The ePADD software client is browser-based and compatible with Chrome and Firefox. It is optimized for Windows 7 SP1/10, OSX 10.12/10.13, and Ubuntu 16.04 machines, using Java 8.


User Experiences[edit]


Development Activity[edit]

All development activity is visible on GitHub: http://github.com/ePADD/epadd/commits


Release Feed[edit]

Below the last 3 release feeds:

2021-04-08 19:35:39
[tag:github.com,2008:Repository/38125452/v8 Version 8]
by hangal
2021-01-20 20:58:55
[tag:github.com,2008:Repository/38125452/v7.3.5-alpha 7.3.5 alpha]
by chinuhub
2020-10-30 14:22:52
[tag:github.com,2008:Repository/38125452/v7.3.4 v7.3.4]
by chinuhub


Activity Feed[edit]

Below the last 5 commits:

2021-05-04 14:25:30
[tag:github.com,2008:Grit::Commit/c05db20b11f553badab2334aa98e821cc59d19f1 encodeURIComponent was missing from the URL's formed in ajax calls. T…]
by chinuhub https://github.com/chinuhub
2021-05-04 11:17:13
[tag:github.com,2008:Grit::Commit/45a035d0aed7bac03410e590f7b4414ef3c7c613 Changed encodeURI to encodeURIComponent in collection.js which was cr…]
by chinuhub https://github.com/chinuhub
2021-04-06 16:47:18
[tag:github.com,2008:Grit::Commit/87437f348a1e10363b7c40ac02102889c4866c66 version 8 update]
by chinuhub https://github.com/chinuhub
2021-04-06 16:26:35
[tag:github.com,2008:Grit::Commit/bf5755365532449a97a910297f2c88e495acea00 epaddRepoDetails file if present in the home directory will override …]
by chinuhub https://github.com/chinuhub
2021-04-06 16:14:11
[tag:github.com,2008:Grit::Commit/9248194fab323496a3b2e128c72baab02f85c60e Merge branch 'master' into loading-large-mbox]
by chinuhub https://github.com/chinuhub