Difference between revisions of "Dataverse"

From COPTR
Jump to navigation Jump to search
(Updated information since it hasnt been updated since 2013.)
 
(5 intermediate revisions by 3 users not shown)
Line 1: Line 1:
{{Infobox_tool
+
{{Infobox tool
|purpose=The Dataverse is an open source ([https://github.com/IQSS/dataverse.org code is available on GitHub]) web application to share, preserve, cite, explore and analyze research data.
+
|purpose=The Dataverse is an open source web application to share, preserve, cite, explore and analyze research data.
|image=
 
 
|homepage=http://dataverse.org/
 
|homepage=http://dataverse.org/
|license=
+
|sourcecode=https://github.com/IQSS/dataverse
|platforms=
+
|license=Apache License, Version 2.0
 +
|function=Active Data Storage, Managing Active Research Data, Preservation System, Storage
 +
|content=Research Data
 +
}}
 +
{{Infobox tool details
 +
|ohloh_id=Dataverse
 
}}
 
}}
 
<!-- Delete the Categories that do not apply -->
 
[[Category:Active Data Storage]]
 
[[Category:Preservation System]]
 
[[Category:Managing Active Research Data]]
 
[[Category:Storage]]
 
[[Category:Research Data]]
 
 
 
 
= Description =
 
= Description =
The '''Dataverse'''<ref>{{cite web|last=Crosas|first=M.|title=The Dataverse Network®: An Open-Source Application for Sharing, Discovering and Preserving Data|url=http://www.dlib.org/dlib/january11/crosas/01crosas.html|work=D-Lib Magazine|accessdate=27 May 2015}}</ref> is an open source ([https://github.com/IQSS/dataverse.org code is available on GitHub]) web application to share, preserve, cite, explore and analyze research data. It facilitates making data available to others, and allows you to replicate others' work.<ref>{{cite web|title=About the Project|url=http://dataverse.org/about/|website=Dataverse}}</ref> Researchers, data authors, publishers, data distributors, and affiliated institutions all receive appropriate credit via a data citation with a persistent identifier (e.g., DOI, or Handle).
+
'Dataverse' is an open source ([https://github.com/IQSS/dataverse code is available on GitHub]) web application to share, preserve, cite, explore and analyze research data. It facilitates making data available to others, and allows you to replicate others' work ([http://dataverse.org/about/ see their About page]). Researchers, data authors, publishers, data distributors, and affiliated institutions all receive appropriate credit via a data citation with a persistent identifier (DOI, or Handle).
  
 
A Dataverse repository hosts multiple dataverses ([http://guides.dataverse.org/en/4.0/_images/Dataverse-Diagram.png see diagram]). Each dataverse contains dataset(s) or other dataverses, and each dataset contains descriptive metadata and data files (including documentation and code that accompany the data - [http://guides.dataverse.org/en/4.0/_images/DatasetDiagram.png see diagram]).  
 
A Dataverse repository hosts multiple dataverses ([http://guides.dataverse.org/en/4.0/_images/Dataverse-Diagram.png see diagram]). Each dataverse contains dataset(s) or other dataverses, and each dataset contains descriptive metadata and data files (including documentation and code that accompany the data - [http://guides.dataverse.org/en/4.0/_images/DatasetDiagram.png see diagram]).  
 +
 
====Provider====
 
====Provider====
 
Institute for Quantitative Social Science at Harvard University, along with many collaborators and contributors worldwide.
 
Institute for Quantitative Social Science at Harvard University, along with many collaborators and contributors worldwide.
 +
 
====Licensing and cost====
 
====Licensing and cost====
 
[http://www.apache.org/licenses/LICENSE-2.0.html Apache 2 License] &ndash; free.
 
[http://www.apache.org/licenses/LICENSE-2.0.html Apache 2 License] &ndash; free.
====Development activity====
+
Version 3.0 was released in May 2012. The current version (in August 2013) is 3.5.1.
 
The software is continually development, as revealed by an active issues tracking page. &nbsp;The project is Harvard-sponsored, and appears to have support for the foreseeable future.
 
 
====Platform and interoperability====
 
====Platform and interoperability====
The Dataverse Network makes use of the following components: Java Server Faces (JSF2); Enterprise Java Beans (EJB3); PostgreSQL; Lucene; and R and Zelig.
+
The Dataverse makes use of the following components: Java Server Faces; Enterprise Java Beans; PostgreSQL; Solr; and R and Zelig.
Prerequisites for installation include Sun/Oracle Java JDK 1.6+, a &ldquo;virgin&rdquo; installation of Glassfish v2.1+, preferably as part of the NetBeans Web Development bundle, PostgreSQL v8.3+, and R.
+
Prerequisites for installation include Oracle JDK or OpenJDK, a &ldquo;virgin&rdquo; installation of Glassfish Version 4.1+, preferably as part of the NetBeans Web Development bundle, PostgreSQL Version 9.3+, and R.
The software was designed to integrate reCAPTCHA, Google Analystics, ImageMagick, Handle, and [http://www.dcc.ac.uk/resources/external/lockss LOCKSS] if the installer so wishes.
+
The software was designed to integrate reCAPTCHA, Google Analystics, ImageMagick, Shibboleth, and DOI registration via EZID if the installer so wishes.
 +
The Dataverse currently has [http://guides.dataverse.org/en/latest/api/index.html multiple open APIs available], which allow for searching, depositing and accessing data.
 +
 
====Functional notes====
 
====Functional notes====
DataVerses can be configured for two levels of access. An Open DataVerse allows all registered users to edit their own studies; a Wiki DataVerse allows users to edit all studies, although only a curator or administrator can release the changes.
+
Dataverses can be configured for multiple levels of access (at the dataverse, dataset and file level).
A DataVerse will accept any format, but will only give full support to tabular data. SPSS and STATA are the preferred formats; data in these formats will be eligible for subsettable features, multiple formats for download, and a Universal Numerical Fingerprint (UNF). &nbsp;GraphML is recommended for network data, and will be eligible for subsetting and pre-defined measurements.
+
Dataverse will accept any format, but will give full support to tabular data or fits file data (astronomy format). SPSS, STATA, R and csv are the preferred formats; data in these formats will be eligible for subsettable features, multiple formats for download, and a Universal Numerical Fingerprint (UNF). These files will be eligible for subsetting and pre-defined measurements.
A DataVerse Network has the capability to be registered as a handle.net server, which allows the Network to assign persistent identifiers to data sets.
+
Dataverse has the capability to register DOIs from EZID, which allows the repository to assign persistent identifiers to data sets.
 +
 
====Documentation and user support====
 
====Documentation and user support====
The website contains extensive software documentation, including user, installer, and developer [http://guides.thedata.org/ guides]. &nbsp;A new Users google group appears to be reasonably active.&nbsp; Aside from a webform, contact information is not prominently advertised.
+
The website contains extensive software documentation, including user, installer, and developer [http://guides.dataverse.org/en/latest/ guides]. &nbsp;A [https://groups.google.com/forum/#!forum/dataverse-community Users google group] appears to be reasonably active, along with a specific support email address.
 +
 
====Usability====
 
====Usability====
The DataVerse software provides a web-based interface for both administrators and users.
+
The Dataverse software provides a web-based interface for both administrators and users.
 
The package includes an installer, which is run through the command line; basic install is designed to be very quick. Comfort with command-line interface and general systems knowledge appear to be crucial for configuration and installation of any add-ons.
 
The package includes an installer, which is run through the command line; basic install is designed to be very quick. Comfort with command-line interface and general systems knowledge appear to be crucial for configuration and installation of any add-ons.
 +
 
====Expertise required====
 
====Expertise required====
 
To take full advantage of the archival management features in the software, users should have a firm grasp on the metadata expectations for their field.
 
To take full advantage of the archival management features in the software, users should have a firm grasp on the metadata expectations for their field.
 +
 
====Standards compliance====
 
====Standards compliance====
The software supports numerous metadata standards, exporting as XML records in DDI, Dublin Core, FGDC, and Marc formats. The software is Z39.50 and OAI-PMH compliant, and has the ability to register information to Handle.net.
+
The software supports numerous metadata standards, including DDI, Dublin Core, Data Cite, Virtual Observatory (for astronomy), ISA-Tab (for biomedical). Each dataset is also given a data citation with a persistent global unique identifier that is in compliance with DataCite and the [https://www.force11.org/group/joint-declaration-data-citation-principles-final Joint Declaration of Data Citation Principles].
 +
 
====Influence and take-up====
 
====Influence and take-up====
Current installations include Dataverse Networks at Harvard IQSS, ICPSR, the University of the Thai Chamber of Commerce, and the Utrecht University Library. The software&#39;s sourceforge page reports nearly 7000 downloads.
+
Current installations include
 
+
*[https://dataverse.harvard.edu Harvard University]: A collaboration with the [http://www.iq.harvard.edu/ Institute for Quantitative Social Science (IQSS)], the Harvard Library, and Harvard University Information Technology (HUIT): the [http://dataverse.harvard.edu Harvard Dataverse ] is a repository for sharing, citing, analyzing, and preserving research data; open to all scientific data from all disciplines worldwide.
 +
*[http://www.irss.unc.edu/odum/contentSubpage.jsp?nodeid=11 Odum Institute]
 +
*[https://dataverse.nl Dutch Universities (DANS)]
 +
*[http://dvn.fudan.edu.cn/dvn/ Fudan University]
 +
*[https://dataverse.library.ualberta.ca/dvn/ University of Alberta Libraries]
 +
*[http://130.226.229.76/dvn/ Department of Cross Cultural and Regional Studies, University of Copenhagen] (ToRS)
 +
*[http://dvn.library.ubc.ca/dvn/ ABACUS - British Columbia Research Libraries' Data Services]
 +
*[http://dataverse.scholarsportal.info/dvn/ Scholars Portal - Ontario Council of University Libraries (OCUL)]
 +
*[https://heidata.uni-heidelberg.de/dvn/ HeiDATA - Heidelberg University]
 +
*[http://opendata.uit.no/dvn/ UiT Open Research Data - UiT The Arctic University of Norway]
 +
 
= User Experiences =
 
= User Experiences =
 
+
 
+
 
 
= Development Activity =
 
= Development Activity =
 
+
<!-- Provide *evidence* of development activity of the tool. For example, RSS feeds for code issues or commits. -->
{{Infobox_tool_details
+
Version 4.0 was released in April 2015. The current version (in December 2015) is 4.2.2.
|ohloh_id=Dataverse
+
The software is continually development, as revealed by an active [https://github.com/IQSS/dataverse/issues issues] tracking page. &nbsp;The project is Harvard-sponsored, and appears to have support for the foreseeable future.
}}
+
All development activity is visible on GitHub: http://github.com/IQSS/dataverse/commits
 +
 +
 +
=== Release Feed ===
 +
Below the last 3 release feeds:
 +
<rss max=3>https://github.com/IQSS/dataverse/releases.atom</rss>
 +
 
 +
 +
=== Activity Feed ===
 +
Below the last 5 commits:
 +
<rss max=5>https://github.com/IQSS/dataverse/commits/master.atom</rss>

Latest revision as of 16:20, 21 March 2022




The Dataverse is an open source web application to share, preserve, cite, explore and analyze research data.
Homepage:http://dataverse.org/
Source Code:https://github.com/IQSS/dataverse
License:Apache License, Version 2.0
Function:Active Data Storage,Managing Active Research Data,Preservation System,Storage
Content type:Research Data



Description[edit]

'Dataverse' is an open source (code is available on GitHub) web application to share, preserve, cite, explore and analyze research data. It facilitates making data available to others, and allows you to replicate others' work (see their About page). Researchers, data authors, publishers, data distributors, and affiliated institutions all receive appropriate credit via a data citation with a persistent identifier (DOI, or Handle).

A Dataverse repository hosts multiple dataverses (see diagram). Each dataverse contains dataset(s) or other dataverses, and each dataset contains descriptive metadata and data files (including documentation and code that accompany the data - see diagram).

Provider[edit]

Institute for Quantitative Social Science at Harvard University, along with many collaborators and contributors worldwide.

Licensing and cost[edit]

Apache 2 License – free.

Platform and interoperability[edit]

The Dataverse makes use of the following components: Java Server Faces; Enterprise Java Beans; PostgreSQL; Solr; and R and Zelig. Prerequisites for installation include Oracle JDK or OpenJDK, a “virgin” installation of Glassfish Version 4.1+, preferably as part of the NetBeans Web Development bundle, PostgreSQL Version 9.3+, and R. The software was designed to integrate reCAPTCHA, Google Analystics, ImageMagick, Shibboleth, and DOI registration via EZID if the installer so wishes. The Dataverse currently has multiple open APIs available, which allow for searching, depositing and accessing data.

Functional notes[edit]

Dataverses can be configured for multiple levels of access (at the dataverse, dataset and file level). Dataverse will accept any format, but will give full support to tabular data or fits file data (astronomy format). SPSS, STATA, R and csv are the preferred formats; data in these formats will be eligible for subsettable features, multiple formats for download, and a Universal Numerical Fingerprint (UNF). These files will be eligible for subsetting and pre-defined measurements. Dataverse has the capability to register DOIs from EZID, which allows the repository to assign persistent identifiers to data sets.

Documentation and user support[edit]

The website contains extensive software documentation, including user, installer, and developer guides.  A Users google group appears to be reasonably active, along with a specific support email address.

Usability[edit]

The Dataverse software provides a web-based interface for both administrators and users. The package includes an installer, which is run through the command line; basic install is designed to be very quick. Comfort with command-line interface and general systems knowledge appear to be crucial for configuration and installation of any add-ons.

Expertise required[edit]

To take full advantage of the archival management features in the software, users should have a firm grasp on the metadata expectations for their field.

Standards compliance[edit]

The software supports numerous metadata standards, including DDI, Dublin Core, Data Cite, Virtual Observatory (for astronomy), ISA-Tab (for biomedical). Each dataset is also given a data citation with a persistent global unique identifier that is in compliance with DataCite and the Joint Declaration of Data Citation Principles.

Influence and take-up[edit]

Current installations include

User Experiences[edit]

Development Activity[edit]

Version 4.0 was released in April 2015. The current version (in December 2015) is 4.2.2. The software is continually development, as revealed by an active issues tracking page.  The project is Harvard-sponsored, and appears to have support for the foreseeable future. All development activity is visible on GitHub: http://github.com/IQSS/dataverse/commits


Release Feed[edit]

Below the last 3 release feeds:

2022-08-03 15:25:46
[tag:github.com,2008:Repository/14051004/v5.11.1 v5.11.1]
by kcondon
2022-08-02 19:10:31
[tag:github.com,2008:Repository/14051004/v5.11 v5.11]
by kcondon
2022-04-06 19:49:14
[tag:github.com,2008:Repository/14051004/v5.10.1 v5.10.1]
by kcondon


Activity Feed[edit]

Below the last 5 commits:

2022-08-02 17:44:27
[tag:github.com,2008:Grit::Commit/02e3e921b01306618d45182b4a5757bb12856fe1 Merge pull request #8879 from IQSS/8878-5.11.1-hotfix]
by kcondon https://github.com/kcondon
2022-08-02 16:59:31
[tag:github.com,2008:Grit::Commit/0f185b59218091db726ad60a580cf960f88499b7 Update 5.11.1-release-notes.md]
by landreev https://github.com/landreev
2022-08-02 15:21:30
[tag:github.com,2008:Grit::Commit/08ae12bd34e38c0600e7e03e156e304ae7e0d0fc fix version in sphinx #8878]
by pdurbin https://github.com/pdurbin
2022-08-02 15:19:35
[tag:github.com,2008:Grit::Commit/d463e486c266e954b435690e02fd86ffafbca7b7 typo in war file name #8878]
by pdurbin https://github.com/pdurbin
2022-08-02 14:12:36
[tag:github.com,2008:Grit::Commit/f79a95420d7f857d4aa3373f7d4f16e8f29a4e19 tweak release notes #8878]
by pdurbin https://github.com/pdurbin