Difference between revisions of "Dataverse"

From COPTR
Jump to navigation Jump to search
(Trial import from script.)
 
(Updated information since it hasnt been updated since 2013.)
Line 1: Line 1:
 
{{Infobox_tool
 
{{Infobox_tool
|purpose=Dataverse Network software allows organisations to host a storage and access system for research materials.
+
|purpose=The Dataverse is an open source ([https://github.com/IQSS/dataverse.org code is available on GitHub]) web application to share, preserve, cite, explore and analyze research data.
 
|image=
 
|image=
|homepage=http://thedata.org/
+
|homepage=http://dataverse.org/
 
|license=
 
|license=
 
|platforms=
 
|platforms=
Line 16: Line 16:
  
 
= Description =
 
= Description =
[http://thedata.org/ Dataverse Network] software allows organisations to host a storage and access system for research materials. The software creates self-contained ‘dataverses,’ each of which is designed to support individual researchers or research groups.
+
The '''Dataverse'''<ref>{{cite web|last=Crosas|first=M.|title=The Dataverse Network®: An Open-Source Application for Sharing, Discovering and Preserving Data|url=http://www.dlib.org/dlib/january11/crosas/01crosas.html|work=D-Lib Magazine|accessdate=27 May 2015}}</ref> is an open source ([https://github.com/IQSS/dataverse.org code is available on GitHub]) web application to share, preserve, cite, explore and analyze research data. It facilitates making data available to others, and allows you to replicate others' work.<ref>{{cite web|title=About the Project|url=http://dataverse.org/about/|website=Dataverse}}</ref> Researchers, data authors, publishers, data distributors, and affiliated institutions all receive appropriate credit via a data citation with a persistent identifier (e.g., DOI, or Handle).
Data producers can take advantage of versioning systems, publish data sets with corresponding standardised citations, and choose levels of access for all materials. The Network host can also use a harvester to access remote information sources that have invited others to do so, using the OAI or Nesstar protocols.
+
 
The system was designed for Social Science data, but can be adapted for other scientific research materials.
+
A Dataverse repository hosts multiple dataverses ([http://guides.dataverse.org/en/4.0/_images/Dataverse-Diagram.png see diagram]). Each dataverse contains dataset(s) or other dataverses, and each dataset contains descriptive metadata and data files (including documentation and code that accompany the data - [http://guides.dataverse.org/en/4.0/_images/DatasetDiagram.png see diagram]).  
 
====Provider====
 
====Provider====
Institute for Quantitative Social Science at Harvard University
+
Institute for Quantitative Social Science at Harvard University, along with many collaborators and contributors worldwide.
 
====Licensing and cost====
 
====Licensing and cost====
 
[http://www.apache.org/licenses/LICENSE-2.0.html Apache 2 License] &ndash; free.
 
[http://www.apache.org/licenses/LICENSE-2.0.html Apache 2 License] &ndash; free.

Revision as of 20:24, 20 December 2015

The Dataverse is an open source (code is available on GitHub) web application to share, preserve, cite, explore and analyze research data.
Homepage:http://dataverse.org/


Description

The Dataverse[1] is an open source (code is available on GitHub) web application to share, preserve, cite, explore and analyze research data. It facilitates making data available to others, and allows you to replicate others' work.[2] Researchers, data authors, publishers, data distributors, and affiliated institutions all receive appropriate credit via a data citation with a persistent identifier (e.g., DOI, or Handle).

A Dataverse repository hosts multiple dataverses (see diagram). Each dataverse contains dataset(s) or other dataverses, and each dataset contains descriptive metadata and data files (including documentation and code that accompany the data - see diagram).

Provider

Institute for Quantitative Social Science at Harvard University, along with many collaborators and contributors worldwide.

Licensing and cost

Apache 2 License – free.

Development activity

Version 3.0 was released in May 2012. The current version (in August 2013) is 3.5.1. The software is continually development, as revealed by an active issues tracking page.  The project is Harvard-sponsored, and appears to have support for the foreseeable future.

Platform and interoperability

The Dataverse Network makes use of the following components: Java Server Faces (JSF2); Enterprise Java Beans (EJB3); PostgreSQL; Lucene; and R and Zelig. Prerequisites for installation include Sun/Oracle Java JDK 1.6+, a “virgin” installation of Glassfish v2.1+, preferably as part of the NetBeans Web Development bundle, PostgreSQL v8.3+, and R. The software was designed to integrate reCAPTCHA, Google Analystics, ImageMagick, Handle, and LOCKSS if the installer so wishes.

Functional notes

DataVerses can be configured for two levels of access. An Open DataVerse allows all registered users to edit their own studies; a Wiki DataVerse allows users to edit all studies, although only a curator or administrator can release the changes. A DataVerse will accept any format, but will only give full support to tabular data. SPSS and STATA are the preferred formats; data in these formats will be eligible for subsettable features, multiple formats for download, and a Universal Numerical Fingerprint (UNF).  GraphML is recommended for network data, and will be eligible for subsetting and pre-defined measurements. A DataVerse Network has the capability to be registered as a handle.net server, which allows the Network to assign persistent identifiers to data sets.

Documentation and user support

The website contains extensive software documentation, including user, installer, and developer guides.  A new Users google group appears to be reasonably active.  Aside from a webform, contact information is not prominently advertised.

Usability

The DataVerse software provides a web-based interface for both administrators and users. The package includes an installer, which is run through the command line; basic install is designed to be very quick. Comfort with command-line interface and general systems knowledge appear to be crucial for configuration and installation of any add-ons.

Expertise required

To take full advantage of the archival management features in the software, users should have a firm grasp on the metadata expectations for their field.

Standards compliance

The software supports numerous metadata standards, exporting as XML records in DDI, Dublin Core, FGDC, and Marc formats. The software is Z39.50 and OAI-PMH compliant, and has the ability to register information to Handle.net.

Influence and take-up

Current installations include Dataverse Networks at Harvard IQSS, ICPSR, the University of the Thai Chamber of Commerce, and the Utrecht University Library. The software's sourceforge page reports nearly 7000 downloads.

User Experiences

Development Activity

Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt6623561f2973c7_47858617


  1. {{#invoke:citation/CS1|citation |CitationClass=web }}
  2. {{#invoke:citation/CS1|citation |CitationClass=web }}