1 of 36

Slide Notes

DownloadGo Live

DLCM@OAI9

Published on Nov 18, 2015

Digital curation and preservation of large and complex scientific objects: Data Life-Cycle Management - The Swiss Way. We target in the context of a national program (CUS-P2) the setting up of the the required services that will allow the efficient management of active research data, and ensure the publication, long-term reference and preservation of subsets of data selected by researchers. Through this project we intend to implement concrete high-impact use cases of exemplary research data life-cycle management solutions, along with guidelines and training, so that researchers and their supporting scientific IT and library teams can apply the results themselves in their daily data management activities.

PRESENTATION OUTLINE

DLCM - The Swiss Way

OAI9 - P.-Y. Burgi - Université de Genève
Photo by Darkroom Daze

Agenda

  • Swiss Context (SUC P-2)
  • The DLCM Project
  • Complex Objects?
  • DLCM Tools
  • Challenging Issues
Photo by bengrey

Program SUC 2013-2016 P-2
"Scientific Information: Access, Processing and Safeguarding"

Photo by dalbera

SUC-P2 Facts

  • 36 M CHF
  • 2013-2016 => extended
  • 4 key areas: Publications, eScience, Basis, Services

Currently 16 Active Projects

Photo by ohefin

eScience: Scope

  • Developing concepts leading to national services
  • Further developing established local services
  • Supporting pilot projects
  • Supporting training
Photo by Dru!

The DLCM Project

Photo by dalbera

8 Partner Institutions

Due to start 01.09.15

(final decision 22.06.15)

Pre-study phase

August 2014 to February 2015

The 4 key DLCM questions...

Interviews

"The" Cycle...

Untitled Slide

Links With Other CUS-P2 Projects

  • eScience Coordination Team (eSCT)
  • DICE+
  • Open Research Data (ORD@CH)
  • DACAR@HEG
  • Data and Service Center for the Humanities (DaSCH)
  • Swiss edu-ID and SCALE
Photo by elcovs

Links With International Projects

  • Alliance Permanent Access
  • OpenAIRE
  • FAIRDOM: Findable, Accessible, Interoperable and Reusable (in biology domain)
  • DARIAH

Complex Objects?

Photo by Gwendal_

Simple Object

  • Discrete digital files: textual, picturial, audio, etc.

Untitled Slide

Complex Data

  • Multiformat (DB, texts, images, sounds, videos…)
  • Multistructure (relational databases, XML documents repository…)
  • Multisource (distributed DB, Web…)
  • Multimodal
  • Multiversion (temporal DB…)
  • Volumetry is contributing to complexity
Photo by Gwendal_

DLCM Data, e.g.

  • RDF databases (DH, genomic,...)
  • Genome-based healthcare data of +30K individuals
  • +40K scanned books (Bodmerlab)
  • Imaging data in life-sciences
Photo by dalbera

Bodmerlab (DH)

  • World literature (160'000 works in eighty languages)
  • High resolution images (6562 × 4452) - one per page
  • +40K scanned books (out of 160K)
  • Several hundreds of TBytes

Life Sciences

  • Quantitative high-speed imaging of entire developing embryos with simultaneous multiview microscopy
  • 175 million voxels per second for up to several days
  • Several dozens of terabytes per specimen
Photo by Grey cells

The Tools

to manage research data
Photo by magnuscanis

1. ELN / LIMS

  • openBIS (@ ETH-Z)
  • SLims (Simple LIMS @ EPF-L)
Photo by deanmeyersnet

openBIS

openBIS Structure

openBIS Attachments

VRE for DH

  • SALSAH (@ UNIBAS): System for Annotations & Linkage of Sources in Arts & Humanities
Photo by saguayo

Salsah UI

Salsah RDF Structure

2. Data Preservation & Sharing

  • Eprints
  • Rosetta
  • Fedora Commons
  • Docuteam Packer
  • Invenio
Photo by inju

General Workflows

"Small-Data" Workflow

Challenging Issues

  • Genericity vs. specificity of the tools
  • UI: How to intuitively handle complex datasets?
  • Business plans
Photo by programwitch

DLCM Contacts

Photo by JASElabs