49 SPEC Kit 354: Data Curation
Archivematica will provide these features and is part of planned future development.
HUBzero has the capability to provide emulation and to wrap and make software applications available
to be executed over the web, much like a terminal server. Ideally, we would like to leverage this
capability to make executable tools available with data and to enable online workflow execution and
reproducibility, but to date we have only published a Linux desktop as a proof of concept.
Some items are either taken care of at the consortial level or are subject to consortial prioritizing.
Some of these activities are dependent on infrastructures provided by departments outside the
Libraries but within the university.
Succession planning documentation is pending review.
We have the capacity for versioning but it isn’t implemented as an automatic function at this time.
34. Please indicate how challenging you expect the following aspects of data curation to be in the next
3 to 5 years on a scale of 1 to 5 where 1=Not challenging and 5=Very challenging. N=50
Aspects of Data Curation                   1   2   3   4   5   Rating Average
Expertise in curating certain domain       1   2  11  17  19   4.02
Scaling curation services with             1   5  11  14  19   3.90
Training and retooling library staff to
  support data curation services           2   4  14  14  16   3.76
Outreach/Marketing of services             1   9  10  19  11   3.60
Recruiting and retaining data curation     3   9  10  13  15   3.56
Keeping up with technology changes         2   6  15  16  11   3.56
requirements for data sharing              1   8  16  13  12   3.54
# of respondents                           6  24  40  41  37
Note: A higher average rating indicates a more challenging aspect.
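The rating averages above appear to be weighted means of the 1-to-5 scale, where each rating is weighted by the number of respondents who chose it. The following is a minimal sketch of that arithmetic, not the survey's own tooling. The function name rating_average is hypothetical, and the two rows are copied from the table.

```python
def rating_average(counts):
    """Weighted mean of a 1-to-5 scale, given the response count for each rating."""
    total = sum(counts)  # number of respondents for this row
    weighted = sum(rating * n for rating, n in enumerate(counts, start=1))
    return weighted / total


# Response counts for ratings 1 through 5, copied from two rows of the table.
rows = {
    "Outreach/Marketing of services": [1, 9, 10, 19, 11],
    "Keeping up with technology changes": [2, 6, 15, 16, 11],
}

for label, counts in rows.items():
    print(f"{label}: N={sum(counts)}, average={rating_average(counts):.2f}")
```

Under this assumption, both rows sum to N=50 and reproduce the reported averages of 3.60 and 3.56.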
35. Please enter any additional comments you have about data curation challenges. N=16
All dependent on institutional priorities.
Being able to hire IT to ensure infrastructure is stable and can be developed over time.
Demand still relatively low.
Developing successful use cases will aid in funding, infrastructure, and resources support. ROI
Each new dataset seems to be unique among all previously accepted data.
In many of these cases, these aspects of data curation have already begun, but I imagine that this will
be an ongoing process.
Perception of services will be a big issue; as data curation becomes “popular,” it will still get conflated
with storage or at least ease of storage, so demand could rise steeply.