0% found this document useful (0 votes)
36 views60 pages

Introduction Institutional Repository

The document outlines a workshop on Institutional Data Repositories held on March 4, 2008, at the Metropolitan Library System in Chicago. It covers the purpose, organization, and management of institutional repositories, emphasizing the importance of data curation and sharing for research visibility and long-term access. The workshop includes discussions, lab activities, and reflections on best practices for data management in academic settings.

Uploaded by

Valentish
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
36 views60 pages

Introduction Institutional Repository

The document outlines a workshop on Institutional Data Repositories held on March 4, 2008, at the Metropolitan Library System in Chicago. It covers the purpose, organization, and management of institutional repositories, emphasizing the importance of data curation and sharing for research visibility and long-term access. The workshop includes discussions, lab activities, and reflections on best practices for data management in academic settings.

Uploaded by

Valentish
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/27236384

Introduction to Institutional Data Repositories Workshop

Article · January 2008


Source: OAI

CITATION READS
1 80

2 authors, including:

Melissa H. Cragin
National Science Foundation
40 PUBLICATIONS 1,092 CITATIONS

SEE PROFILE

All content following this page was uploaded by Melissa H. Cragin on 22 May 2014.

The user has requested enhancement of the downloaded file.


Purdue University
Purdue e-Pubs
Libraries Research Publications Purdue Libraries

3-4-2008

Introduction to Institutional Data Repositories


Workshop
Michael Witt
Purdue University, [email protected]

Melissa Cragin
University of Illinois at Urbana-Champaign, [email protected]

Witt, Michael and Cragin, Melissa, "Introduction to Institutional Data Repositories Workshop" (2008). Libraries Research
Publications. Paper 83.
http://docs.lib.purdue.edu/lib_research/83

This document has been made available through Purdue e-Pubs, a service of the Purdue University Libraries. Please contact [email protected] for
additional information.
Introduction to Institutional
Data Repositories

Metropolitan Library System


Chicago, IL

March 4, 2008 9:30 am – 3:30 pm


Michael Witt
Purdue University

Melissa Cragin
University of Illinois at Urbana-Champaign

Today’s Agenda

9:30 - 11:00 Introductions: ourselves, data, repositories


Rationale

11:00 - 12:00 Lab activity #1

12:00 -1:00 Lunch

1:00 - 1:30 Data curation


Data collections

1:30 - 2:15 Issues to discuss

2:15 - 3:00 Lab activity #2

3:00 - 3:30 Roles, resources, and approaches


Closing discussion
A Introduction
An I t d ti to t Institutional
I tit ti l
Data Repositories

Metropolitan Library System


March 4,, 2008 9:30 am - 3:30 p
pm
Chicago, IL

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Introductions
• Melissa Cragin – [email protected]
Vi iti Project
Visiting P j t Coordinator
C di t
Data Curation Education Program (DCEP)
Graduate School of Libraryy and Information Science
University of Illinois at Urbana-Champaign

• Michael Witt – [email protected]


mwitt@purdue edu
Interdisciplinary Research Librarian
assistant professor of library science
Di t ib t d D
Distributed Data
t C
Curation
ti C Center
t (D2C2)
Purdue University Libraries

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Overview of the day
9:30 - 11:00 Introductions: ourselves, data, repositories
R ti
Rationale
l
11:00 - 12:00 Lab activity #1
12:00 -1:00
1:00 Lunch
1:00 - 1:30 Data curation
Data collections
1:30 - 2:15 Issues to discuss
2:15 - 3:00 y #2
Lab activity
3:00 - 3:30 Roles, resources, and approaches
Closing discussion

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
What is data?
DATA

INFORMATION

KNOWLEDGE
Zins, C. (2007). Conceptual approaches for defining data, information, and knowledge. Journal of the American
Society for Information Science and Technology. 58(4), 479-493.

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
What is an institutional repository?
“A university-based institutional repository is a set
of services that a university offers to the
members of its community for the management
and dissemination of digital materials created by
the institution and its community members
members. It is
most essentially an organizational commitment
to the stewardship of these digital materials,
i l di llong-term
including t preservation
ti where
h
appropriate, as well as organization and access
or distribution."
Lynch, C. (2003). Institutional repositories: essential infrastructure for scholarship in the digital age. ARL
Bimonthly Report No. 226.

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Purposes of Repositories
• Enable digital asset management
• Offer preservation services
• Provide institutional visibility through access to collective intellectual work
• Support learning, teaching, and research
• Facilitate discovery of content
• Enable re-use and re-purposing of content
• Support archival business requirements
• Offer alternative channels in support of scholarly communication
• Organize information to allow effective content management and access
• Provide access to outcomes of publicly funded research initiatives
• Strengthen
St th partnership
t hi between
b t content
t t creators/providers
t / id andd content
t t
managers
Rieger, Oya Y. (2007). Select for Success: Key Principles in Assessing Repository Models. D-Lib Magazine, 13(7/8).

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Organization of Repositories
Who runs them?
– domain ((e.g.
g chemistry y – NIST Chemistryy WebBook))
– discipline (e.g. crystallography – National Crystallography Service)
– institutional (most often university based)

Content
– metadata
– e-prints
– cultural history materials
– natural history records
– audio-visual (multimedia)
– scientific
i tifi d data
t
– digital humanities and social science scholarship
– social science data (quantitative)

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
What is IDEALS?
Institutional repository for the scholarship and research in
digital form of the faculty, students, and staff of the
University of Illinois at Urbana-Champaign.

• Completed pilot phase


• In “quiet” production as of Fall 2007
• A joint initiative between the University Library and CITES
with support from the Office of the Provost

http://ideals.uiuc.edu/

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
The Issues
( why
(or h the
h UUniversity
i i off Illi
Illinois
i iis iinvesting
i iin
IDEALS)
• What research can you access?
• Who can access your research?
• Will your research
h make
k th
the bi
biggestt possible
ibl iimpact?
t?

• What can yyou do with yyour p


published research?

• Will your research be available 10, 50, 100+ years into


the future?
• What about all those technical reports, data sets, and
other grey literature?

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
IDEALS goals….
goals
• Help increase access to published and unpublished
research
h

• Help
p increase the impact
p to p
published and
unpublished research

• Provide a persistent, permanent URL for your


research

• Preserve research for long term access and use

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Other services we offer…
offer

• Consultation on copyright issues


• Access restricted items or collections in
IDEALS
• Statistics on number of downloads (regular
weekly or monthly reports are coming
soon…)

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Introduction to Institutional Michael Witt Melissa Cragin
Data Repositories Purdue University University of Illinois
Introduction to Institutional Michael Witt Melissa Cragin
Data Repositories Purdue University University of Illinois
Introduction to Institutional Michael Witt Melissa Cragin
Data Repositories Purdue University University of Illinois
Michael Witt Melissa Cragin
Purdue University University of Illinois
(live demo)
http://docs.lib.purdue.edu

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Why share/archive data?

To get all of your organization's data in one


place.
place
• described properly
• preserved properly
• accessible using g the same tools and
services

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Why share/archive data?

It’s good science.


(
(your results
lt can b
be reproduced)
d d)

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Why share/archive data?

Some funding agencies and organizations


are beginning to require data archiving
and sharing.
(
(e.g., NIH
NIH, open access stipulations)
ti l ti )

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Why share/archive data?

Data can be re-used.


• save $$ by
b nott recreating
ti ththe same d
data
t
• can be used to advance similar research
• can be used to advance research in
p
another discipline

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Why share/archive data?

Data can be re-purposed – the Long Tail.


(
(e.g., as a learning
l i object)
bj t)

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Why share/archive data?

Datasets are assets with value.


• legacy
l
• credibility
• annual reporting

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Why share/archive data?

Datasets can be the building blocks of a


virtual organization or help build an online
community of scholarship and advance
standards.
standards

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Why share/archive data?

The IInformation
Th f ti
Bottleneck

Hacker, T. (2007).

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Why librarians?
• technical services - description
p & classification
• access services – how people use information
• archival science & digital libraries
• 200-year view
• institutional commitment
• trust & neutrality
• awareness of research taking place
• interdisciplinary collaboration (strengthen grant and
project proposals)

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Roles
• Librarians
• E
Executives
ti / top-level
t l l administrators
d i i t t
• IT department
• Legal counsel
• Research administration
• And most importantly,
p y, yyour researchers
• Communities
• Sustainability
• Who does what will be different in every organization.

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Lab Activity #1 (45 minutes)
Scenario:
The Water Quality Field Station at Purdue’s
Purdue s ACRE research farm comprises plots of land
with different soil compositions and treatments. Pipes running under the ground
collect water and channel it into sensors that measure the amount of flow by
counting the number of times that a calibrated bucket fills up and “tips” in an hour.
These files are the output of a data logger that is attached to one of these sensors.

Instructions:
1. Create an account in either Many Eyes [1] (use dataset-many.txt) or Swivel [2] (use
dataset-swivel.csv).
2. Describe and then upload
p the data.
3. Once it has been uploaded, analyze the results.
4. Compare your dataset and results with others in the repository.

We’ll share our results,, observations,, and questions


q before breaking
g for lunch at noon.

[1] Many Eyes, http://www.many-eyes.com/


[2] Swivel, http://swivel.com

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Reflection
Take 10 minutes to write down notes in your
“data repository prospectus” and relate
the lab activity and discussion to how you
would plan and provision data repository
services for your institution
institution.

We will share these at the end of the


workshop for our closing discussion.

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Lunch

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Intro to Data Curation
• Curation: managing and promoting the use of data from its point of
creation to ensure it is fit for contemporary purpose,
creation, purpose and available for
discovery and re-use. … Higher levels of curation will also involve
maintaining links with annotation and with other published materials.
Curation activities include:

– Archiving: A curation activity which ensures that data is properly selected,


stored, can be accessed and that its logical and physical integrity is maintained
over time, including security and authenticity.

– Preservation: An activity within archiving in which specific items of data are


maintained over time so that they can still be accessed and understood through
changes in technology.

http://www.jisc.ac.uk/media/documents/programmes/preservation/e-sciencereportfinal.pdf

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Intro to Digital
g Data Collections
Three levels of collections
R
Research
h
“small science,” local, limited to no
funding, not often standard
Resource
driven by a research community,
some funding, emerging
standards
Reference
Large scale, generally homogeneous
and standardized data, for use by
multiple communities for various
purposes

http://www.nsf.gov/pubs/2005/nsb0540/

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Small Science or Research Collections
data and web site often managed “in
house” – generally no plans for
long-term access or preservation

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Resource and Reference Collections

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Issues to discuss…
discuss
• Access
• Persistence
• Provenance
• Ingest and scale
• Intellectual property and permissions
• Policies
• Selection and appraisal
• Metadata
• Preservation

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Access

• user interface
• interoperability
y

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Persistence
• Provides the ability to "cite"
cite data: a unique
identifier and persistent, resolvable link
• Example: Handle system (digital object
identifiers)

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Provenance
“Data p
provenance is the description p of the origins
g
of a piece of data and the process by which it
arrived in a database.” (Buneman et. al., 2001)
– in the context of databases

Data provenance is “information that helps


determine the derivation history of a data
product starting from its original sources
product, sources.” (Simmhan
et. al., 2005)

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Provenance is used for
for…
• Reliability and quality
• Justification and audit (e.g. audit trails)
• Reusability, reproducibility and repeatability
– “replication recipes”
• Change and evolution
• Ownership, security, credit and copyright
– attribution
• Migration and storage
• Aggregation
• Versioning
C. Goble, "Position Statement: Musings on Provenance, Workflow and (Semantic Web) Annotations for Bioinformatics," in
Workshop on Data Derivation and Provenance, Chicago, 2002.
Y. Simmhan, B. Plale, and D. Gannon (2005). A survey of data provenance in e-science. SIGMOD Record, 34, 31-36.

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Ingest & Scale

• automate
t t processes as much h as possible
ibl
• batch and scheduled ingest
• scale is a problem with large datasets
• don’tt mistake size for value
don

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Intellectual Property &
A
Access Control
C l
• Who owns the data?
• Who is allowed to access the data?
– open access / IP / local user group / directory
((LDAP,, Active Directory,y, NDS)) / Shibboleth /
dark archive, embargo
– proprietary, clinical, privacy (FERPA)

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Policies
• selection policy
• submission policy
• preservation
ti policy
li
• usage policy

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Selection and appraisal
• Scientific criteria
– relevance; size; scope
• Technical
– internal format; size; transfer (media)
• Administrative
– documentation; p
privacy;
y ownership
p
• Financial
– costs (data preparation and management over time)

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Metadata
• Descriptive
• Administrative
– rights
– provenance
• Technical
– file formats
– components and relationships
• Preservation

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Preservation
• When is an IR part of a preservation plan?

• Need
N d iin place…
l metadata,
t d t policies,
li i
• commitment from institution
– resources

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Lab Activity #2 (20 minutes)
Select a data collection from the provided list of repositories that is in your area
of interest. Explore the repository or datasets and try to answer these
questions.
ti Share
Sh and
d di
discuss your fifindings
di with
ith th
the group.

• Who administers the repository?


p y
• How is the data organized?
• What metadata format is used for description? What is being described?
• How do they acquire data?
• Who owns the data?
• Is there a data selection policy?
• Is there a data use policy?
• Is it clear how to use the data?
• What services are offered by the repository (e.g., search, browse,
preservation)?

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Reflection
Take 10 minutes to revisit your “data
data repository
prospectus” and relate the activities and
discussion to your local environment. Write
down any new ideas or observations and try to
fill in any remaining blanks.

We will share these at the end of the workshop for


our closing
l i di discussion.
i

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
How is it that data collections are
coming to be part of the UIUC IR?
• Currently working with:
– 25 years of vole demographics
– Polar temperature data
– 100 years off corn d
data
t
– XRay Crystallography data

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Crystallography:
Extending Collaborations and Services

Raw Data

OAI Request UIUC

OAI Data
Provider
CIFs
OAI Response
earch

End User

OAI Harvestter
OAI Request
SPECTRa Se

Software

OAI Data
Provider
Aggregated Cambridge Raw Data
Metadata from CIFs OAI Response U. CIFs

Raw Data

OAI Request

OAI Data
Provider
Other
OAI Response CIFs
Other
CIFs
Other
CIFs
Other
CIFs
Other
CIFs

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Introduction to Institutional Michael Witt Melissa Cragin
Data Repositories Purdue University University of Illinois
Supporting interdisciplinary research
with
ith d
data
t curation
ti att PPurdue…
d
• new dean
d off lib
libraries,
i new strategic
t t i planl
• creation of the Distributed Data Curation Center
(D2C2) http://d2c2.lib.purdue.edu
(D2C2), http://d2c2 lib purdue edu
• Interdisciplinary Research Librarian: help
integrate
g library y science and librarians into
interdisciplinary research; Discovery Park
• experimentation > projects > towards production
i f t t
infrastructure andd services
i
• Purdue e-Data

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Project plans

&

closing discussion

Introduction to Institutional Michael Witt Melissa Cragin


Data Repositories Purdue University University of Illinois
Data Repository Prospectus

What is the value of a data repository to your specific organization and how does it fit into your
mission?

Who will use your repository and what will their What different kinds of data will you collect
needs be? in your repository? Will you include other
kinds of research materials?

What services or features will be offered by


your repository?
Introduction to Institutional Data Repositories – March 4, 2008 – Michael Witt & Melissa Cragin

What is your plan for acquiring content and building data collections? (How will data be submitted or
ingested into the repository? What might you include in a selection policy?)

What formats might you use to describe data How might you market the data repository?
collections and datasets?

How will you address intellectual property and access What are some ways you could measure and
control concerns? evaluate the success of the data repository?
Introduction to Institutional Data Repositories – March 4, 2008 – Michael Witt & Melissa Cragin

What human resources will be required to launch and then maintain the repository?
(For example, what resources will be needed to manage the ingest process? Who will prepare and upload the data? What
kinds of metadata and description will you require?)

What different roles and responsibilities will be What information resources are available for
needed and how do these map to your organization you to learn more about repository software to
and personnel? facilitate the selection process for your
organization? (Who else in your organization might
have valuable information that would be helpful to you
in making technical decisions?)

Just for fun… What are some possible names for


your data repository?
List of Data Repositories

Organization Web address Notes


NOAA’s National Environmental
Satellite, Data, and Information Entry-way for several data centers and
1. Service http://www.nesdis.noaa.gov/datainfo.html collections.
https://www.cia.gov/library/publications/the-world- geographic, economic and population
2. CIA World Factbook factbook/index.html data
3. US Census Bureau http://factfinder.census.gov/home/saff/main.html?_lang=en census data (various)
4. USGS Earthquake hazards program http://earthquake.usgs.gov/eqcenter/catalogs/
5. National Snow and Ice Data Center http://nsidc.org/
North Temperate Lakes Long Term
6. Ecological Research http://lterquery.limnology.wisc.edu/index_new.jsp
7. British Atmospheric Data Center http://badc.nerc.ac.uk/home/index.html
ICPSR - The Interuniversity
Consortium for Political and Social
8. Research http://www.icpsr.umich.edu/ICPSR/access/index.html
9. Roper Center Public Opinion Archives http://www.ropercenter.uconn.edu/data_access.html
10. UK Data Archive http://www.data-archive.ac.uk/findingData/data.asp
GBIF - Global Biodiversity Information
11. Facility http://data.gbif.org/welcome.htm Also see: http://ge.gbif.net/
“open, shared database of the world’s
12. Harvard’s Freebase http://freebase.com/view/allDomains knowledge”
13. Data360 http://www.data360.org/index.aspx "clarify the current condition"
14. StatCrunch http://www.statcrunch.com/ “data analysis on the web”
“…two primary sets of long-term vole
Long Term Vole Demographic Data demographic data files collected and
15. and Selected Publications http://www.ideals.uiuc.edu/handle/2142/161 maintained by Professor Lowell L. Getz.”
Evolutionary Infrastructure: Boston's Small data collection for a humanities
16. Back Bay Fens http://www3.iath.virginia.edu/backbay/ project
Allen Institute For Brain Science - Maps of brain “geography” at many
17. Allen Brain Atlas http://www.brain-map.org/welcome.do scales
18. fMRI Dcata Center http://www.fmridc.org/f/fmridc cognitive neuroscience collection

Introduction to Institutional Data Repositories – March 4, 2008 – Michael Witt & Melissa Cragin
Introduction to Institutional Data Repositories – March 4, 2008 – Michael Witt & Melissa Cragin

SELECTED BIBLIOGRAPHY

1. NSF National Science Board. Long‐lived digital data collections enabling research and education
in the 21st Century. September 2005. http://www.nsf.gov/pubs/2005/nsb0540/.

2. ARL/NSF Workshop on Long‐Term Stewardship of Digital Data Collections. To stand the test of
time: long‐term stewardship of digital data sets in science and engineering. September 2006.
http://www.arl.org/pp/access/nsfworkshop.shtml.

3. Anna Gold. Cyberinfrastructure, data, and libraries, part 1: A cyberinfrastructure primer for
librarians. D‐Lib Magazine, 13(9/10), September/October 2007.
http://www.dlib.org/dlib/september07/gold/09gold‐pt1.html.

4. Anna Gold. Cyberinfrastructure, data, and libraries, part 2: Libraries and the data challenge:
Roles and actions for libraries. D‐Lib Magazine, 13(9/10), September/October 2007.
http://www.dlib.org/dlib/september07/gold/09gold‐pt2.html.

5. Microsoft Research. Towards 2020 science. March 2006.


http://research.microsoft.com/towards2020science/downloads/T2020S_ReportA4.pdf

6. NSF Cyberinfrastructure Council. Cyberinfrastructure vision for 21st century discovery. March
2007. http://www.nsf.gov/od/oci/CI_Vision_March07.pdf.

7. Philip Lord, Alison Macdonald, Liz Lyon, David Giaretta. From data deluge to data curation.
Proceedings of the UK e‐Science All Hands Meeting, September 2004.
http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/150.pdf.

8. Oya Y. Rieger. Select for success: Key principles in assessing repository models. D‐Lib
Magazine, 13(7/8), July/August 2007. http://www.dlib.org/dlib/july07/rieger/07rieger.html.

9. NSF Blue‐Ribbon Advisory Panel on Cyberinfrastructure. Revolutionizing science and


engineering through cyberinfrastructure. January 2003.
http://www.nsf.gov/od/oci/reports/atkins.pdf.

10. Inter‐University Consortium for Political and Social Research (ICPSR). Guide to social science
data preparation and archiving. 2005. http://www.icpsr.umich.edu/access/dataprep.pdf.

11. Ross Harvey. Appraisal and selection. Data curation manual, Digital Curation Centre. 2007.
http://www.dcc.ac.uk/resource/curation‐manual/chapters/appraisal‐and‐selection/appraisal‐
and‐selection.pdf.
12. Neil Beagrie. Digital curation for science, digital libraries, and Individuals. The International
Journal of Digital Curation, 1(1), 3‐16. 2006. http://www.ijdc.net/ijdc/article/view/6/5.

13. Chris Greer. A vision for the digital data universe. January 2007.
http://www.nanohub.org/resources/2291/.

14. NISO Framework Advisory Group. A framework of guidance for building good digital
collections. Second edition, 2004. http://www.niso.org/framework/Framework2.pdf.

15. Richard L. Moore, Jim D’Aoust, Robert H. McDonald, David Minor. Disk and tape storage cost
models. Proceedings of the 2007 IS&T Archiving Conference.
http://www.sdsc.edu/~mcdonald/content/papers/dt_cost.pdf.

View publication stats

You might also like