Minor, David; Phillips, Mark; Schultz, Matt
This paper will examine ongoing work between two
major preservation systems, the Chronopolis Digital
Preservation Program and the MetaArchive
Cooperative. In the past year, these two systems
have begun work on bridging their technical
underpinnings to create a more robust, reliable, long-lived
preservation community for their users. The main
emphasis of this work is moving data between a
LOCKSS-based system (MetaArchive) and an iRODS-based
one (Chronopolis). This work also involves
several other emerging preservation micro-service tools
and practices, and the expertise of the University of
North Texas (UNT) Digital Library in deploying
them. The final result of this work is intended to be of
three-fold benefit: 1) directly improving the services
This paper examines preservation planning as it is
implemented within the National Library’s preservation
repository (Rosetta) and compares it directly to the
PLATO tool created as part of the PLANETS project.
Preservation planning is both a business precondition
and the systematic framework defining any preservation
action. At the National Library of New Zealand Te Puna
Mātauranga o Aotearoa, preservation planning is
embedded within the Rosetta system.
For the Library, the challenge can be stated simply:
preserve New Zealand’s digital documentary heritage.
With no limitations or control over the format of the
content that is collected and preserved, the National
Library of New Zealand Te Puna Mātauranga o
Aotearoa (NLNZ) has ‘issues’ to resolve before...
MASSOL, Marion; ROUCHON, Olivier
This paper outlines the recent initiative run at CINES,
a national IT datacentre for French academic
researchers, to formalize the business processes of its
department dedicated to the long-term preservation of
digital objects, which is at present one of the very few
operational long-term preservation platforms in France
for the public sector and Higher Education and Research
One of the strategic goals of this organization is the
evaluation and assessment of service quality. The
process formalization activity, coupled with an
external audit and an ITIL approach, highlighted the
department's good practices, gaps and weaknesses.
The global process map and most of the twenty
detailed process maps have been put together...
Korb, Joachim; Strodl, Stephan
Today, more and more information is being produced in a digital form. In addition to this so-called born-digital content, material that was produced to exist in an analogue form is now being digitised both for preservation and for easier access. This digital information comes in an ever greater variety of formats, many of which are relatively short-lived. Newer versions of the same software are often unable to render files produced with older versions of that software, let alone files produced with similar software from other vendors. Soft- and hardware environments change constantly and after only a few years can older...
Kirchhoff, Amy; Morrissey, Sheila; Fenton, Eileen; Orphan, Stephanie
The scholarly community’s dependence on electronic
resources is rapidly increasing and those electronic
resources are increasingly preserved in digital
repositories or other preservation services. Whether
locally hosted at libraries, collaboratively hosted
between institutions, or externally hosted by a third
party, one way for these digital repositories to
assure themselves and their communities of their
soundness is to be audited and certified by impartial
organizations. Such independent organizations with
staff experienced in executing audits and certifications
can represent the interests of the academic community.
Such staff will have the time and skills required to
perform a thorough review of the methodologies and
policies of each digital repository.
Over the course of 2009, the Center...
Zierau, Eld; Jensen, Claus
The focus of this paper is on which digital objects to preserve when preserving digital library materials derived from original paper materials. It will investigate preservation strategies for digital objects from digitised paper material that must both be preserved and simultaneously retain a short route to dissemination. The investigation is based on a study of digitisation done a decade ago and digitisation done today. In the last decade mass digitisation has become more commonly used since technological evolution has made it cheaper and quicker. The paper explores whether there are parts of digital material digitised a decade ago worth preserving,...
van Wijngaarden, Hilde; Rog, Judith; Marijnen, Peter
The National Library of the Netherlands (KB) will renew its digital archiving environment. The current system, the e-Depot with DIAS by IBM as its technical core, has been operational since 2003 and needs to be updated. More importantly, a new system is required because KB has published a new strategic plan with ambitious goals. They require development of an infrastructure that can process, store, preserve and retrieve millions of digital objects, now and for the long term. The digital collections will include e-journals, e-books, websites and digitized master images and will grow from 20 TB currently to 720 TB in...
von Suchodoletz, Dirk; van der Hoeven, Jeffrey; Rechert, Klaus; Schröder, Jasper
After four years of research within the PLANETS project
and two years of KEEP, the jigsaw puzzle of emulation becomes
a more complete picture. Emulation strategies are
now seen as a viable complement to migration. A conceptual
and theoretical groundwork has already been laid
out, e.g. proper definition and selection of suitable emulators.
However, integration into preservation frameworks
and additional software archiving remain open research
questions. This paper discusses several aspects of reliable
integration and proposes development steps for more
complete emulation-based strategies in long-term preservation.
Van Garderen, Peter
Digital curation micro-services offer a light-weight
alternative to preservation systems that are developed
on digital repository and framework technology stacks.
These are often too complex for small and medium-sized
memory institutions to deploy and maintain. The
Archivematica project has implemented a micro-services
approach to develop an integrated suite of free and
open-source tools that allows users to process digital
objects from ingest to access while applying
format-specific preservation policies. Inspired by a call to
action in a recent UNESCO Memory of the World
report, the goal of the Archivematica project is to reduce
the cost and technical complexity of deploying a
comprehensive, interoperable digital curation solution
that is compliant with standards and best...
Hitchcock, Steve; Rauber, Andreas; Kulovits, Hannes; Tarrant, David; Carr, Les
An accepted digital preservation workflow is emerging in
which file formats are identified and those believed to be
at risk are migrated to what are perceived to be less risky
formats. This raises important questions about what to
convert and when, if at all. In other words, how should
file identification and migration be connected? This area has become
known as preservation planning, and seeks to take
account of a wide variety of factors that might impact
preservation decisions. Broadly there are two approaches
to preservation planning. One provided in some digital
preservation systems is to simplify and reduce both the
number of file formats stored and therefore limit the number
Schultz, Matt; Gore, Emily B.
Distributed digital preservation is a maturing and
appealing solution to the pressing problem of ensuring
the survivability of digital content. Like all other digital
preservation efforts, distributed digital preservation
solutions must communicate trust to their Designated
Communities as they continue to mature. The following
paper discusses the importance of establishing this trust,
retraces the development of TRAC as a reliable tool for
evaluating trustworthy repositories, and details the
process of the MetaArchive Cooperative’s application of
TRAC to its distributed digital preservation solution.
This process revealed that the current metrics for
gauging trust in digital preservation could be readily
applied to distributed solutions with great effect.
However, because these metrics often presume a more
Popitsch, Niko; Mosser, Robert; Philipp, Wolfgang
More and more information that is considered for digital
long-term preservation is generated by Web 2.0 applications
like wikis, blogs or social networking tools. However,
there is little support for the preservation of these data today.
Currently they are preserved like regular Web sites
without taking the flexible, lightweight and mostly graph-based
data models of the underlying Web 2.0 applications
into consideration. As a result, valuable information about the
relations within these data and about links to other data is
lost. Furthermore, information about the internal structure
of the data, e.g., expressed by wiki markup languages, is
not preserved entirely.
We argue that this currently neglected information is
of high value in a long-term...
Pcolar, David; Marciano, Richard; Hou, Chien-Yi; Chassanoff, Alexandra; Davis, Daniel W.; Zhu, Bing
Given the growing need for cross-repository integration
to enable a trusted, scalable, open and distributed
content infrastructure, this paper introduces the
Policy-Driven Repository Interoperability (PoDRI) project
investigating interoperability mechanisms between
repositories at the policy level. Simply moving digital
content from one repository to another may not capture
the essential management policies needed to ensure its
integrity and authenticity. Platform-independent, policy-aware
object models, including policy expressions, and
a distributed architecture for policy-driven management
are fundamental building blocks of a sustainable access
and preservation infrastructure. This project integrates
iRODS and Fedora to demonstrate such an
infrastructure. Using iRODS and its rules engine,
combined with Fedora’s rich semantic object model for
digital objects, provides the basis for implementing...
This paper summarizes work done in a PhD study on
metadata-driven imitation for preservation of visual
appearance of web forms and/or receipts used in
eGovernment services. The research done suggests that
metadata, and e.g. a background image, can be used to
describe the visual appearance of documents, and that
this also facilitates having the data separated from the
visual appearance. This separation provides the ability
to present the material to the users in different ways,
depending on their needs and requirements, while
retaining the ability to present the object in its original
look. The original look is seen as the most versatile way
of presenting the material, giving the most fruitful...
Mitcham, Jenny; Niven, Kieron; Richards, Julian
This paper looks at some of the domain-specific preservation challenges faced by the Archaeology Data Service and how we work with these in order to maximise the re-use potential of the data that we archive. It looks in particular at one of the mandatory responsibilities of an Open Archival Information System (OAIS) and how we try to ensure that the data that we present to our designated community is ‘independently understandable’. The paper introduces the collaborative ‘Guides to Good Practice’ project which aims to provide data producers with the guidance that they need in order to create data that...
Zierau, Eld; Bøgvad Kejser, Ulla; Kulovits, Hannes
This article describes a methodology which supports evaluation of bit preservation strategies for different digital materials. This includes evaluation of alternative bit preservation solutions. The methodology presented uses the preservation planning tool Plato for evaluations, and a BR-ReMS prototype to calculate measures for how well bit preservation requirements are met. Planning storage of different types of data as part of preservation planning involves classification of the data with regard to requirements on confidentiality, bit safety, availability and costs. Selection of storage parameters is quite complex since e.g. more copies of data means better bit safety, but higher cost and higher...
This article explores preservation of digital material in a
library context with a focus on logical object modelling
that takes both preservation and dissemination into
account. The article describes normalisation of data
expressed via a logical object model. This logical object
model is designed to support the requirements for joint
preservation and dissemination. Additionally the article
includes a suggestion for a possible implementation that
respects the logical object model.
Formulation of the requirements and possible
implementation for a logical object model is based on
observation of current trends, as well as results from a
research project on preservation strategies for libraries.
The research project has been carried out at the Royal
Library of Denmark,...
Kelly, Brian; Guy, Marieke
Early adopters of blogs will have made use of
externally-hosted blog platforms, such as
Wordpress.com and Blogger.com, due, perhaps, to the
lack of a blogging infrastructure within the institution or
concerns regarding restrictive terms and conditions
covering use of such services. There will be cases in
which such blogs are now well-established and contain
useful information not only for current readership but
also as a resource which may be valuable for future
The need to preserve content which is held on such
third-party services (“the Cloud”) provides a set of new
challenges which are likely to be distinct from the
management of content hosted within the institution, for
which institutional policies should address...
The digital content lifecycle is generally understood as a set of activities: select, get and/or produce, prepare and/or assemble, describe, manage, and, as appropriate, make available. At the bit level, digital content is viewed as files on a file system. Many crucial activities of the digital content lifecycle are therefore undertaken primarily at the bit level, including transferring, moving, and inventorying files, and verifying that files have not changed over time. The identifiable entities at the bit level, files and directories, are widely and easily understood by Library of Congress digital collection data managers and curators. As part of...
Hole, Brian; Wheatley, Paul; Lin, Li; McCann, Patrick
Predicting the costs of long-term digital preservation is a
crucial yet complex task for even the largest repositories
and institutions. For smaller projects and individual
researchers faced with preservation requirements, the
problem is even more overwhelming, as they lack the
accumulated experience of the former. Yet being able to
estimate future preservation costs is vital to answering a
range of important questions for each. The LIFE (Life
Cycle Information for E-Literature) project, which has
just completed its third phase, helps institutions and
researchers address these concerns, reducing the
financial and preservation risks, and allowing decision
makers to assess a range of options in order to achieve
effective preservation while operating within financial