Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Long-term data archiving

Journal Article · · Analytical and Bioanalytical Chemistry
OSTI ID:971300

Long term data archiving has much value for chemists, not only to retain access to research and product development records, but also to enable new developments and new discoveries. There are some recent regulatory requirements (e.g., FDA 21 CFR Part 11), but good science and good business both benefit regardless. A particular example of the benefits of and need for long term data archiving is the management of data from spectroscopic laboratory instruments. The sheer amount of spectroscopic data is increasing at a scary rate, and the pressures to archive come from the expense to create the data (or recreate it if it is lost) as well as its high information content. The goal of long-term data archiving is to save and organize instrument data files as well as any needed meta data (such as sample ID, LIMS information, operator, date, time, instrument conditions, sample type, excitation details, environmental parameters, etc.). This editorial explores the issues involved in long-term data archiving using the example of Raman spectral databases. There are at present several such databases, including common data format libraries and proprietary libraries. However, such databases and libraries should ultimately satisfy stringent criteria for long term data archiving, including readability for long times into the future, robustness to changes in computer hardware and operating systems, and use of public domain data formats. The latter criterion implies the data format should be platform independent and the tools to create the data format should be easily and publicly obtainable or developable. Several examples of attempts at spectral libraries exist, such as the ASTM ANDI format, and the JCAMP-DX format. On the other hand, proprietary library spectra can be exchanged and manipulated using proprietary tools. As the above examples have deficiencies according to the three long term data archiving criteria, Extensible Markup Language (XML; a product of the World Wide Web Consortium, an independent standards body) as a new data interchange tool is being investigated and implemented. In order to facilitate data archiving, Raman data needs calibration as well as some other kinds of data treatment. Figure 1 illustrates schematically the present situation for Raman data calibration in the world-wide Raman spectroscopy community, and presents some of the terminology used.

Research Organization:
Los Alamos National Laboratory (LANL)
Sponsoring Organization:
DOE
DOE Contract Number:
AC52-06NA25396
OSTI ID:
971300
Report Number(s):
LA-UR-09-06149; LA-UR-09-6149
Journal Information:
Analytical and Bioanalytical Chemistry, Journal Name: Analytical and Bioanalytical Chemistry; ISSN 1618-2642
Country of Publication:
United States
Language:
English

Similar Records

Lightweight performance data collectors 2.0 with Eiger support.
Technical Report · Wed May 01 00:00:00 EDT 2013 · OSTI ID:1095927

The SDSS data archive server
Journal Article · Mon Oct 01 00:00:00 EDT 2007 · OSTI ID:924532

A Conceptual World Information Library (WIL) and Land Use Information System (LUIS) - 16181
Conference · Fri Jul 01 00:00:00 EDT 2016 · OSTI ID:22838060