| The following are excerpts from the International Oceanographic Data and Information Excahnge (IDOE) http://www.iode.org's General Infomation pages about Data Management and Info Management. |
| Marine Data Management: we can do more, but can we do better? |
|---|
| The planning and implementation of research, and the efficient management of the resulting data often appear to be two widely separated worlds. Data managers consider the careful collection, management and dissemination of research data as essential for the effective use of research funds. Many researchers, on the other hand, consider data management as technical, boring and an (un)necessary evil; so data management is often insufficiently planned, or not planned for at all, and is assigned a low priority. This is unfortunate, as there is much of social relevance and applicability in the colourful world of oceanographic data management. |
| Marine data management: a working definition |
|---|
| First, we need to distinguish 'data' from 'information'. 'Data' are observable, raw 'values' that result from research or monitoring activities; these values can be numerical (as in temperature or salinity measurements) or nominal (as in species lists for a particular region). The term 'information' is commonly used to mean data that have already been processed and/or interpreted results. In that sense, so-called 'metadata', i.e. data about data (e.g. by whom, at what time, where and how the results were collected) can be considered a special kind of 'information'. |
| Tackling a growing problem |
|---|
| The social relevance of measurement and sampling at sea, and the need to disseminate the results as widely and in as user-friendly a manner as possible, cannot be overestimated. More services and products useful to industry, the general public and policy makers, could, and should, be extracted from databases. The oceans cover two-thirds of the Earth, and about half the world population live in coastal areas, so monitoring the health, resources and 'tantrums' of the global ocean is no luxury. There are many applications of data-management that relate to climate and weather, safety at sea and along the coast, fisheries, offshore activities, management of the seas, etc. |
| Data centres in evolution |
|---|
|
Changes in technology and changes in society are both forcing data
centres to rethink their role and modus operandi. Another trend is
the increased interest in biodiversity and the need to set up
management and monitoring programmes to study marine (and other)
biodiversity. Human-induced world-wide changes, such as global
warming, will no doubt affect our living resources; one of the
challenges of the new data centres is to integrate biological and
physicochemical data and make both data types available for combined
analysis. These and other developments were discussed at the
'Colour of Ocean Data' Symposium, held in
Brussels in 2002. The last part of the symposium was dedicated to a
panel discussion, in which the changing role of data centres was
discussed. What follows is a brief overview of the most important
trends and issues that were identified.
There is a trend away from the traditional data centre, with its main task of archiving datasets, towards becoming more service-orientated. Data centres can look towards libraries for inspiration to redefine their role; libraries provide expertise and guidance in cataloguing. Archives are grey and dusty, libraries are active and open; data centres should strive to resemble the latter rather than the former. Data management needs an equivalent to the 'Web of Science': a mechanism to bring up a list of relevant, available, quality controlled and peer-reviewed datasets. Any mechanism for finding data - i.e. 'data discovery' - is meaningless (and very frustrating&) if it is not linked to a system for data distribution, through which the scientist or interested layperson can access actual data. Setting up both data discovery and data distribution mechanisms is made possible by recent developments in internet and database technology. Some traditional roles of data centres remain important: long-term stewardship of data, integrating datasets, documenting and redistributing datasets, development of standards and standard operational procedures, etc... Datasets often result from particular projects, which usually have a limited time-span. Short-term data management, within the time span of the project, is usually not a problem: scientists need data management to produce useful results; moreover, making provisions for data management is often a prerequisite for getting a proposal accepted in the first place. After the project ends, however, there is a danger that detailed knowledge about the collected data can disappear, together with project staff. It is the mandate of data centres to work together with project staff, to ensure that data are properly documented, and that the integrity of the data themselves are safeguarded after completion of the project. Meta-databases and other methods of data discovery will certainly gain more and more importance as the number of studies, and the number of scientists conducting these studies, increases. Such methods of data discovery, and better communication between and among scientists and data managers, are essential for avoiding unnecessary duplication of effort. There is a need to create data and information products, not only for other data managers and scientists, but also for policy makers and society at large. These products will assist in increasing the visibility of data centres and demonstrate the usefulness of data management to a larger audience. As such, it may assist in attracting funds for further activities as well as data submissions from scientists. Unfortunately, marine scientists are generally very poorly informed about data centres and about data and information management procedures. There is a need to investigate how to put data and information management on the curriculum of academic institutions. This would result in a greater awareness of data centres, and an increased quantity and quality of data submissions. Data managers should actively seek collaboration with scientists. Involvement of data managers in the planning of projects at a very early stage, and more input in the development of data collection makes 'Beginning-to-end data management' a reality. |
| The future |
|---|
|
Managers of marine data are facing major challenges. First, there
is the incredible increase in the volume of data, especially in the
area of remote sensing. Second, there is the great diversity in the
types of data that have to be handled: physicochemical, geological,
meteorological and biological data, all have to be integrated, and
analyses and information products have to draw on all of them.
Last but not least, there is a major discrepancy between the scale
at which data are typically gathered, and the scale at which the
data and information are needed. With very few exceptions, projects
collect data and information on local scales, and over short
time-spans. Humanity has brought on itself problems such as
global warming and consequent sea-level rise, depletion of
fish stocks, and pollution, which have generated a need for data
and information on a global scale; integration of all available
local datasets is the only way to create a data- and information
base to support global decision-making.
Modern data management is inseparable from information technology. Recent developments in technology assist in coping with both the diversity and volume of data flows. The internet provides means to exchange data at no - or very low - cost. Electronic publishing is more and more the method of choice for communicating research results and other information. Database systems are becoming more sophisticated, allowing scientists and data managers to concentrate on subject matter rather than technical nitty-gritty. Computer systems are becoming faster, hard disk and other storage space is becoming cheaper, and information technology is making it possible to conduct data management, and devise information products, that could only be dreamed of just a couple of years ago. The main challenge for data managers is now to remain in control of developments, and not to let marine data management become technology-driven. Obviously, recent technical developments should be monitored, and put to good use whenever and wherever relevant. But it is more important to continuously re-evaluate what the role of the data centres should be, rather than how objectives are being realized. The real issues for data management are standardization, collaboration and enabling knowledge-based decision-making. Obviously, we can do 'more'. But can we also do 'better'? |
| Marine Information Management: helping to understand and protect the marine environment |
|---|
|
On our blue planet, the dominant feature is ocean. It contains 97
percent of the Earth's water and releases vapor into the atmosphere
that returns as rain, sleet, and snow, ever replenishing the planet
with freshwater. All life, including our own, is dependent on the
ocean. Understanding the ocean is integral to comprehending this
planet on which we live. Understanding the ocean is more than a
matter of curiosity. Exploration, inquiry and study are required
to better understand ocean systems and processes.
Marine Science Libraries hold an important role in promoting information about the marine environment: information provision to the policy makers - educating the next generation of environmental stewards; attracting a future environmentally concerned workforce and generating an ocean literate public that understands the value of the ocean and can make appropriate decisions to protect it. On an International scale, networks of Marine Information Management (MIM) Centers, collaborating to produce products and services, strengthen our global understanding of ocean processes and conditions. Marine Information Management is a vital process in this knowledge cycle. |
| Our Users and Partners |
|---|
| The users of Marine Information include research scientists, policy makers, students at all levels, educators, industry and businesses. Marine Information Management Centers interact with Marine Data managers to deliver information products, e.g. Data that has been processed and interpreted. We may repackage the data in the form of electronic citation databases, internet bibliographies, regional repositories of stored and accessible scientific research, online catalogs of specialized collections, or digitized collections of difficult to find scientific studies. We establish national and international standards to disseminate this information, and we form groups of networked individuals and professional societies to collaborate on new products, on training courses and technology for the delivery of marine and atmospheric information. |