The Management of Oil Industry Exploration & Production Data

Who are we? Purchase Options - B&W USA/World - B&W UK - B&W Germany - B&W France - Color USA/World - Color UK - Color Germany - Color France
1 Introduction 12 Physical data
2 Value of data 13 Documents
3 Subsurface data 14 Auditing
4 Current practice 15 Quality
5 DMBoK 16 Other elements
6 Governance 17 Assessing
7 Architecture 18 Glossary
8 Development 19 Figures
9 Operations 20 Bibliography
10 Security 21 Index
11 Corporate data 22 Further info
Upcoming events New articles Extra material Links
Sample chapter Figures Bibliography Extra material Historical Papers

by Steve Hawtin
17 Aug 2013

Data, Information and Knowledge

KID Pyramid

I find that I often confuse the definition of data, information and knowledge. I have, of course, seen a lot of presentations that blithely assume everyone knows what Acroff meant when he first used the terms, however I have also come across quite a few experienced people whose definitions are clearly quite different from my own. I figured it would be worthwhile to attempt to succinctly define them, so at least I have a definition of what they mean for myself.

Data: the simplest of the levels, this is just a set of values. Data has no significance beyond its existence (in and of itself) and, in isolation, has no meaning. All data can be represented as a sequence of yes/no choices, hence the volume of data you have is measured in the number of bits required to represent it.

Information: a set of data within its context. Information affects the state of a dynamic system, such as our understanding of a situation. It resolves uncertainty, so the measure of how much information is present is related to the change in a probability estimate that it causes. Data that is uninterrupted, untrusted or unsurprising contains no information.

Knowledge: the processing, organisation and structuring of information. It is the synthesis of information from multiple sources that leads to understanding. Admittedly this is an elusive concept which is difficult to define and certainly hard to agree on.

Wisdom: an extrapolative appreciation of the underlying situation. In other words wisdom is the ability to predict future knowledge. This idea is even more slippery than knowledge.

The precise definitions of data and information come from Claude Shannon, who in 1948 provided a clear mathematical basis for the whole topic (his ideas underlie many of today's key technologies). The rather loose descriptions here are my attempt to summarise that work, anyone working with "information" should at least be aware of the original paper (available at cm.bell-labs.com/cm/ms/what/shannonday/shannon1948.pdf).

The other two terms are much less clearly defined. There are some computer based systems that I would consider to handle knowledge, for example IBM's "Watson", which competed and won a special version of the TV game show "Jeopardy" and the "Cyc Project" that has been attempting to encode "common sense" since 1984. But, my experience is that the vast majority of claims to handle "knowledge" within computer based systems usually have more to do with sales hype than reality. My own view is that anyone that claims their software handles "wisdom" is clearly just focused on parting you from your money, avoid them.


prev icon
Article 17
 
paper icon
Articles
 
rss icon
RSS Feed
 
news icon
Updates
 
home icon
Intro
 
toc icon
Book Contents
 
figure icon
All Figures
 
biblio icon
Refs
 
download icon
Downloads
 
website icon
Links
 
buy icon
Purchase
 
contact icon
Contact Us
 
next icon
Article 19
 

Comment on the contents of the 'Data, Information and Knowledge' page
Subject: Email to Reply To (optional):