Poster & Book
A must for your library
 Home   About  Terms  Partnership  Contact   Search top LIS resources
Knowledge Map of Information Science: Data, Information, Knowledge
Information Science: Conceptions & Models. The British Museum Reading Room, London. This building used to be the main reading room of the British Library; now it is itself a museum exhibit. Source: Wikipedia.
What is the meaning of "data", "information", and "knowledge"?
Abstract

The field of Information Science is constantly changing. Therefore, information scientists are required to regularly review - and if necessary - redefine its fundamental building blocks. This article is one of a group of four articles, which resulted from a Critical Delphi study conducted in 2003-2005. The study, Knowledge Map of Information Science, was aimed at exploring the foundations of information science. The international panel was composed of 57 leading scholars from 16 countries, who represent (almost) all the major subfields and important aspects of the field. This particular article documents 130 definitions of data, information, and knowledge formulated by 45 scholars, and maps the major conceptual approaches for defining these three key concepts.

Definitions
 

Dr. Hanne Albrechtsen, Institute of Knowledge Sharing, Denmark

Data. In computational systems data are the coded invariances. In human discourse data are that which is stated, for instance, by informants in an empirical study. Information is related to meaning or human intention. In computational systems information is the contents of databases, the web etc. In human discourse systems information is the meaning of statements as they are intended by the speaker/writer and understood/misunderstood by the listener/reader. Knowledge is embodied in humans as the capacity to understand, explain and negotiate concepts, actions and intentions.

  Prof. Elsa Barber, University of Buenos Aires, Argentina

 Datum is the representation of concepts or other entities, fixed in or on a medium in a form suitable for communication, interpretation, or processing by human beings or by automated systems (Wellisch, 1996). Information is (1) a message used by a sender to represent one or more concepts within a communication process, intended to increase knowledge in recipients. (2) A message recorded in the text of a document... Knowledge is knowing, familiarity gained by experience; person's range of information; a theoretical or practical understanding of; the sum of what is known..

 Prof. Aldo de Albuquerque Barreto, Brazilian Institute for Information in Science and Technology, Brazil

 Data is a symbol set that is quantified and/or qualified.  Information is a set of significant sings that has the ability to create knowledge... The essence of the information phenomenon has been characterized as the occurrence of a communication process that takes place between the sender and the recipient of the message. Thus, the various concepts of information tend to concentrate on the origin and the end point of this communication process (Wersig and Neveling, 1975). Knowledge is information that has been appropriate by the user... When information is adequately assimilated, it produces knowledge, modifies the individual’s mental store of information and benefits his development and that of the society in which he lives. Thus, as the mediating agent in the production of knowledge, the information, qualifies itself, in form and substance, as significant structures able to generate knowledge for the individual and his group.

 Prof. Shifra Baruchson–Arbib, Bar Ilan University, Israel

 Data are sensory stimuli that we perceive through our senses. Information is data that has been processed into a form that is meaningful to the recipient (Davis and Olson, 1985). Knowledge is what has understood and evaluated by the knower.

 Prof. Maria Teresa Biagetti, University of Rome 1, Italy

Datum is every thing or every unit that could increase the human knowledge or could allow to enlarge our field of scientific, theoretical or practical knowledge, and that can be recorded, on whichever support , or orally handed. Data can arouse information and knowledge in our mind. Information is the change determined in the cognitive heritage of an individual. Information always develops inside of a cognitive system, or a knowing subject. Signs that constitute the words by which a document or a book has made are not information. Information starts when signs are in connection with an interpreter (Morris, 1938). Knowledge is structured and organized information that has developed inside of a cognitive system or is part of the cognitive heritage of an individual (based on C. S. Peirce, 1931, 1958).
 
  Prof. Michael Buckland, University of California, Berkeley, USA

Data. The word "data" is commonly used to refer to records or recordings encoded for use in computer, but is more widely used to refer to statistical observations and other recordings or collections of evidence.
 
Information. The word “information” is used to refer to a number of different phenomena. These phenomena have been classified into three groupings: (1) Anything perceived as potentially signifying something (e.g. printed books); (2) The process of informing; and (3) That which is learned from some evidence or communication. All three are valid uses (in English) of the term “information”. I personally am most comfortable with no. 1, then with no. 3, but acknowledge that others have used and may use no 2.
 
Knowledge.  The word “knowledge” is best used to refer to what someone knows, which is, in effect, what they believe, including belief that some of the beliefs of others should not be believed. By extension the word “knowledge” is used more loosely for (1) what social groups know collectively; and (2) what is in principle knowable because it has been recorded somehow and could be recovered even though, at any given time, no individual knows (or remembers) it.

Dr. Quentin L. Burrell, Isle of Man International Business School, Isle of Man

 Data are the basic individual items of numeric or other information, garnered through observation; but in themselves, without context, they are devoid of information. Information is that which is conveyed, and possibly amenable to analysis and interpretation, through data and the context in which the data are assembled. Knowledge is the general understanding and awareness garnered from accumulated information, tempered by experience, enabling new contexts to be envisaged.

Prof. Rafael Capurro, University of Applied Sciences, Stuttgart, Germany
Homepages: http://www.capurro.de; http://icie.zkm.de; http://www.i-r-i-e.net

Data are (or datum is) an abstraction. I mean, the concept of `data' or `datum' suggests that there is something there that is purely given and that can be known as such. The last one hundred years of (late) philosophic discussion and, of course, many hundred years before, have shown that there is nothing like `the given' or `naked facts' but that every (human) experience/knowledge is biased. This is the `theory-laden' theorem that is shared today by such different philosophic schools as Popper's critical rationalism (and his followers and critics such as Kuhn or Feyerabend), analytic philosophy (Quine, for instance), hermeneutics (Gadamer) etc. Modern philosophy (Kant) is very acquainted with this question: experience (“Erfahrung”) is a product of `sensory data' within the framework of perception (“Anschauung”) and the categories of reason (“Verstand”) (“perception without concepts is blind, concepts without perception are void”). Pure sensory data are as unknowable as “things in themselves.”
 
Information is a multi-layered concept with Latin roots (`informatio' = to give a form) that go back to Greek ontology and epistemology (Plato's concept of `idea' and Aristotle's concepts of `morphe' but also to such concepts as `typos' and `prolepsis') (See Capurro, 1978; Capurro & Hjoerland, 2003). The use of this concept in information science is at the first sight highly controversial but it basically refers to the everyday meaning (since Modernity): “the act of communicating knowledge” (OED). I would suggest to use this definition as far as it points to the phenomenon of message that I consider the basic one in information science.
 
Message, information, understanding. Following systems theory and second-order cybernetics, I suggest to distinguish between `message', `information' and `understanding.' All three concepts constitute the concept of communication (See, for instance, Luhmann, 1996, with references to biology (Maturana/Varela), cybernetics etc.). A `message' is a `meaning offer' while `information' refers to the selection within a system and `understanding' to the possibility that the receiver integrates the selection within his/her pre-knowledge - constantly open to revision i.e. to new communication - in accordance with the intention(s) of the sender. The receiver mutates each time into a sender.
 
Knowledge is `no-thing' (contrary to “information-as-thing” as suggested by Michael Buckland, 1991a) i.e. it is the event of meaning selection of a (psychic/social) system from its `world' on the basis of communication. The “act of communicating knowledge” (OED's definition of information) is then to be understood as the act of making a meaning offer (=message) leading to understanding (and misunderstanding) on the basis of a selection of meaning (=information). To know is then to understand on the basis of making a difference between `message' (or meaning offer) and `information' (or meaning selection). Human knowledge is, as Popper states, basically conjectural. Or, to put it in hermeneutic terms: understanding is always biased, i.e., based on (implicit) pre-understanding. In more classical terms we distinguish following Aristotle between `empirical knowledge' (or `know-how' = `empeiria') and explicit knowledge (or `know-that', for instance, scientific knowledge or `episteme').
 
Data, information, knowledge. Putting the three concepts ("data", "information", and "knowledge") as done here, gives the impression of a logical hierarchy: information is set together out of data and knowledge comes out from putting together information. This is a fairytale.

Prof. Thomas A. Childers, Drexel University, USA
.
Knowledge is that which is known, and it exists in the mind of the knower in electrical pulses. Alternatively, it can be disembodied into symbolic representations of that knowledge (at this point becoming a particular kind of information, not knowledge).  Strictly speaking, represented knowledge is information. Knowledge—that which is known—is by definition subjective, even when aggregated to the level of social, or public, knowledge—which is the sum, in a sense, of individual “knowings.”  Data and information can be studied as perceived by and “embodied” (known) by the person or as found in the world outside the person.

Prof. Charles H. Davis, Indiana University, USA 

Data is the plural of datum, although the singular form is rarely used. Purists who remember their first-year Latin may insist on using a plural verb with data, but they forget that English grammar permits collective nouns. Depending on the context, data can be used in the plural or as a singular word meaning a set or collection of facts. Etymologically, data, as noted, is the plural of datum, a noun formed from the past participle of the Latin verb dare—to give. Originally, data were things that were given (accepted as “true”). A data element, d, is the smallest thing which can be recognized as a discrete element of that class of things named by a specific attribute, for a given unit of measure with a given precision of measurement (Rush & Davis, [in progress]; Landry & Rush, 1970; Yovits  & Ernst, 1970).”
 
Information The verb ‘inform’ normally is used in the sense to communicate (i.e., to report, relate, or tell) and comes from the Latin verb informare, which meant to shape (form) an idea. Data is persistent while information is transient, depending on context and the interpretation of the recipient. Information is data received through a communication process that proves of value in making decisions.
 
Knowledge involves both data and the relationships among data elements or their sets.  This organization of data based on relationships is what enables one to draw generalizations from the data so organized, and to formulate questions about which one wishes to acquire more data. That is, knowledge begets the quest for knowledge, and it arises from verified or validated ideas (Sowell, 1996).

Prof. Anthony Debons, University of Pittsburgh, USA

Data are symbols organized according to established algorithms.
 
Information represents a state of awareness (consciousness) and the physical manifestations they form. Information, as a phenomena, represents both a process and a product; a cognitive/affective state, and the physical counterpart (product of) the cognitive/affective state. The counterpart could range from a scratch of a surface, movement (placement)of a rock; a gesture(movement) speech(sound), written document, etc. (requirement). Information answers questions of what, where, when and who and permutations thereof..
 
“Knowledge. Knowledge represents a cognitive/affective state that finds definition in meaning and understanding. Knowledge is reflected in the questions of “how” and “why”. Knowledge extends the organism state of awareness (consciousness/ information). Knowledge can be given physical representation (presence) in the material products (technology) thereof (books, film, speech, etc)”.
 
“Message. Message is a medium through which data; information and knowledge are transmitted and used. It represents an instrument for moving the state of awareness and meaning with reference to specific events (states, conditions) from one implicit, or explicit source to another. When the physical products of awareness are transferred from one source to another, reference to the collective domain can be realized.

Prof. Gordana Dodig-Crnkovic, Mälardalen University, Sweden

Raw data (sometimes called source data or atomic data) is data that has not been processed for use. [In the spirit of Tom Stonier’s definition - Data: a series of disconnected facts and observations] Here “unprocessed” might be understood in a sense that no specific effort has been made to interpret or understand the data. They are the result of some observation or measurement process, which has been recorded as “facts of the world”. The word data is the plural of Latin datum, “something given”, which one also could call “atomic facts. Information is the end product of data processing. Knowledge is the end product of information processing. In much the same way as raw data are used as input, and processed in order to get information, the information itself is used as input for a process that results in knowledge.
 
Theory laden. It is very true that all data are theory laden. That does not mean that you can not produce new data which in the next step will lead to the theory revision, and that new, corrected theory will be the basis for producing new data which after a while will lead to the correction of the existing theory. We use our theory-laden data to refute theories!
 
Data-Information-Knowledge-Wisdom. According to Stonier (1993, 1997), data is a series of disconnected facts and observations. These may be converted to information by analyzing, cross-referring, selecting, sorting, summarizing, or in some way organizing the data. Patterns of information, in turn, can be worked up into a coherent body of knowledge. Knowledge consists of an organized body of information, such information patterns forming the basis of the kinds of insights and judgments which we call wisdom. The above conceptualization may be made concrete by a physical analogy (Stonier, 1993): consider spinning fleece into yarn, and then weaving yarn into cloth. The fleece can be considered analogous to data, the yarn to information and the cloth to knowledge. Cutting and sewing the cloth into a useful garment is analogous to creating insight and judgment (wisdom). This analogy emphasizes two important points: (1) going from fleece to garment involves, at each step, an input of work, and (2) at each step, this input of work leads to an increase in organization, thereby producing a hierarchy of organization. .

Prof. Henri Dou, University of Aix-Marseille III, France

Datum is a unique piece of content related to an entity. Information is the sum of the data related to an entity..

Prof. Nicolae Dragulanescu, Polytechnics University of Bucharest, Romania 

Data are a set of symbols representing a perception of raw facts (i.e. following Debons, Horne & Cronenweth (1988), events from which inferences or conclusions can be drawn). Information is organized data (answering the following basic questions: What? Who? When? Where?). Knowledge is understood information (answering following basic questions: why?, how?, for which purpose?).

Prof. Hamid Ekbia, University of Redlands, USA
 

Data. Here, data typically means the “raw” material obtained from observation (broadly understood, but not necessarily, as “sense impressions,” which is a key notion of empiricist philosophy). Such data is typically quantitative, presented in numbers and figures. 

Prof. Charles Ess, Drury University, USA

Prolog. These definitions are offered as an elaboration on physicist Heinz Pagels’ (1988) observation.
Information is just signs and numbers, while knowledge has semantic value. What we want is knowledge, but what we often get is information. It is a sign of the times that many people cannot tell the difference between information and knowledge, not to mention wisdom, which even knowledge tends to drive out. (1988, 49, cited in O’Leary and Brasher, 1996, 262).
 
These distinctions in turn trace back at least as far as T.S. Eliot’s lament:
 
Where is the Life we have lost in living?
Where is the wisdom we have lost in knowledge?
Where is the knowledge we have lost in information?
-- "Choruses from the Rock"]
 
Data can be defined as a class of information objects, made up of units of binary code that are intended to be stored, processed, and transmitted by digital computers. As such, data consists of information in a narrow sense – i.e., as inscribed in binary code, units of data are not likely to be immediately meaningful to a human being.  But units of data, as “informational building blocks,” when collected and processed properly, can form information in the broader sense (see below), i.e., that is more likely to be meaningful to a human being (as sense-making beings).
 
Information. Collocations of data (information in the narrow sense – see above) that thereby become meaningful to human beings – e.g., as otherwise opaque units of binary code are collected and processed into numbers, artificial and natural languages, graphic objects that convey significance and meaning, etc. Such collocations of data can be made meaningful by human beings (as sense-making beings) especially as such data collocations/information connect with, illuminate, and are illuminated by still larger cognitive frameworks – most broadly, worldviews that further incorporate knowledge and wisdom (see below). On this definition, information can include but is not restricted to data.  On the contrary, especially as Borgmann (1999) argues, there are other forms of information (natural, cultural) that are not fully reducible to data as can be transmitted, processed, and/or produced by computers and affiliated technologies.
 
Knowledge is one step above information, and one step below wisdom.  Knowledge in the broadest sense approaches a reasonably comprehensive worldview, i.e., a cognitive framework that establishes the major parameters and ten thousand details of human social and ethical realities, including basic values, beliefs, habits, notions of identity, relationships among human beings (including gender identity and issues) and relationships between humanity and larger realities (political, environmental, religious). Knowledge, however, can remain detached, objective, and thereby useless.  Transforming cognitive forms of knowledge into ethical judgment and action is a primary task and goal of wisdom (see Dreyfus 2001; Ess 2003, 2004).” [16] (Charles Ess)

Prof. Raya Fidel, University of Washington, USA

Data are a string of symbols. Information is data that is communicated, has meaning, has an effect, has a goal. Knowledge is a personal/cognitive framework that makes it possible for humans to use information.

  Prof. Thomas J. Froehlich, Kent State University, USA

“Data. It depends on your framework.  If you are a Kantian, it is the foundation for the a priori categories of the understanding.  If you are a computer programmer it is pre-processed information (data collected according to some algorithm for some purpose) or post-processed information (e.g., tables of such information).  In this latter case data cannot be defined apart from information, because it is dependent on it.  If you are a biologist, it might be stimuli, but these scientific approaches are built on a faulty understanding of perception (e.g., perception is sensations (i.e., stimuli) glued together – which is false).
 
Information is resources useful or relevant or functional for information seekers.
 
Knowledge. For some philosophers, validated, true information is that which coheres with other truths (coherence theory of truth). For others, what corresponds to reality (correspondence theory of truth).  For others, it is what works or is functional (pragmatic theory of truth). At any event it is always contextual.
 
“A lot of our so-called truths, knowledge, or known ‘facts’ are really orthodoxy – what we collectively believe at a certain point in time. Today when someone would observe an unsupported object falling, when pushed for an explanation, they would utter the phrase/explanation: “the law of gravity.”  Unfortunately, it is an explanation that fails to explain – we still do not know what the “weak force” is, what gravity is, but we are taught in our so-called scientific approach, to utter a phrase that is supposed to – in the naming of it – to explain it.  Four centuries back, it was attributed to the “will of God”. Is this a worse explanation?  Possibly.  In both cases, we are living in images and metaphors and the orthodox frameworks of the time.  Most reference collections in libraries are expressions of orthodoxies of various subject domains.

Dr. H.M. Gladney, HMG Consulting, USA

 Data are representations of facts about the world. Information is data organized according to an ontology that defines the relationships between some set of topics.  Information can be communicated. Knowledge is a set of conceptual structures held in human brains and only imperfectly represented by information that can be communicated. Knowledge cannot be communicated by speech or any form of writing, but can only be hinted at.

Prof. Glynn Harmon, University of Texas at Austin, USA

Data is one or more kinds of energy waves or particles (light, heat, sound, force, electromagnetic) selected by a conscious organism or intelligent agent on the basis of a pre-existing frame or inferential mechanism in the organism or agent. Information is an organism’s or an agent’s active or latent inferential frame that guides the selection of data for its own further development or construction. Knowledge is one or more sets of relatively stable information. A Message is one or more inferred data sets gleaned from external or internal energetic reactions.

Dr. Donald Hawkins, Information Today, USA

 Data are facts and statistics that can be quantified, measured, counted, and stored. Information is data that has been categorized, counted, and thus given meaning, relevance, or purpose. Knowledge is information that has been given meaning and taken to a higher level.  Knowledge emerges from analysis, reflection upon, and synthesis of information.  It is used to make a difference in an enterprise, learn a lesson, or solve a problem." [21] (Donald Hawkins)

Prof. Caroline Haythornthwaite, University of Illinois at Urbana Champaign, USA

Datum is smallest collectable unit associated with a phenomenon. Normally, data occur in collections that are collected in order to monitor a process, assess a situation, and/or otherwise gain a referent on a phenomenon. This does not mean that data are always defined, collected or used appropriately for the question in hand, but that that is the intention when doing so. They are building blocks, even if the building is engineered incorrectly.
 
Information. I would usually expect information to be an assessment or interpretation of data. Often information is not far removed from the ‘smallest collectable unit’ as I have defined “datum”. But I expect it to be some abstraction from data... Information does not inherently mean empirical or first hand analysis of data. It also does not guarantee correct interpretation of data although that is expected.
 
Knowledge is more subject, and intangible compared to information or data. It is what an individual takes from information and data, and what they incorporate into their beliefs, values, procedures, actions, etc. It is heavily internally oriented, understood completely only to the person possessing it. Much work around knowledge implies how to get the knowledge “out of” one head and in to another. Such transfer entails encoding knowledge into transferable information and decoding again into knowledge. Knowledge and information are not the same, but they feed from and support each other.
 
A message is the encoded information or codified/explicit knowledge that is disseminated to others. Very much a Shannon and Weaver transmission model, but I also consider that encoding and decoding have a heavy personal, contextual and historical influence.

Ken Herold, Hamilton College, USA

“Data are dynamic objects of cultural experience having the aspect of being meaning-neutral and a dual nature of description and instruction. Information is dynamic objects of cultural experience having the aspect of being belief-neutral and a dual nature of content and medium. Knowledge is dynamic objects of cultural experience having the aspect of being action-neutral and a dual nature of abstracting to and from the world..

Prof. William Hersh, Oregon Health & Science University, USA
 
Data are the raw observations about the world collected by scientists and others, with a minimum of contextual interpretation. Information is the aggregation of data to make coherent observations about the world. Knowledge is the rules and organizing principles gleaned from data to aggregate it into information.

Prof. Birger Hjorland, Royal School of Library & Information Science, Denmark

Data are observations and measurements you make on objects (artifacts, sites, seeds, bones) and on their contexts. Data are theory-laden.
 
Regarding the theory of knowledge organization we may say that knowledge is not organized by elements called data combined or processed according to some algorithmic procedure. What data are is domain specific and theory-laden. At the most general level what is seen as data is depending of the epistemological view that one subscribes to.
 
Information. The most fruitful theoretical view is here based on Karpatschof's interpretation of information and activity theory, AT (2000, p. 128ff.). In order to define information, Karpatschof introduces the concept of release mechanisms, being systems having at their disposal a store of potential energy, the systems being ”designed” to let this energy out in specific ways, whenever trigged by a signal fulfilling the specifications of the release mechanism. The signal that triggers a certain release mechanism is a low energy phenomenon fulfilling some release specifications. The signal is thus the indirect cause, and the process of the release mechanism the direct cause of the resulting reaction, which is a high-energy reaction compared to the energy in the signal. Information is thus defined as a quality by a given signal relative to a certain mechanism.
 
The release mechanism has a double function: (1) it reinforces the weak signal and (2) it directs the reaction by defining the functional value of a signal in the pre-designed system of the release mechanism. There has been a tendency to consider information to be an obscure category in addition to the classical categories of physics. Information is indeed a new category, but it cannot be placed, eclectically, beside the prior physical categories. Information is a category, not beside, but indeed above the classical categories of physics. Therefore, information is neither directly reducible to these classical categories, nor is it a radically different category of another nature than mass and energy.
 
 Information is, in fact, the causal result of existing physical components and processes. Moreover, it is an emergent result of such physical entities. This is revealed in the systemic definition of information. It is a relational concept that includes the source, the signal, the release mechanism and the reaction as its reactants. The release mechanism is a signal processing system and an information processing system.
 
Information is thus defined in physical terms of signals, mechanisms and energy, but probably first arose with the biological world. Hjørland (2002) outlines the development of information processing mechanisms in the biological, the cultural and the social world.
 
Many professionals can claim to work with "the generation, collection, organization, inter­pretation, storage, retrie­val, dissemination, transfor­mation and use of infor­mation". This is not specific to information professionals. (Their specific work is discussed in Capurro & Hjørland (2003) and elsewhere). Hjørland (2000) investigates when and why the word "information" became associated with library schools (and thus knowledge organization) and what the theoretical implications in the shift from documents to information imply”.
 
Knowledge. Different epistemologies (theories of knowledge) have different views on the nature of knowledge. I subscribe to the pragmatic theory of knowledge. The most important influence from pragmatic philosophy has been skepticism towards any claim of knowledge. A claim of knowledge should never be regarded as finally verified. It should just be regarded as just a claim. However, claims may be supported by empirical and logical arguments. Knowledge claims are parts of more comprehensive theories. Knowledge claims are not purely arbitrary. Instead of regarding science as a collection of true statements, it should be regarded as a collection of supported knowledge claims. In ordinary speech, knowledge then means that part of our background assumptions, that we do not find it fruitful to put questions to.

Prof. Donald Kraft, Louisiana State University, USA

Data are atomic facts, basic elements of “truth”, without interpretation or greater context. It is related to things we sense. Information is a set of facts with processing capability added, such as context, relationships to other facts about the same or related objects, implying an increased usefulness. Information provides meaning to data. Knowledge is information with more context and understanding, perhaps with the addition of rules to extend definitions and allow inference.

Prof. Yves François Le Coadic, National Technical University, France

Datum (in our sector mainly electronic) is the conventional representation, after coding (using ASCII, for example), of information. Information is knowledge recorded on a spatio-temporal support. Knowledge is the result of forming in mind an idea of something (Le Coadic, 2004).

Dr. Jo Link-Pezet, Urfist, and University of Social Sciences, France

 Data are commonly seen as simple, isolated facts, though products of intellectual activity in their rough shape. Knowledge is the appropriation of information in the process of learning, acting, interpreting. Knowledge is in the head of people, yet knowledge can be shared. Knowledge refers to the way information is used during the intellectual process.

Michal Lorenz, Masaryk University in Brno, Czech Republic

Data are formalized parts (i.e., digitalized contents) of sociocultural information potentionally proccessable by technical facilities which disregard the cognitive process and that is why it is necessary to provide them with meanings from outside (i.e. they are objective). Information is a relationship between an inner arrangement (i.e., a priori set structure (Smajs & Krob, 2003), implicate order of a system and its present embodiment in reality (explicate order) including mediating memory processes (i.e., historically dependent processes) releasing the meaning. Knowledge is tacitly or consciously grasped and interiorized content of information related and meaningfully integrated into a unifying frame of experience among other information contents interiorized in the same way, the complex of which reflects subjective understanding of environment. Mistakes arise from integration of misinformation or from integration of contradictory information into a unifying frame of experience (the second leads to cognitive dissonance and motivates to seek another information).

Prof. Michel J. Menou, Knowledge and ICT management consultant, France

Data are perceptible or perceived - if and when the signal can be interpreted by the 'user' - attributes of physical, biological, social or conceptual entities. Information is recorded and organized data that can be communicated (Porat & Rubin, 1977). However, it is advisable to distinguish between the various states or conditions of information (e.g. information-as an object (Buckland, 1991b), or semantic, syntactic and paradigmatic states (Menou, 1995). Knowledge is information that is understood, further to its utilization, stored, retrievable and reusable under appropriate circumstances or conditions..
.

  Prof. Haidar Moukdad, Dalhousie University, Canada

Data are sets of characters, symbols, numbers, and audio/visual bits that are represented and/or encountered in raw forms. Inherently, knowledge is needed to decipher data and turn them into information”. Information is facts, figures, and other forms of meaningful representations that when encountered by or presented to a human being are used to enhance his/her understanding of a subject or related topics. Knowledge is a reservoir of information that is stored in the human mind. It essentially constitutes the information that can be “retrieved” from the human mind without the need to consult external information sources.
  Prof. Charles Oppenheim, Loughborough University, UK
Data are raw material of information, typically numeric.Information is data which is collected together with commentary, context and analysis so as to be meaningful to others. Knowledge is a combination of information and a person's experience, intuition and expertise.

Prof. Lena Vania Pinheiro, Brazilian Institute for Information in Science and Technology, Brazil

Datum is an object or crude fact perceived by the subject, non – constructed nor elaborated in the consciousness, without passing through neither analysis processes nor evaluation for its transfer as information. Information is a phenomenon generated from knowledge and integrated therein, analyzed and interpreted to achieve the transfer process of message (i.e., meaningful content) and the cognitive transformations of people and communities, in a historical, cultural and social context. Knowledge is a social and cognitive process formed by the passing or assimilated information to thought and to action. Message is the meaningful content of information..
 

Prof. Maria Pinto, University of Granada, Spain

Data are primitive symbolic entities, whose meaning depend on it integration within a context that allow their understanding by an interpreter. Information is the intentional composition of data by a sender with the goal of modifying the knowledge state of an interpreter or receiver. Knowledge is the intelligent information processing by the receiver and it consequent incorporation to the individual or social memory (Belkin & Robertson, 1976; Blair, 2002).

Prof. Roberto Poli, University of Trento, Italy

 Signs. The distinctive feature of signs is that they denote something, regardless of whether that something exists or does not exist, is concrete or abstract, possible or impossible, a thing or an event, a substance or a determination, an individual or a collective. Analysis even of one single sign leads to a multiplicity of signs and their denoted items. For this reason, we may say that the sign contains a reference to both the denoted item considered per se, in isolation, and the contexts or situations in which the denoted item appears. And of these of especial importance are those that, for lack of better terminology, we can call the proximal context and the distal context. The proximal context is the net of relations that hold among the items denoted by signs. On the other hand, the distal context is the outcome of a categorization procedure. Its most usual form is that constituted by the reply to questions like 'what is this?', where acceptable replies are of the type 'this is an animate being', 'this is an artifact', 'this is a property', etc. This codification of the two types of context enables me to propose the following distinction between data and information.
 
Datum. Def. 1. x is a datum = x is a sign that denotes entities or attributes in a proximal context. In the light of this definition one understands why conventional analyses of consistency and integrity, or procedures of normalization, are effective techniques for the organization and rationalization of data. From a technological point of view, relational data-bases are the currently most advanced products available for the efficient handling of data.
 
Information. Def. 2. x is an item of information = x is a datum in adistal context. Definition 2 tells us that information is made up of more structured items. That is to say, information is the embedding of signs-in-a-proximal-context (i.e., data) in a distal context. Information, thus, adds greater structure to data. These definitions provide a first explanation for the scant interest aroused by proposals to draw more exact distinctions between data and information. In effect, in concrete cases of application, it is often difficult to distinguish precisely between distal and proximal contexts.

Conditions of knowledge. Knowledge is apparently not reducible solely to information and data. The problem is to understand 'what is lacking', what must be added to information and data in order to achieve true knowledge. My claim is that the meaning of a sign is given by the position of the sign in a field of signs (in a space). On the other hand, the content of a sign is given by the position of the item (denoted by the sign) in a field of items. Data, information, meanings and contents cover the field of knowledge. This amounts to saying that we have knowledge when we know (1) which item is denoted by which sign, (2) the item's proximal context, (3) the item's distal contexts, (4) the sign's position in the field of signs, (5) the item's position in the field of items (Poli, 2001).

..
Data, information, knowledge, message. I am unable to understand why data, information, knowledge and message are placed on the same level of analysis. I would suggest considering message as the “vehicle” carrying either data or information (which can be taken as synonymous). Knowledge hints to either a systematic framework (e.g., laws, rules or regularities, that is higher-order “abstractions” from data) or what somebody or some community knows (“I know that you are married”). In this latter sense knowledge presents a “subjective” side.
Prof. Ronald Rousseau, KHBO, and University of Antwerp, Belgium 

Data are a representation of facts or ideas in a formalized manner, and hence capable of being communicated or manipulated by some process. So: data is related to facts and machines ( Holmes, 2001). Information is the meaning that a human assigns to data by means of the known conventions used in its representation. Information is related to meaning and humans (Holmes, 2001).

  Scott Seaman, University of Colorado, Boulder, USA

Datum is a quantifiable fact that can be repeatedly measured. Information is an organized collection of disparate datum. Knowledge is the summation of information into independent concepts and rules that can explain relationships or predict outcomes.  

Prof. Richard Smiraglia, Long Island University, USA

Data are raw evidence, unprocessed, eligible to be processed to produce knowledge. Information is the process of becoming informed; it is dependent on knowledge, which is processed data. Knowledge perceived, becomes information. Knowledge is what is known, more than data, but not yet information. Recorded knowledge may be accessed in formal ways. Unrecorded knowledge is accessible in only chaotic ways. 

Prof. Paul Sturges, Loughborough University, UK 

Data are discrete items of information that I would call facts on some subject or other, not necessarily set within a fully worked out framework. Information is facts and ideas communicated (or made available for communication). Knowledge is the considered product of information. Selection as to what is valid and relevant is a necessary condition of the acquisition of knowledge.

Prof. Carol Tenopir, University of Tennessee, USA

Data are facts that are the result of observation or measurement. (Landry et al, 1970). Information is meaningful data. Or data arranged or interpreted in a way to provide meaning. Knowledge is internalized or understood information that can be used to make decisions.

  Dr. Joanne Twining, Intertwining.org, a virtual information consultancy, USA

Data are unprocessed, unrelated raw facts or artifacts (Nitecki 1993). Information is data or knowledge processed into relations (between data and recipient) (Nitecki 1993). Knowledge is information scripted into relations with recipient experiences.

Prof. Anna da Soledade Vieira, Federal University of Minas Gerais, Brazil

Data are representations of facts and raw material of information. Information is data organized to produce meaning. Knowledge is meaningful content assimilated for use. The three entities can be viewed as hierarchical in terms of complexity, data being the simplest and knowledge, the most complex of the three. Knowledge is the product of a synthesis in our mind that can be conveyed by information, as one of many forms of its externalization and socialization.

  Prof. Irene Wormell, Swedish School of Library and Information Science in Borås, Sweden

Data are alphabetic or numeric signs, which without context do not have any meaning. Information is a set of symbols that represent knowledge. Information is what context creates/gives to data. It is cognitive. Normally it is understood as a new and additional element in collecting data and information for planned action. Knowledge is enriched information by a person's or a system's own experience. It is cognitive based. Knowledge is not transferable, but through information we can communicate about it. (Note that the highest level of information processing is the generation of wisdom, where various kinds of knowledge are communicated and integrated behind an action. 

Prof. Yishan Wu, Institute of Scientific and Technical Information of China, China

Data are artifacts that reflect a phenomenon in natural or social world in the form of figures, facts, plots, etc. Information is anything communicated among living things. It is one of the three mainstays supporting the survival and evolution of life, along with energy and materials. Knowledge is a human construct, which categorize things, record significant events, and find causal relations among things and/or events, etc. in a systematic way.  

Chaim Zins, Knowledge Mapping Research, Israel

Inferential propositional knowledge. In traditional epistemology, there are three main kinds of knowledge: practical knowledge, knowledge by acquaintance, and propositional knowledge (Bernecker and Dretske, 2000). Practical knowledge refers to skills (i.e., functional abilities, such as driving a car). Knowledge by acquaintance is direct non-mediated recognition of external physical objects and organisms (e.g., "this is Albert Einstein"), or the direct recognition of inner phenomena (e.g., pain, hunger). Propositional knowledge usually comes in the form of 'knowing that''; S (subject) knows that P (proposition). It is the reflective and/or the expressed content of what a person thinks that he or she knows. Note that the contents of our reflective and/or expressed thoughts are in the form of propositions. Propositional knowledge is divided into inferential and non-inferential knowledge. Non-inferential propositional knowledge refers to direct intuitive understanding of phenomena (e.g., ""This is a true love"). Inferential knowledge is a product of inferences, such as induction and deduction. The field of information science, as well as any academic field, is composed of inferential propositional knowledge, as they are published in articles and books. This analysis is focused on defining "data", "information", and "knowledge" as they are related and implemented in inferential propositional knowledge.
 
Subjective vs. objective realms. Data (D), information (I), and knowledge (K) phenomena have two distinctive modes of existence; namely, in the subjective and in the objective realms. Correspondingly, we differentiate between subjective knowledge and objective knowledge. Note that "subjective knowledge" is equivalent here to the knowledge of the subject or the individual knower, and "objective knowledge" is equivalent here to knowledge as an object or a thing. Subjective knowledge exists in the individual's internal world (i.e., as a thought), while objective knowledge exists in the individual's external world (e.g., as it is published in books, presented in digital libraries, and stored in electronic devices). In this context, they are not related to arbitrariness and truthfulness, which are usually attached to the concepts of "subjective knowledge" and "objective knowledge". To avoid confusion, I will use the terms "universal knowledge" and "collective knowledge (i.e., knowledge in the collective realm) rather than "objective knowledge". The distinction between subjective knowledge and universal knowledge differs from the distinction between private knowledge and public knowledge. "Private knowledge" is the individual's intimate knowledge. These are thoughts on contents known only to the individual, such as intimate dreams and feelings, "hidden agenda (i.e., hidden goals and incentives)."Public knowledge" refers to thoughts that the individual consider as knowledge, and they are on contents known to other people as well (e.g., "2+2=4", "Paris is the capital of France")."
 
Six distinctive concepts. Having established the distinction between the subjective and the universal domains, we are in a position to define the three key concepts "data", "information", and "knowledge". In fact, we have six concepts to define, divided into two distinctive sets of three. One set relates to the subjective domain, and the other – to the universal domain.
 
D-I-K in the subjective domain. In the subjective domain, data are the sensory stimuli, which we perceive through our senses. Information is the meaning of these sensory stimuli (i.e., the empirical perception). For example, the noises that I hear are data. The meaning of these noises (e.g., a running car engine) is information. Still, there is another alternative as to how to define these two concepts - which seems even better. Data are sense stimuli, or their meaning (i.e., the empirical perception). Accordingly, in the example above, the loud noises, as well as the perception of a running car engine, are data. Information is empirical knowledge. Accordingly, in the example above, the knowledge that the engine is now on and the car is leaving is information, since it is empirically based. Information is a type of knowledge, rather than an intermediate stage between data and knowledge. Knowledge is a thought in the individual's mind, which is characterized by the individual's justifiable belief that it is true. It can be empirical and non-empirical, as in the case of logical and mathematical knowledge (e.g., "every triangle has three sides"), religious knowledge (e.g., "God exists"), philosophical knowledge (e.g., "Cogito ergo sum"), and the like. Note that "knowledge" is the content of a thought in the individual's mind, which is characterized by the individual's justifiable belief that it is true, while "knowing" is a state of mind which is characterized by the three conditions: (1) the individual believe that it is true, (2) S/he can justify it, and (3) It is true,  or it is appear to be true.  
 
D-I-K in the universal domain. In the universal domain, data, information, and knowledge are human artifacts. They are represented by empirical signs (i.e., signs that one can sense through his/her senses). They can take on diversified forms such as engraved signs, painted forms, printed words, digital signals, light beams, sound waves, and the like. Universal data, universal information, and universal knowledge mirror their cognitive counterparts. Meaning, in the objective domain data are sets of signs that represent empirical stimuli or perceptions, information is a set of signs, which represent empirical knowledge, and knowledge is a set of signs that represent the meaning (or the content) of thoughts that the individual justifiably believes that they are true.
..
Signs vs. meaning. Defining the D-I-K phenomena as sets of signs needs to be refined. There is a fundamental distinction between documented (i.e., written, spoken, or physically expressed) propositions and meanings. "E=MC2", "E=MC2", and "E=MC2" are not three different types of knowledge. These are three different sets of signs that represent the same meaning. In other words, they are three different utterances of the same knowledge. Knowledge, in the collective domain, is the meaning that is represented by written and spoken statements (i.e., sets of symbols). However, since we cannot perceive with our senses the meaning itself, which is an abstract entity, we can relate only to the sets of signs (i.e., written, spoken, or physically expressed propositions), which represent it. Apparently, it is more useful to relate to the data, information, and knowledge as sets of signs rather than as meaning and its building blocks.




Five Models for Defining Data, Information, and Knowledge

1. UD: D-I; SD: K; meaning: D-I are external phenomena; K are internal phenomena. This is the most common one. It underlies the rationale of the name "Information Science"; that is, Information Science is focused on exploring data and information, which are external phenomena. It does not explore knowledge, which is internal phenomena.

2. UD: D; SD: I-K; meaning: D are external phenomena; I-K are internal phenomena. 

3. UD: D-I-K; SD: I-K; meaning: D are external phenomena; I-K phenomena can be in both domains, external or internal.

4. UD: D-I; SD: D-I-K; meaning: D-I  phenomena can be in both domains, external or internal; K phenomena are internal.

5. UD: D-I-K; SD: D-I-K; meaning: D-I-K phenomena can be in both domains, universal (i.e., external) or subjective (i.e., internal).

Model 1

Model 2

Model 3

Model 4

Model 5

UD

SD

UD

SD

UD

SD

UD

SD

UD

SD

D

 

D

 

D

 

D

D

D

D

I

 

 

I

I

I

I

I

I

I

 

K

 

K

K

K

 

K

K

K

Five models for Defining Data, Information, and Knowledge; What is your model?
Overview  Data, Information, Knowledge  Conceptions
.
Classifications  Map Forum  Forum 

10 Pillars of Knowledge: Map of Human Knowledge
 
Library Classifications  
November 2013  © Copyright Dr. Chaim Zins, Jerusalem, 2002-2013. All rights reserved. 
Chaim Zins, Knowledge Mapping Research, 26 Hahaganah St. Jerusalem, 97852 tel: 972-2-5816705 chaim.zins@gmail.com