Raw data

Primary datarawRaw Data Now
Raw data, also known as primary data, is data (e.g., numbers, instrument readings, figures, etc.) collected from a source.wikipedia
80 Related Articles

Data

statistical datascientific datadatum
Raw data, also known as primary data, is data (e.g., numbers, instrument readings, figures, etc.) collected from a source.
Raw data ("unprocessed data") is a collection of numbers or characters before it has been "cleaned" and corrected by researchers.

Rufus Pollock

Dr Rufus Pollock
Inspired by a post by Rufus Pollock of the Open Knowledge Foundation his call to action is "Raw Data Now", meaning that everyone should demand that governments and businesses share the data they collect as raw data.
In 2009, he was credited by web inventor Tim Berners-Lee for starting the Raw Data Now meme.

Thermometer

thermometerstemperature sensortemperature
If a scientist sets up a computerized thermometer which records the temperature of a chemical mixture in a test tube every minute, the list of temperature readings for every minute, as printed out on a spreadsheet or viewed on a computer screen is "raw data".

Outlier

outliersstatistical outliersconservative estimate
Raw data has not been subjected to processing, "cleaning" by researchers to remove outliers, obvious instrument reading errors or data entry errors, or any analysis (e.g., determining central tendency aspects such as the average or median result). In computing, raw data may have the following attributes: it may possibly contain human, machine, or instrument errors, it may not be validated; it might be in different areen (colloquial) formats; uncoded or unformatted; or some entries might be "suspect" (e.g., outliers), requiring confirmation or citation.

Central tendency

LocalityLocality (statistics)Measure of central tendency
Raw data has not been subjected to processing, "cleaning" by researchers to remove outliers, obvious instrument reading errors or data entry errors, or any analysis (e.g., determining central tendency aspects such as the average or median result).

Arithmetic mean

meanaveragearithmetic
Raw data has not been subjected to processing, "cleaning" by researchers to remove outliers, obvious instrument reading errors or data entry errors, or any analysis (e.g., determining central tendency aspects such as the average or median result).

Median

averagesample medianmedian-unbiased estimator
Raw data has not been subjected to processing, "cleaning" by researchers to remove outliers, obvious instrument reading errors or data entry errors, or any analysis (e.g., determining central tendency aspects such as the average or median result).

Statistics

statisticalstatistical analysisstatistician
Raw data can be inputted to a computer program or used in manual procedures such as analyzing statistics from a survey.

Survey methodology

surveysurveysstatistical survey
Raw data can be inputted to a computer program or used in manual procedures such as analyzing statistics from a survey.

Binary number

binarybinary numeral systembase 2
The term "raw data" can refer to the binary data on electronic storage devices, such as hard disk drives (also referred to as "low-level data").

Computing

computer technologycomputing technologyapplied computing
In computing, raw data may have the following attributes: it may possibly contain human, machine, or instrument errors, it may not be validated; it might be in different areen (colloquial) formats; uncoded or unformatted; or some entries might be "suspect" (e.g., outliers), requiring confirmation or citation.

Colloquialism

colloquialcolloquiallycolloq.
In computing, raw data may have the following attributes: it may possibly contain human, machine, or instrument errors, it may not be validated; it might be in different areen (colloquial) formats; uncoded or unformatted; or some entries might be "suspect" (e.g., outliers), requiring confirmation or citation.

Code

encodingencodedencode
In computing, raw data may have the following attributes: it may possibly contain human, machine, or instrument errors, it may not be validated; it might be in different areen (colloquial) formats; uncoded or unformatted; or some entries might be "suspect" (e.g., outliers), requiring confirmation or citation.

Confirmation

confirmedSacrament of Confirmationconfirmation name
In computing, raw data may have the following attributes: it may possibly contain human, machine, or instrument errors, it may not be validated; it might be in different areen (colloquial) formats; uncoded or unformatted; or some entries might be "suspect" (e.g., outliers), requiring confirmation or citation.

Citation

citedcitationscite
In computing, raw data may have the following attributes: it may possibly contain human, machine, or instrument errors, it may not be validated; it might be in different areen (colloquial) formats; uncoded or unformatted; or some entries might be "suspect" (e.g., outliers), requiring confirmation or citation.

Data processing

processingdata-processingprocessing of data
Once captured, this raw data may be processed stored as a normalized format, perhaps a Julian date, to make it easier for computers and humans to interpret during later processing.

Egg

eggsfish egghatching
Raw data (sometimes colloquially called "sourcey" data or "eggy" data, the latter a reference to the data being "uncooked", that is, "unprocessed", like a raw egg) are the data input to processing.

Information

informativeinputinputs
Although raw data has the potential to be transformed into "information," extraction, organization, analysis and formatting for presentation are required before raw data can be transformed into usable information.

Payment terminal

point-of-sale terminalcredit card terminalPOS
For example, a point-of-sale terminal (POS terminal, a computerized cash register) in a busy supermarket collects huge volumes of raw data each day about customers' purchases.

Cash register

cash registerstillcash till
For example, a point-of-sale terminal (POS terminal, a computerized cash register) in a busy supermarket collects huge volumes of raw data each day about customers' purchases.

Computer program

programprogramscomputer programs
Raw data can be inputted to a computer program or used in manual procedures such as analyzing statistics from a survey. Once processed and analyzed by a software program or even by a researcher using a pen and paper and a calculator, this raw data may indicate the particular items that each customer buys, when they buy them, and at what price; as well, an analyst or manager could calculate the average total sales per customer or the average expenditure per day of the week by hour.

Calculator

pocket calculatorcalculatorselectronic calculator
Once processed and analyzed by a software program or even by a researcher using a pen and paper and a calculator, this raw data may indicate the particular items that each customer buys, when they buy them, and at what price; as well, an analyst or manager could calculate the average total sales per customer or the average expenditure per day of the week by hour.

Marketing

marketermarketedmarketing campaign
Such information could then become data for further processing, for example as part of a predictive marketing campaign.

Database

database management systemdatabasesDBMS
As a result of processing, raw data sometimes ends up being put in a database, which enables the raw data to become accessible for further processing and analysis in any number of different ways.

Tim Berners-Lee

Sir Tim Berners-LeeTim Berners LeeTimothy Berners-Lee
Tim Berners-Lee (inventor of the World Wide Web) argues that sharing raw data is important for society.