# Data set

**datasetdatasetsdata setsdataClassic data setsdata seriesset of data**

A data set (or dataset) is a collection of data.wikipedia

318 Related Articles

### Standard deviation

**standard deviationssample standard deviationSD**

These include the number and types of the attributes or variables, and various statistical measures applicable to them, such as standard deviation and kurtosis.

The standard deviation of a random variable, statistical population, data set, or probability distribution is the square root of its variance.

### Open data

**Open government dataopendata**

In the open data discipline, data set is the unit to measure the information released in a public open data repository.

However, the lack of a license makes it difficult to determine the status of a data set and may restrict the use of data offered in an "Open" spirit.

### Data

**statistical datascientific datadatum**

A data set (or dataset) is a collection of data.

### Iris flower data set

**Iris'' flower data setFisher's IrisFisher's iris data**

The Iris flower data set or Fisher's Iris data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems as an example of linear discriminant analysis.

### Anscombe's quartet

Anscombe's quartet comprises four data sets that have nearly identical simple descriptive statistics, yet have very different distributions and appear very different when graphed.

### Sample (statistics)

**samplesamplesstatistical sample**

In statistics and quantitative research methodology, a data sample is a set of data collected and the world selected from a statistical population by a defined procedure.

### Data blending

**blendblended datamultisource analysis**

Data blending is a process whereby big data from multiple sources are merged into a single data warehouse or data set.

### Robust statistics

**robustbreakdown pointrobustness**

The data sets for that book can be found via the Classic data sets page, and the book's website contains more information on the data.

### Statistics

**statisticalstatistical analysisstatistician**

In statistics, data sets usually come from actual observations obtained by sampling a statistical population, and each row corresponds to the observations on one element of that population.

Exploratory data analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics, often with visual methods.

### Data collection system

**automated data collection systemsData collectiondata collection and processing**

A collection (used as a noun) is the topmost container for grouping related documents, data models, and datasets.

### Data (computing)

**datacomputer datadata representation**

### United Nations Office for the Coordination of Humanitarian Affairs

**Office for the Coordination of Humanitarian AffairsOCHAUN Office for the Coordination of Humanitarian Affairs**

### Table (database)

**tabletablesdatabase table**

In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the data set in question.

### Column (database)

**columnscolumnAttribute**

In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the data set in question.

### Row (database)

**rowsrowrecord**

In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the data set in question.

### Statistical parameter

**parametersparameterparametrization**

These include the number and types of the attributes or variables, and various statistical measures applicable to them, such as standard deviation and kurtosis.

### Kurtosis

**excess kurtosisleptokurticplatykurtic**

These include the number and types of the attributes or variables, and various statistical measures applicable to them, such as standard deviation and kurtosis.

### Real number

**realrealsreal-valued**

The values may be numbers, such as real numbers or integers, for example representing a person's height in centimeters, but may also be nominal data (i.e., not consisting of numerical values), for example representing a person's ethnicity.

### Integer

**integersintegralZ**

The values may be numbers, such as real numbers or integers, for example representing a person's height in centimeters, but may also be nominal data (i.e., not consisting of numerical values), for example representing a person's ethnicity.

### Number

**number systemnumericalnumbers**

The values may be numbers, such as real numbers or integers, for example representing a person's height in centimeters, but may also be nominal data (i.e., not consisting of numerical values), for example representing a person's ethnicity.

### Level of measurement

**quantitativelevels of measurementscale**

### Missing data

**missing valuesmissing at randomincomplete data**

However, there may also be missing values, which must be indicated in some way.

### Sampling (statistics)

**samplingrandom samplesample**

In statistics, data sets usually come from actual observations obtained by sampling a statistical population, and each row corresponds to the observations on one element of that population.

### Statistical population

**populationsubpopulationsubpopulations**

In statistics, data sets usually come from actual observations obtained by sampling a statistical population, and each row corresponds to the observations on one element of that population.

### Algorithm

**algorithmsalgorithm designcomputer algorithm**

Data sets may further be generated by algorithms for the purpose of testing certain kinds of software.