Data set
dataset, datasets, data sets, data, Classic data sets, data series, set of data
A data set (or dataset) is a collection of data.
318 Related Articles
Standard deviation
standard deviations, sample standard deviation, SD
These include the number and types of the attributes or variables, and various statistical measures applicable to them, such as standard deviation and kurtosis.
The standard deviation of a random variable, statistical population, data set, or probability distribution is the square root of its variance.
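The relationship between variance and standard deviation can be sketched in a few lines. This is a minimal illustration that assumes the population form (dividing by n); the function names and the sample values are made up for the example.

```python
import math

def variance(values):
    """Population variance: the mean of squared deviations from the mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)

def standard_deviation(values):
    """Standard deviation: the square root of the variance."""
    return math.sqrt(variance(values))

# Hypothetical data set, chosen so the arithmetic is easy to check by hand.
data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
print(variance(data), standard_deviation(data))
```

For a sample rather than a whole population, the usual estimator divides by n - 1 instead; Python's standard `statistics` module offers both forms (`pstdev` and `stdev`).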
Open data
Open government data, opendata
In the open data discipline, a data set is the unit used to measure the information released in a public open data repository.
However, the lack of a license makes it difficult to determine the status of a data set and may restrict the use of data offered in an "Open" spirit.


Data
statistical data, scientific data, datum
A data set (or dataset) is a collection of data.
Iris flower data set
Iris flower data set, Fisher's Iris, Fisher's iris data
The Iris flower data set or Fisher's Iris data set is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper The use of multiple measurements in taxonomic problems as an example of linear discriminant analysis.





Anscombe's quartet
Anscombe's quartet comprises four data sets that have nearly identical simple descriptive statistics, yet have very different distributions and appear very different when graphed.
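As a rough illustration, the sketch below compares the y values of the first two of the four sets, using the values as commonly reproduced from Anscombe's 1973 paper. The variable names are chosen for this example only.

```python
import statistics

# Sets I and II of the quartet share the same x values.
x = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
y1 = [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]
y2 = [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]

# Summary statistics agree to about two decimal places...
mean1, mean2 = statistics.mean(y1), statistics.mean(y2)
var1, var2 = statistics.variance(y1), statistics.variance(y2)
print(round(mean1, 2), round(mean2, 2))
print(round(var1, 2), round(var2, 2))

# ...even though the individual observations differ at every x.
print(sum(1 for a, b in zip(y1, y2) if a != b), "of", len(x), "points differ")
```

Plotting y1 and y2 against x would show a roughly linear cloud for the first set and a smooth curve for the second, which the summary numbers alone do not reveal.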
Sample (statistics)
sample, samples, statistical sample
In statistics and quantitative research methodology, a data sample is a set of data collected and/or selected from a statistical population by a defined procedure.

Data blending
blend, blended data, multisource analysis
Data blending is a process whereby big data from multiple sources are merged into a single data warehouse or data set.
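A toy version of merging several sources into one data set might look like the following. The source names (`crm_rows`, `billing_rows`), the `customer_id` key, and the `blend` helper are all hypothetical; real blending tools work at a much larger scale and handle conflicts more carefully.

```python
# Two hypothetical sources describing overlapping customers.
crm_rows = [
    {"customer_id": 1, "name": "Ada"},
    {"customer_id": 2, "name": "Grace"},
]
billing_rows = [
    {"customer_id": 1, "total": 120.0},
    {"customer_id": 3, "total": 40.0},
]

def blend(left, right, key):
    """Merge rows from both sources into one record per key value."""
    merged = {}
    for row in left + right:
        # Rows with the same key are combined; later fields overwrite earlier ones.
        merged.setdefault(row[key], {}).update(row)
    return [merged[k] for k in sorted(merged)]

blended = blend(crm_rows, billing_rows, "customer_id")
for record in blended:
    print(record)
```

Records present in only one source are kept with the fields they have, which is one possible design choice; an inner join that drops them would be equally valid.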
Robust statistics
robust, breakdown point, robustness
The data sets for that book can be found via the Classic data sets page, and the book's website contains more information on the data.





Statistics
statistical, statistical analysis, statistician
In statistics, data sets usually come from actual observations obtained by sampling a statistical population, and each row corresponds to the observations on one element of that population.
Exploratory data analysis (EDA) is an approach to analyzing data sets to summarize their main characteristics, often with visual methods.





Data collection system
automated data collection systems, Data collection, data collection and processing
A collection (used as a noun) is the topmost container for grouping related documents, data models, and datasets.
Data (computing)
data, computer data, data representation
United Nations Office for the Coordination of Humanitarian Affairs
Office for the Coordination of Humanitarian Affairs, OCHA, UN Office for the Coordination of Humanitarian Affairs

Table (database)
table, tables, database table
In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the data set in question.
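The column/row correspondence can be illustrated with a small in-memory table. This is only a sketch: the column names, the placeholder values, and the `column_values` helper are invented for the example.

```python
# Each column name stands for one variable of the data set.
columns = ("person_id", "height_cm", "ethnicity")

# Each row is one record: the values of every variable for one person.
rows = [
    (1, 170.2, "group_a"),
    (2, 158.0, "group_b"),
    (3, 181.5, "group_a"),
]

def column_values(name):
    """Return all values of one variable, i.e. one column of the table."""
    index = columns.index(name)
    return [row[index] for row in rows]

print(column_values("height_cm"))    # one variable across all records
print(dict(zip(columns, rows[0])))   # one record across all variables
```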
Column (database)
columns, column, Attribute
In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the data set in question.

Row (database)
rows, row, record
In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the data set in question.
Statistical parameter
parameters, parameter, parametrization
These include the number and types of the attributes or variables, and various statistical measures applicable to them, such as standard deviation and kurtosis.
Kurtosis
excess kurtosis, leptokurtic, platykurtic
These include the number and types of the attributes or variables, and various statistical measures applicable to them, such as standard deviation and kurtosis.
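As one way to make the kurtosis measure concrete, the sketch below computes it as the fourth central moment divided by the squared variance, using the population (divide by n) form. The function names and sample values are assumptions for illustration; statistical packages often apply bias corrections that this version omits.

```python
def kurtosis(values):
    """Fourth central moment divided by the squared second central moment."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / m2 ** 2

def excess_kurtosis(values):
    """Kurtosis relative to a normal distribution, whose kurtosis is 3."""
    return kurtosis(values) - 3.0

# Hypothetical, evenly spread values: flatter than a normal distribution.
data = [1.0, 2.0, 3.0, 4.0, 5.0]
print(kurtosis(data), excess_kurtosis(data))
```

A negative excess kurtosis, as here, is described as platykurtic; a positive one as leptokurtic.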





Real number
real, reals, real-valued
The values may be numbers, such as real numbers or integers, for example representing a person's height in centimeters, but may also be nominal data (i.e., not consisting of numerical values), for example representing a person's ethnicity.
Integer
integers, integral, Z
The values may be numbers, such as real numbers or integers, for example representing a person's height in centimeters, but may also be nominal data (i.e., not consisting of numerical values), for example representing a person's ethnicity.
Number
number system, numerical, numbers
The values may be numbers, such as real numbers or integers, for example representing a person's height in centimeters, but may also be nominal data (i.e., not consisting of numerical values), for example representing a person's ethnicity.

Level of measurement
quantitative, levels of measurement, scale
The values may be numbers, such as real numbers or integers, for example representing a person's height in centimeters, but may also be nominal data (i.e., not consisting of numerical values), for example representing a person's ethnicity.
Missing data
missing values, missing at random, incomplete data
However, there may also be missing values, which must be indicated in some way.
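One common way to indicate a missing value is a dedicated marker that cannot be mistaken for real data. The sketch below assumes Python's `None` as that marker; the variable names and numbers are hypothetical.

```python
# Heights in centimeters; None marks an observation that was not recorded.
heights_cm = [170.0, None, 165.5, 180.0, None]

# Separate the observed values from the missing ones before computing anything.
observed = [h for h in heights_cm if h is not None]
missing_count = len(heights_cm) - len(observed)

# Statistics are then computed on the observed values only.
mean_observed = sum(observed) / len(observed)
print(missing_count, round(mean_observed, 2))
```

Using a marker such as `None` rather than a sentinel number like 0 or -1 avoids silently distorting sums and means.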

Sampling (statistics)
sampling, random sample, sample
In statistics, data sets usually come from actual observations obtained by sampling a statistical population, and each row corresponds to the observations on one element of that population.
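A minimal sketch of this idea, assuming simple random sampling without replacement: the population below is synthetic, and the field names and seed are arbitrary choices for the example.

```python
import random

# Hypothetical population: one dictionary per element (here, per person).
population = [
    {"person_id": i, "height_cm": 150 + (i * 7) % 40}
    for i in range(1000)
]

# A seeded generator keeps the example reproducible.
rng = random.Random(42)

# Each sampled row holds the observations on one element of the population.
sample_rows = rng.sample(population, k=5)
for row in sample_rows:
    print(row)
```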




Statistical population
population, subpopulation, subpopulations
In statistics, data sets usually come from actual observations obtained by sampling a statistical population, and each row corresponds to the observations on one element of that population.
Algorithm
algorithms, algorithm design, computer algorithm
Data sets may further be generated by algorithms for the purpose of testing certain kinds of software.
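A small generator of this kind could look like the sketch below. The function name `generate_test_dataset`, the field names, and the value ranges are assumptions made for illustration; the key property is that a fixed seed reproduces the same data on every run, which keeps software tests repeatable.

```python
import random

def generate_test_dataset(n_rows, seed):
    """Generate a reproducible synthetic data set with n_rows records."""
    rng = random.Random(seed)
    return [
        {
            "id": i,
            "value": rng.uniform(0.0, 100.0),      # a real-valued variable
            "label": rng.choice(["a", "b", "c"]),  # a nominal variable
        }
        for i in range(n_rows)
    ]

dataset = generate_test_dataset(n_rows=10, seed=123)
print(dataset[0])
```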






