Data integration

Data integration involves combining data residing in different sources and providing users with a unified view of them.

Big data

The need for data integration arises with increasing frequency as the volume of data (that is, big data) and the need to share existing data grow rapidly.
Big data requires a set of techniques and technologies with new forms of integration to reveal insights from data-sets that are diverse, complex, and of a massive scale.

Data transformation

The trend in data integration favored loosening the coupling between data and providing a unified query interface to access real-time data over a mediated schema (see Figure 2), which allows information to be retrieved directly from the original databases.
Data transformation is a fundamental aspect of most data integration and data management tasks, such as data wrangling, data warehousing, and application integration.
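A minimal sketch of the mediated-schema idea may help here. It assumes two hypothetical sources (`CRM_ROWS` and `BILLING_ROWS`) and a hypothetical mediated schema of `name` and `city`; none of these names come from the source text. Each wrapper transforms rows from its source into the mediated schema at query time, so nothing is copied into a central store.

```python
from typing import Callable, Dict, Iterator, List

# Hypothetical sources with different native layouts (assumed for illustration).
CRM_ROWS = [{"cust_name": "Ada Lovelace", "cust_city": "London"}]
BILLING_ROWS = [("Alan Turing", "Manchester")]

Record = Dict[str, str]


def crm_wrapper() -> Iterator[Record]:
    """Transform CRM rows into the mediated schema (name, city)."""
    for row in CRM_ROWS:
        yield {"name": row["cust_name"], "city": row["cust_city"]}


def billing_wrapper() -> Iterator[Record]:
    """Transform billing tuples into the mediated schema (name, city)."""
    for name, city in BILLING_ROWS:
        yield {"name": name, "city": city}


WRAPPERS: List[Callable[[], Iterator[Record]]] = [crm_wrapper, billing_wrapper]


def query_mediated(predicate: Callable[[Record], bool]) -> List[Record]:
    """Answer a query over the mediated schema by reading every source live."""
    return [rec for wrapper in WRAPPERS for rec in wrapper() if predicate(rec)]


londoners = query_mediated(lambda rec: rec["city"] == "London")
print(londoners)
```

The caller only ever sees the mediated field names; adding a source means adding one wrapper, not changing the queries.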

Data warehouse

IPUMS used a data warehousing approach, which extracts, transforms, and loads data from heterogeneous sources into a single view schema so data from different sources become compatible.
The typical extract, transform, load (ETL)-based data warehouse uses staging, data integration, and access layers to house its key functions.
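As a rough illustration of the staging, integration, and access layers, the sketch below runs a tiny ETL pass with Python's standard `sqlite3` module. The two source extracts, their field names, and the `person_income` table are hypothetical; they are not taken from IPUMS or any real warehouse.

```python
import sqlite3

# Hypothetical source extracts with incompatible conventions (assumed).
SOURCE_A = [{"id": "1", "income_usd": "52000"}]
SOURCE_B = [{"person": 2, "income_thousands": 48.5}]


def extract():
    """Staging layer: copy raw records, tagged by origin, without changing them."""
    return [("a", rec) for rec in SOURCE_A] + [("b", rec) for rec in SOURCE_B]


def transform(staged):
    """Integration layer: convert every record to (person_id, income_usd)."""
    rows = []
    for origin, rec in staged:
        if origin == "a":
            rows.append((int(rec["id"]), float(rec["income_usd"])))
        else:
            # Source B reports income in thousands of dollars.
            rows.append((int(rec["person"]), float(rec["income_thousands"]) * 1000.0))
    return rows


def load(conn, rows):
    """Load the harmonized rows into the single target schema."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS person_income "
        "(person_id INTEGER PRIMARY KEY, income_usd REAL)"
    )
    conn.executemany("INSERT INTO person_income VALUES (?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(extract()))

# Access layer: consumers query one compatible schema.
result = conn.execute(
    "SELECT person_id, income_usd FROM person_income ORDER BY person_id"
).fetchall()
print(result)
```

Unlike a mediated-schema approach, the data here is physically copied, so queries run against the warehouse rather than the original sources.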

Data architecture

It was determined that current data modeling methods were imparting data isolation into every data architecture in the form of islands of disparate data and information silos.
Data integration, for example, should be dependent upon data architecture standards since data integration requires data interactions between two or more data systems.

Ontology-based data integration

This approach represents ontology-based data integration.
It is one of several data integration approaches and may be classified as Global-As-View (GAV).

Data virtualization

Advanced data virtualization is also built on the concept of object-oriented modeling in order to construct a virtual mediated schema or a virtual metadata repository, using a hub-and-spoke architecture.
This concept and the software that implements it are a subset of data integration and are commonly used within business intelligence, service-oriented architecture data services, cloud computing, enterprise search, and master data management.


Datalog

The theory of query processing in data integration systems is commonly expressed using conjunctive queries and Datalog, a purely declarative logic programming language.
In recent years, Datalog has found new application in data integration, information extraction, networking, program analysis, security, and cloud computing.
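To make the declarative style concrete, here is a small sketch of naive bottom-up evaluation for a two-rule Datalog program: `reachable(X, Y) :- edge(X, Y).` and `reachable(X, Z) :- reachable(X, Y), edge(Y, Z).` The `edge` facts and the function name are assumptions chosen for illustration, and real engines use more efficient strategies than this fixpoint loop.

```python
def reachable(edges):
    """Naively evaluate the two reachable/2 rules until no new facts appear."""
    # Rule 1: every edge fact is a reachable fact.
    facts = set(edges)
    while True:
        # Rule 2: join reachable(X, Y) with edge(Y, Z) on the shared variable Y.
        derived = {(x, z) for (x, y) in facts for (y2, z) in edges if y == y2}
        new_facts = facts | derived
        if new_facts == facts:  # fixpoint reached
            return facts
        facts = new_facts


edges = {("a", "b"), ("b", "c"), ("c", "d")}
closure = reachable(edges)
print(sorted(closure))
```

The body of the second rule is a conjunctive query (a join of two atoms), which is the same query form commonly used to describe mappings in data integration systems.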

Information silo

Issues with combining heterogeneous data sources, often referred to as information silos, under a single query interface have existed for some time.

Semantic integration

Some of the work in data integration research concerns the semantic integration problem.

Cyberinfrastructure

National Science Foundation initiatives such as Datanet are intended to make data integration easier for scientists by providing cyberinfrastructure and setting standards.
United States federal research funders use the term cyberinfrastructure to describe research environments that support advanced data acquisition, data storage, data management, data integration, data mining, data visualization and other computing and information processing services distributed over the Internet beyond the scope of a single institution.

Extract, transform, load

IPUMS used a data warehousing approach, which extracts, transforms, and loads data from heterogeneous sources into a single view schema so data from different sources become compatible.

Federated database system

A data-integration solution may address this problem by considering these external resources as materialized views over a virtual mediated schema, resulting in "virtual data integration".

Core data integration

Core data integration is the use of data integration technology for a significant, centrally planned and managed IT initiative within a company.

Edge data integration

An edge data integration is an implementation of data integration technology undertaken in an ad hoc or tactical fashion.

Data mapping

Data mapping is used as a first step for a wide variety of data integration tasks.
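One simple way to picture a data mapping is as a table from source field names to target field names. The sketch below assumes a hypothetical `FIELD_MAP` and record layout; real mappings often also carry type conversions and value transformations, which are left out here.

```python
# Hypothetical mapping from source field names to target field names (assumed).
FIELD_MAP = {"cust_name": "name", "zip": "postal_code", "tel": "phone"}


def apply_mapping(record, field_map):
    """Rename mapped fields; drop source fields that have no mapping."""
    return {
        target: record[source]
        for source, target in field_map.items()
        if source in record
    }


mapped = apply_mapping(
    {"cust_name": "Ada", "zip": "N1 9GU", "fax": "n/a"}, FIELD_MAP
)
print(mapped)
```

Here the unmapped `fax` field is silently dropped; a stricter design might raise an error or log unmapped fields instead.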


Dataspaces

Dataspaces are an abstraction in data management that aims to overcome some of the problems encountered in data integration systems.

Data fusion

In the geospatial (GIS) domain, data fusion is often synonymous with data integration.

Master data management

In business, master data management (MDM) is a method used to define and manage the critical data of an organization to provide, with data integration, a single point of reference.

Integration competency center

An integration competency center (ICC), sometimes referred to as an integration center of excellence (COE), is a shared service function providing methodical data integration, system integration, or enterprise application integration within organizations, particularly large corporations and public sector institutions.

Data blending

Data blending has been described as different from data integration because data analysts need to merge sources very quickly, too quickly for any practical intervention by data scientists.

Three-schema approach

It proposes three different views in systems development, with conceptual modelling being considered the key to achieving data integration.

Enterprise information integration

Enterprise Information Integration (EII) applies data integration commercially.

Schema matching

Automating these two approaches has been one of the fundamental tasks of data integration.
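As a hedged illustration of what automation can look like, the sketch below proposes column correspondences between two schemas purely from name similarity, using `difflib.SequenceMatcher` from the Python standard library. The column names and the `0.6` threshold are assumptions; practical matchers also look at data types, instance values, and constraints.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Case-insensitive string similarity in the range [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def match_schemas(source_cols, target_cols, threshold=0.6):
    """Pair each source column with its most similar target column, if close enough."""
    matches = {}
    for source in source_cols:
        best = max(target_cols, key=lambda target: similarity(source, target))
        if similarity(source, best) >= threshold:
            matches[source] = best
    return matches


# Hypothetical schemas (assumed for illustration).
target_cols = ["CustomerName", "PostalCode", "Phone"]
proposed = match_schemas(["customer_name", "zip_code"], target_cols)
print(proposed)
```

Name similarity alone misses synonyms such as "zip" and "postal", which is one reason the proposals from a matcher like this are usually reviewed by a person.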

Web data integration

Web data integration (WDI) is an extension and specialization of data integration that views the web as a collection of heterogeneous databases.