Glossary

Learning the jargon of data asset management can feel like learning another language. This non-exhaustive glossary is a guide to the terms used in this ever-changing field.

Please feel free to reference this page!

Data

A datum is a raw description, at a given moment, of an elementary part of a resource. It is devoid of any reasoning, supposition, observation or probability. A data item is associated with a variable (property) and its observation (value), which is either qualitative or quantitative.
Data is a set of such elementary data items.

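As an illustration only, a data item can be sketched as a small Python record. The `Datum` class, its field names and the sample values below are hypothetical and do not come from this glossary; they simply mirror the definition: a resource part, a variable (property), its value, and the moment of observation.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Union


@dataclass(frozen=True)
class Datum:
    """One elementary data item (hypothetical structure, for illustration)."""
    resource_id: str            # the resource part being described
    variable: str               # the property being observed
    value: Union[str, float]    # the observed value
    observed_at: datetime       # the given moment of the description

    @property
    def is_quantitative(self) -> bool:
        # Numeric values are treated as quantitative, anything else as qualitative.
        return isinstance(self.value, (int, float)) and not isinstance(self.value, bool)


# "Data" is then simply a set of such elementary items.
data = [
    Datum("pipe-17", "diameter_mm", 250.0, datetime(2023, 5, 1)),
    Datum("pipe-17", "material", "cast iron", datetime(2023, 5, 1)),
]
```

In this sketch, the first item is quantitative and the second is qualitative, while both describe the same resource at the same moment.
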
Data architecture

Data architecture consists of the policies, regulations and models that define the requirements for acquiring, storing, processing, organising, integrating and publishing an organisation's data. Data architecture is one of the pillars of enterprise architecture activities.

Data archive

A data archive is a storage repository holding archived copies of all or part of one or more datasets. The data it contains describes old resources that no longer exist.

Data economy

Like any economy, the data economy is a human activity that consists of the production, distribution, exchange and consumption of products, in this case information products.

Data infrastructure

A data infrastructure is a digital infrastructure that supports the sharing and consumption of data. Like other infrastructures, it is a necessary system for the functioning of an economy, which in this case is the data economy.

Data lake

A data lake is a storage repository containing a large amount of structured, semi-structured and unstructured data. It is a space for storing datasets in their native format, with no fixed limit on their size or number. It provides access to a large amount of data in order to increase the capacity to integrate and analyse raw data.

Data management

Data management covers the development, execution and oversight of plans, policies, programmes and practices that deliver, control, protect and enhance the value of an organisation's intangible data and information assets throughout their life cycle.

Data mart

A data mart stores all the data relating to one analysis or publication theme of an organisation. Its content comes strictly from the corporate data warehouse. The structure of the data is based, among other things, on the data exchange models to which the organisation is subject.

Data model

A data model is a model of reality that defines the structure and content of datasets. It is a mapping of the types of resources defined in an information model to the datasets needed for any information system.

Data repository

A data repository is a place where data is stored and access is guaranteed through an application service.
Data repositories can be broken down into several major classes: data sources, data warehouses, data marts, and data archives.

Data scientist

A data scientist is an employee responsible for adding value to the organisation's data through analyses based on statistical and machine learning models. The data scientist manipulates all the data in order to extract knowledge useful for optimising the organisation's activities.

Data source

A data source is a storage repository containing datasets that are either acquired from outside the organisation or created and maintained within it.

Data steward

A data steward is an employee with a deep understanding of the business meaning of the data portfolio they manage, as well as an operational understanding of how data is represented and linked within data pools.

Data swamp

A data swamp is a data lake saturated by the integration of a multitude of unorganised and unclassified data, making it inaccessible or of little value to the users concerned.

Data warehouse

A data warehouse is a storage repository containing all the structured data specific to a particular business area of an organisation. The structure of the data reflects a consolidated view of the business area in question, thanks to the integration of all related data sources.

Dataset

A dataset is a homogeneous collection of data.
A dataset defines a set of resources of the same types. A dataset can be simple, when it corresponds to a single entity class, or complex, when it consists of several related object classes. A dataset can have a tabular, tree or graph structure.

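The three structures mentioned above can be sketched with plain Python containers. This is a minimal illustration, not a prescribed representation: the resource names (`pipe-1`, `valve-1`, `network-A`) and the helper functions `is_homogeneous` and `tree_ids` are hypothetical.

```python
# Tabular: rows that all share the same properties (a simple dataset).
pipes_table = [
    {"id": "pipe-1", "diameter_mm": 250, "material": "cast iron"},
    {"id": "pipe-2", "diameter_mm": 100, "material": "PVC"},
]

# Tree: each resource nests its child resources.
network_tree = {
    "id": "network-A",
    "children": [
        {"id": "pipe-1", "children": []},
        {"id": "pipe-2", "children": []},
    ],
}

# Graph: resources as nodes, relations as an adjacency list.
network_graph = {
    "pipe-1": ["valve-1"],
    "valve-1": ["pipe-1", "pipe-2"],
    "pipe-2": ["valve-1"],
}


def is_homogeneous(rows):
    """Return True if every row exposes the same set of properties."""
    return len({frozenset(row) for row in rows}) <= 1


def tree_ids(node):
    """Collect the ids of a node and all its descendants, depth first."""
    ids = [node["id"]]
    for child in node["children"]:
        ids.extend(tree_ids(child))
    return ids
```

Here `is_homogeneous(pipes_table)` checks the "homogeneous collection" property for the tabular case, and `tree_ids(network_tree)` walks the tree form of the same resources.
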
Digital twin

A digital twin is the set of data describing a resource throughout its life cycle.