Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

Anonymization

"Anonymization" of data means processing it with the aim of irreversibly preventing the identification of the individual to whom it relates. Data can be considered anonymised when it does not allow identification of the individuals to whom it relates, and it is not possible that any individual could be identified from the data by any further processing of that data or by processing it together with other information which is available or likely to be available.

Attribute name

In Saagie data governance, there are names given to a field.

...

  • Raw data
  • Intermediate data
  • Final data
  • Not specified 

Database

A database is a collection of information that is organized so that it can be easily accessed, managed and updated.

Data is organized into rows, columns and tables, and it is indexed to make it easier to find relevant information.

Dataset

A dataset is a collection of related, discrete items of data that may be accessed individually or in combination or managed as a whole entity. A dataset is organized into some type of data structure.

Dataset can have 3 types :

  • TABLE
  • DIRECTORY
  • FILE

Database

A database is a collection of information that is organized so that it can be easily accessed, managed and updated.

...


Domain

In Saagie Data Governance, domains are used to group a set of datasets by theme. They can correspond to the departments in the company for example. They will facilitate the exploration of the data lake.

...

  • Input by user : 1.1
  • No, empty, null named : 1.0

Master data

Master data means that for a table, the field/name attribute is the master, so the reference value.

Personal data

According to the law, personal data means any information relating to an identified or identifiable individual; an identifiable person is one who can be identified, directly or indirectly, in particular by reference to an identification number (e.g. social security number) or one or more factors specific to his physical, physiological, mental, economic, cultural or social identity (e.g. name and first name, date of birth, biometrics data, fingerprints, DNA…).

Primary key

A primary key is a special relational database table column (or combination of columns) designated to uniquely identify all table records.

Provenance

Provenance is a source from which the dataset comes.

...