Anonymization
"Anonymization" of data means processing it with the aim of irreversibly preventing the identification of the individual to whom it relates. Data can be considered anonymized when it does not allow identification of the individuals to whom it relates, and when no individual could be identified from it through further processing or by combining it with other information that is available or likely to be available.
Attribute name
In Saagie Data Governance, an attribute name is the name given to a field.
...
- Raw data
- Intermediate data
- Final data
- Not specified
Database
A database is a collection of information that is organized so that it can be easily accessed, managed and updated.
Data is organized into rows, columns and tables, and it is indexed to make it easier to find relevant information.
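As a minimal sketch of rows, columns, tables and an index, the example below uses Python's built-in sqlite3 module with an in-memory database. The customer table, its columns, the index name and the sample rows are hypothetical and are not part of Saagie.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with one hypothetical table and one index."""
    conn = sqlite3.connect(":memory:")
    # A table organizes data into columns (id, name, city) and rows.
    conn.execute("CREATE TABLE customer (id INTEGER, name TEXT, city TEXT)")
    conn.executemany(
        "INSERT INTO customer (id, name, city) VALUES (?, ?, ?)",
        [(1, "Alice", "Rouen"), (2, "Bob", "Paris"), (3, "Chloe", "Rouen")],
    )
    # An index on city lets the engine find matching rows without a full scan.
    conn.execute("CREATE INDEX idx_customer_city ON customer (city)")
    conn.commit()
    return conn


def names_in_city(conn, city):
    """Return the customer names for a city, sorted alphabetically."""
    rows = conn.execute(
        "SELECT name FROM customer WHERE city = ? ORDER BY name", (city,)
    ).fetchall()
    return [name for (name,) in rows]
```

Calling names_in_city(build_demo_db(), "Rouen") looks up rows through the city column; the index changes how the lookup is performed, not what it returns.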
Dataset
A dataset is a collection of related, discrete items of data that may be accessed individually or in combination or managed as a whole entity. A dataset is organized into some type of data structure.
A dataset can have one of three types:
- TABLE
- DIRECTORY
- FILE
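The three dataset types listed above could be modeled as an enumeration. This is only an illustrative sketch: the DatasetType class and the parse_dataset_type helper are hypothetical names and do not come from the Saagie API.

```python
from enum import Enum


class DatasetType(Enum):
    """Hypothetical model of the three dataset types named in this glossary."""
    TABLE = "TABLE"
    DIRECTORY = "DIRECTORY"
    FILE = "FILE"


def parse_dataset_type(label):
    """Return the DatasetType for a label, ignoring case and surrounding spaces.

    Raises ValueError when the label is not one of the three known types.
    """
    return DatasetType(label.strip().upper())
```

Using an enumeration rather than free-form strings means an unknown type is rejected at the point where it is parsed.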
Domain
In Saagie Data Governance, domains are used to group datasets by theme. They can correspond, for example, to the departments of a company, and they make the data lake easier to explore.
...
- Input by user : 1.1
- No, empty, null named : 1.0
Master data
Master data means that, for a given table, the field (attribute name) is the master, that is, it holds the reference value.
Personal data
According to the law, personal data means any information relating to an identified or identifiable individual. An identifiable person is one who can be identified, directly or indirectly, in particular by reference to an identification number (e.g. social security number) or to one or more factors specific to their physical, physiological, mental, economic, cultural or social identity (e.g. surname and first name, date of birth, biometric data, fingerprints, DNA…).
Primary key
A primary key is a special relational database table column (or combination of columns) designated to uniquely identify all table records.
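To make the uniqueness requirement concrete, the sketch below checks whether a column, or a combination of columns, could serve as a primary key for a set of rows. The is_unique_key function and the sample column names are hypothetical. The check also rejects missing values, since relational databases generally require primary key columns to be non-null.

```python
def is_unique_key(rows, key_columns):
    """Return True if key_columns identify every row uniquely with no missing values.

    rows is a list of dicts mapping column name to value (hypothetical layout).
    """
    seen = set()
    for row in rows:
        key = tuple(row[col] for col in key_columns)
        # A key with a missing part, or one already seen, cannot identify the row.
        if any(value is None for value in key) or key in seen:
            return False
        seen.add(key)
    return True


# Hypothetical sample data: order_id repeats, but (order_id, line_no) does not.
order_lines = [
    {"order_id": 1, "line_no": 1, "item": "pen"},
    {"order_id": 1, "line_no": 2, "item": "ink"},
    {"order_id": 2, "line_no": 1, "item": "pen"},
]
```

Here order_id alone is not a valid key because it repeats, while the combination of order_id and line_no identifies each row.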
Provenance
Provenance is the source from which a dataset comes.
...