Data governance is a set of processes, policies and tools for managing data assets. It ensures quality, security and compliance.
Data Governance Pillars¶
Roles¶
- Data Owner — business owner of domain
- Data Steward — quality and policy enforcement
- Data Engineer — technical implementation
- DPO — GDPR and data protection
Data Classification¶
data_classification:
public:
description: "Publicly available data"
internal:
access: "employees"
confidential:
encryption: "at rest + in transit"
restricted:
encryption: "column-level"
masking: "required in non-prod"
Metrics¶
- Data Quality Score — % of data meeting rules
- Catalogue Coverage — share of documented assets
- Time to Data — how quickly analysts find data
Summary¶
Data governance is key for data trustworthiness. Clear roles, classification and automated controls.
data governancecompliancemetadataquality