_CORE
AI & Agentic Systems Core Information Systems Cloud & Platform Engineering Data Platform & Integration Security & Compliance QA, Testing & Observability IoT, Automation & Robotics Mobile & Digital Banking & Finance Insurance Public Administration Defense & Security Healthcare Energy & Utilities Telco & Media Manufacturing Logistics & E-commerce Retail & Loyalty
References Technologies Blog Know-how Tools
About Collaboration Careers
CS EN
Let's talk

Soda — deklarativní monitoring kvality dat pomocí YAML

09. 01. 2026 1 min read intermediate

Soda nabízí YAML-based přístup k monitoringu kvality dat. Soda Checks Language umožňuje definovat kontroly bez kódu.

Soda Checks Language

# checks/orders.yml
checks for fact_orders:
  - row_count > 0
  - missing_count(order_id) = 0
  - duplicate_count(order_id) = 0
  - min(total_czk) >= 0
  - freshness(order_date) < 1d
  - values in (customer_id) must exist in dim_customers (customer_id)

Spuštění

from soda.scan import Scan
scan = Scan()
scan.set_data_source_name("warehouse")
scan.add_sodacl_yaml_file("checks/orders.yml")
scan.execute()
if scan.has_check_fails():
    for c in scan.get_checks_fail():
        print(f"✗ {c.name}: {c.outcome}")

Soda vs Great Expectations

  • Soda — YAML, rychlý start, SQL-native
  • GX — Python API, komplexní validace

Summary

Soda je ideální pro deklarativní YAML přístup ke kvalitě dat. Rychlý start a snadná integrace.

sodadata qualitymonitoringyaml
Share:

CORE SYSTEMS tým

Stavíme core systémy a AI agenty, které drží provoz. 15 let zkušeností s enterprise IT.