Every execution, every table, every user—captured as active metadata and unified for governance, observability, and automation.
Jobs & Pipeline Execution
User Access & Security
User Data Assets & Lineage
Keboola gives you a single pane of glass into every job, transformation, and data flow — with runtime, cost, and error details automatically captured. Instead of chasing failures across platforms, your team gets a complete picture in one place.
From ingestion to transformation to output, Keboola tracks every dataset, bucket, column, and dependency. Engineers gain complete lineage to understand how data flows, transforms, and is consumed across projects and teams.
Over 50 types of security and access events are automatically captured — from user logins and token creation to data exports and role changes and seamlessly streamed into your SIEM (Splunk, Datadog, ELK, etc.) for real-time monitoring and automated alerting.
Every action inside Keboola — from job runs to data app sessions — is tracked and transformed into insights your teams can act on. Usage and adoption metadata helps you see how your platform is being used, where it’s underutilized, and how to accelerate adoption.
Every project can be mapped to a team or use case. Usage in the projects is automatically tracked and attributed, enabling a “Zero IT Budget” model where IT owns the platform but reinvoices departments based on actual consumption of cloud resources.
Built-in audit trails and access controls keep you compliant while engineers keep shipping.
Replace 10+ siloed tools with one source of truth for lineage, observability, and governance.
Track true usage and re-invoice costs to departments — proving ROI and ending shadow IT spend.
Power a data catalog, lineage, and activity center directly in Keboola — giving you transparency, control, and trusted data without managing multiple platforms.
Keboola Data Catalog transforms raw datasets into certified data products enriched with full metadata context. This makes it simple to discover, share, and reuse trusted data across teams — with governance baked in.
Turn Keboola metadata into a live control tower for your Keboola projects — monitor spend, performance, and security in one unified view, built to keep your data operations lean, transparent, and reliable.
From source to dashboard, Keboola tracks every dataset and dependency. Spot issues before they cascade, understand the impact of every schema change, and ship with peace of mind.
Run your entire data lifecycle—ingestion to AI delivery—on one governed platform, replacing dozens of tools and eliminating integration headaches.
Turn any dataset into AI-powered applications and automations—safely, predictably, and at scale.
Run Keboola on AWS, Azure, GCP, or your private cloud—without lock-in, with seamless integration to your existing stack.
Whether you’re coding in Python or SQL, building flows in UI, or prompting AI to generate pipelines—Keboola adapts to your workflow.
Trace every dataset, transformation, and output instantly—across the entire platform—for compliance, debugging, and trust.
Keboola is both—a unified AI & Data Platform built from the ground up to support autonomous AI operations with enterprise-grade reliability and security. It can be fully operated via your preferred AI agents, IDEs, or assistants—like Cursor, Windsurf, Claude, and more. From building and orchestrating pipelines to creating new integrations, handling maintenance, or running analytics and insights, you can control Keboola entirely from your chosen interface—while keeping every process secure, governed, and auditable.
Keboola Metadata is a unified metadata layer that combines data catalog, lineage, observability, and governance in one platform. Unlike standalone catalogs, it is natively embedded in the data pipeline, automatically capturing metadata from every job, dataset, user, and project — no extra tools required.
Without metadata, data becomes fragmented and untrustworthy. Keboola ensures full transparency of data flows, sources, and transformations, helping teams trust their data, speed up troubleshooting, and meet compliance requirements.
Traditional catalogs require heavy integrations and manual work. Keboola’s metadata is automatically collected and actionable, directly linked to pipeline operations, cost tracking, and governance controls.
Tags are custom key-value attributes applied to tables, buckets, flows, or configurations. They enable certified datasets, sensitive data classification, and separation of environments (dev/prod).
Tags allow IT leaders to certify datasets, restrict sensitive data, and automate masking policies. This creates policy-based governance without slowing engineering teams.
Keboola captures 44+ telemetry tables with 1,100+ attributes, covering jobs, flows, transformations, storage, users, tokens, cost usage, and security events. This gives a complete observability layer for the entire data lifecycle.
Yes. Keboola was designed API-first, which means all telemetry and metadata can be exported or accessed programmatically. You can push detailed metrics — from jobs, flows, storage, users, and costs — into observability platforms, BI dashboards, or SIEM/security tools using native connectors or APIs.
Keboola also supports the OpenLineage standard, making it easy to integrate with external data catalogs, governance tools, and lineage platforms that use the same protocol. This ensures interoperability in complex enterprise environments, while still giving you the advantage of having complete, automatically captured metadata natively inside Keboola.
With this flexibility, data teams can use Keboola as their central metadata backbone while still connecting seamlessly to Collibra, Atlan, or other external governance ecosystems if required.
Yes. Keboola was built for multi-project, multi-maintainer organizations, which means you get a structured metadata model out of the box. Metadata is automatically scoped at the organization, project, and even branch level, giving IT and data leaders granular visibility and control across complex environments.
This structure ensures that metadata is consistent, discoverable, and actionable across departments, business units, or use cases — without requiring additional configuration. For example:
- Organization level: global policies, compliance, and cost tracking.
- Project level: individual team workloads, departmental datasets, or specific business use cases.
- Branch level: development vs. production environments for safe iteration.
Because this hierarchy is natively part of Keboola, you don’t need to stitch together metadata from scattered tools. Instead, you get a single source of truth that reflects your actual organization structure, ready for governance, observability, cost attribution, and integration into external catalogs.