
data.world
Active metadata management software
Graph databases
Data fabric software
Machine learning data catalog software
Data governance tools
- Features
- Ease of use
- Ease of management
- Quality of support
- Affordability
- Market presence
Take the quiz to check if data.world and its alternatives fit your requirements.
$12 per month
Small
Medium
Large
- Public sector and nonprofit organizations
- Healthcare and life sciences
- Education and training
What is data.world
data.world is a cloud-based data catalog and governance platform that helps organizations document, discover, and manage data assets across analytics and operational environments. It supports use cases such as business glossary management, metadata harvesting, lineage/impact analysis, and governed data access workflows for data producers, analysts, and governance teams. The platform emphasizes knowledge-graph-style relationships between data assets and collaborative curation (e.g., stewardship, annotations, and shared context) to improve reuse and consistency.
Strong catalog and glossary
The product provides core catalog functions such as searchable inventory of datasets, documentation, and business glossary terms. It supports stewardship-oriented workflows to standardize definitions and link terms to technical assets. These capabilities align well with common governance operating models where business and technical stakeholders share ownership of definitions and metadata.
Metadata relationships via graph
data.world models relationships among data assets, terms, people, and processes, which helps represent context beyond simple tables-and-columns inventories. This approach supports richer navigation across related assets and can improve impact analysis when relationships and lineage are populated. It is useful in environments where understanding cross-domain dependencies is as important as locating a dataset.
Integrations for metadata ingestion
The platform is designed to ingest metadata from multiple data sources and tools, enabling a centralized view of assets across the stack. This supports governance and discovery in heterogeneous environments where metadata is distributed. Centralized ingestion reduces reliance on manual documentation when connectors and automated harvesting are available for the organization’s systems.
Lineage depth varies by source
End-to-end lineage and impact analysis quality depends on which systems are connected and what metadata those systems expose. Some environments require additional configuration, custom integration work, or complementary tooling to achieve detailed transformation-level lineage. Organizations should validate lineage coverage for their specific ETL/ELT, BI, and orchestration tools before standardizing on it.
Governance rollout requires stewardship
Like other governance platforms, value depends on sustained stewardship, curation, and adoption by data owners and producers. Without defined operating processes (ownership, certification, issue management, and change control), catalogs can become incomplete or outdated. Teams should plan for roles, workflows, and ongoing maintenance rather than treating it as a one-time implementation.
Not a general graph database
Although it uses graph concepts for metadata and relationships, it is not positioned as a general-purpose graph database for transactional or application workloads. Organizations needing a graph database for application development may still require a dedicated graph database product. Fit is strongest for metadata, governance, and discovery rather than arbitrary graph application queries.
Plan & Pricing
| Plan | Price | Key features & notes |
|---|---|---|
| Individual (Free) | Free | 3 private projects/datasets; 3 live tables (data virtualization trial, no size limit); 100 MB limit per dataset/project; 1 GB total storage. (Community offering on data.world). |
| Individual Pro | $12 per month | 20 private projects/datasets; 20 live tables (data virtualization, no size limit); 3 GB limit per dataset/project; 100 GB total storage. (Community offering on data.world). |
| Essentials | Custom pricing | Tier 1 integrations; Private instance; Fully managed SaaS; Data inventory, glossary, metadata management; Data virtualization & query federation — up to 20 datasets & projects (preview features noted); SAML authentication; Usage & governance reports; 1 US region. 10 users included. |
| Standard | Custom pricing | Includes Essentials plus Tier 2 integrations; Enterprise support & SLAs; Full audit log history; Eureka Explorer Lineage; Graph visualization; Code preview; Impact analysis; Upstream audit; Data Governance Core. |
| Enterprise | Custom pricing | Includes Standard plus Tier 3 integrations (2 included); data.world Collector (on‑prem metadata collection agent); data.world Bridge (on‑prem HA, secure connectivity); AWS PrivateLink; Choice of US or EU region. |
| Enterprise+ | Custom pricing | Custom package for special security/contract needs (e.g., single‑tenant install with AWS account isolation, customer‑managed keys, custom security certifications). |
Notes: Users — 10 users included in each tier; additional annual fee per user (volume pricing available). Expand with add‑ons (additional Tier 3 integrations, extended data virtualization, sensitive data discovery, HIPAA BAA, DPA, etc.). For enterprise pricing and quotes, contact sales via data.world's pricing/get-pricing flows.
Seller details
data.world, Inc.
Austin, Texas, United States
2015
Private
https://data.world/
https://x.com/datadotworld
https://www.linkedin.com/company/data-world/