Bug fixes and enhancements for metadata collectors are released regularly. Go here to to view the past release notes.

πŸ“˜

Stay updated on collector releases!

To keep up with the latest updates and enhancements to data.world collectors, subscribe to the RSS feed from your favorite RSS reader.

Details about the release

Item

Details

Release version

​​2.295

Release date

​​5 September 2025​​

Docker image ID ​ ​

​

Link to download the Docker image:​​ ​https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64:​​ de6998c350b20a56f9640d655c6636800d40d0dbcd8ed59423d22ead0f570780
  • ​arm64:​​ 604b61820a188097deb41980f524aca7619698a2af05efade087e0fbc36fc3e0

Jar file ​ ​ ​ ​ ​​ ​ ​ ​


​

Link to download the JAR file:​​ ​https://releases.data.world/dwcc/2.295/dwcc-2.295.zip

  • ​​Sha256:​​ 656182bc4510f81f5d18d6ae0e7c3e3dd5ac83a8b03a8b60b12b830bcea9bfed

New features and changes

  • ​​Marquez collector: ​ Added support for harvesting Marquez datasets associated with Databricks, extending lineage and metadata coverage.
  • ​​SQL Server collector​: Added support for SQL Server replications, improving visibility into replicated database environments.
  • ​​QlikSense collector Introduced validation for user-configured site identifiers and added warnings when identifiers are invalid or inaccessible.
  • Microsoft Fabric collector:
    • Added support for variables and parameters in data pipeline activities even when those activities have not had a recent run.
    • Added support for warehouse sources that use SQL queries in Copy Activities, broadening coverage of pipeline sources.
  • Sigma collector: Now supports lineage from datasets to source tables or other datasets, improving traceability of dataset dependencies.
  • Databricks collector: Enhanced lineage harvesting to support SQL statements containing the struct function.
  • dbt core and dbt clould collectors: Added support for dbt projects targeting SQL Server databases using encryption, improving compatibility in secure environments.
  • Power BI collector: Added support for harvesting report images embedded in a zip file, ensuring complete metadata capture from reports.

​Bug fixes

  • ​​Databricks collector:
    • Fixed an issue where column properties were not cataloging the correct values due to an API bug.
    • Fixed redundant collection of workspace resources and jobs.
    • Updated the gitProvider property to support both uppercase and camelCase values returned by the Databricks Jobs API.
  • ​​Snowflake, Redshift, Databricks, Denodo, Oracle, PostgreSQL, Teradata, MySQL, Db2, Netezza, SQL Server, SAP HANA collectors: Resolved parsing errors that occurred when harvesting view lineage from views whose SQL contained column comments with parentheses.
  • SQL Server Reporting Services (SSRS) collector: Corrected incorrect detection of when to use SOAP vs REST API, ensuring proper connectivity for older and newer SSRS versions.
  • OpenAPI collector: Fixed an issue with duplicate identification of API resources, ensuring unique resource cataloging.

​ ​ ​​ ​ ​ ​ ​​

Bug fixes and enhancements for metadata collectors are released regularly. Go here to to view the past release notes.

πŸ“˜

Stay updated on collector releases!

To keep up with the latest updates and enhancements to data.world collectors, subscribe to the RSS feed from your favorite RSS reader.

Details about the release

Item

Details

Release version

​​2.294

Release date

​​21 August 2025​​

Docker image ID ​ ​

​

Link to download the Docker image:​​ ​https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64:​​ 3ec6779e76bd02363a337100629f5b5cf230ef1799de6919b7d88bf1cc7e7ab4
  • ​arm64:​​ f1aa1cacb78c657e05f22b2e1ac61d35c1a46582052bd36bc1f07633ed4abea3

Jar file
​ ​ ​ ​ ​​ ​ ​ ​


​

Link to download the JAR file:​​ ​https://releases.data.world/dwcc/2.294/dwcc-2.294.zip

  • ​​Sha256:​​ 1efacdf6e5915e7fc3ceeb64812a23c003e4944385ac9c88721c9e7d46e5c6da

New features and changes

  • ​​Tableau collector: ​​Harvests the ​last published date​​ for workbooks and data sources, providing greater visibility into update history.
  • ​​Alteryx collector​: Catalogs additional metadata, including the ​caption tag​ in ​ToolContainer​​, the ​query​ in ​LockInInput​​, and the ​SQL​ in ​DbFileInput​​.
  • ​​QlikSense collector Added a configuration option to ​include​ or ​exclude applications​​, giving users more control over the scope of harvested metadata.

​Bug fixes

  • ​​Athena collector : Fixed an issue where the collector stopped after 100 tables. It now correctly ​harvests more than 100 tables​​ in a database.
  • ​​Sigma collector​​: Enhanced the workbook filter to avoid missing workbook exceptions, improving reliability during harvesting.
  • ​​MySQL collector:​ Fixed an error in fetching statistics for columns whose names are ​reserved SQL keywords​​.
  • ​​Microsoft Fabric collector: Resolved issues in stored procedure harvesting by properly resolving names when using ​pipeline variables and parameters​​, and updated relationship types to r​epresent dependencies​​ more accurately.

​
​ ​​ ​ ​ ​ ​​

Bug fixes and enhancements for metadata collectors are released regularly. Go here to to view the past release notes.

πŸ“˜

Stay updated on collector releases!

To keep up with the latest updates and enhancements to data.world collectors, subscribe to our RSS feed.

Details about the release

Item

Details

Release version

​​2.293​​

Release date

​​11 August 2025​​

Docker image ID ​ ​

​

Link to download the Docker image:​​ ​https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64:​​ 3135b99ac3e7222ce28edfa1e9609aa39ec53ebbdd270272cfeccb8bc07e3f3b
  • ​arm64:​​ 541eb90896f09aa98abc588bffa728d2ac70cc92305b272cc9fe09fb2b692426

Jar file
​ ​ ​ ​ ​​ ​ ​ ​


​

Link to download the JAR file:​​ ​https://releases.data.world/dwcc/2.293/dwcc-2.293.zip

  • ​​Sha256:​​ 56b6f467c1c5310345227f7f0a6d7830ed6aecc3f899dd5bf558a13e0cc38392

​Bug fixes

  • ​Power BI collector:​ Fixed an issue to ensure that when a ​database name​ is provided in the ​datasources.yaml​​ file, it is always used and not overridden by values retrieved from a database query.
  • ​Tableau collector:​ Fixed an issue so that ​Published Datasources​ are only cataloged when they are being used in a ​Project that is in scope​​, preventing unnecessary or irrelevant catalog entries.
  • ​​Informatica CDI collector:​​ Resolved an unexpected exception that was causing the collector to fail, improving stability and reliability.

​
​ ​​ ​ ​ ​ ​​

Details about the release​

Item

Details

Release version

​​2.292​​

Release date

​​1 August 2025​​

Docker image ID ​ ​

​

Link to download the Docker image:​​ ​https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64:​​ sha256:c8813ea1b7589f37397f26ccd31824ab1bf12f2797066a21498fd3e889f12a87
  • ​arm64:​​ sha256:80793256c70d6afb5385d9f067a88ba353ed47d298183b403d20c0f4a14fb8b4​​

Jar file
​ ​ ​ ​ ​​ ​ ​ ​


​

Link to download the JAR file:​​ ​https://releases.data.world/dwcc/2.292/dwcc-2.292.zip

  • ​​Sha256:​​ e1fa87f14be306c236731256e673a40bba05e8d4a2e9dbc0e0d833b1428b0356

​New features and changes​

  • SQL Server collector: Added support for harvesting ​agent jobs​​.
  • Sigma collector: Added configuration options to include or exclude workspaces, providing greater control over which resources are harvested.
  • Redshift collector: Now supports harvesting external tables defined via AWS Glue.
  • Microsoft Fabric collector:
    • Dataflow Gen2 is now treated as a separate resource type from Dataflows.
    • Added support for cataloging destinations and table-level lineage for sources and destinations in Dataflow Gen2 CI/CD types.
  • Microsoft Fabric and Power BI collectors: Now catalog refresh schedules for resources where refresh configuration is available, helping track automated data updates.
  • AWS Glue collector: Now identifies partitioned columns separately from other columns.

​​Bug Fixes​​
​

  • Marquez collector: Fixed a null pointer exception that could occur when a job lacked a latest run.
    ​

​
​ ​​ ​ ​ ​ ​​

Details about the release​

Item

Details

Release version

​​2.291​​

Release date

​​1 August 2025​​

Docker image ID ​ ​

​

Link to download the Docker image:​​ ​https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64:​​ sha256:2138f6297b46e4e3e1f103272c5d6ee5c4b9ccac62298312c51a4291be510cfe
  • ​arm64:​​ sha256:7a91ef17763f34c61f8e164392a47476876b04048cd6d3df27921f35ae571b5b​​

Jar file
​ ​ ​ ​ ​​ ​ ​ ​


​

Link to download the JAR file:​​ ​https://releases.data.world/dwcc/2.291/dwcc-2.291.zip

  • ​​Sha256:​​ 7c6c9410aa1785593072efc46f78679a35384d14dc67b25049ef45995dcf5618

​New features and changes​

  • ​​Microsoft Fabric collector:​:​
    • Added support for harvesting ​Apps​ and ​Org Apps​​ in Microsoft Fabric.
    • Added support for harvesting GraphQL instances.

​​Bug Fixes​​
​

  • Marquez collector: Now skips unsupported dataset types, preventing errors during harvesting.
    ​

​
​ ​​ ​ ​ ​ ​​

Details about the release​

Item

Details

Release version

​​2.290​​

Release date

​​17 July 2025​​

Docker image ID ​ ​

​

Link to download the Docker image:​​ https://hub.docker.com/r/datadotworld/dwcc/tags

  • amd64:​​ b9e38013cc79f30c38f86c3431d5188ba2f8327e103171735993a1c7bc75ad03
  • ​arm64:​​ e9d242b411d4759bbdd12c5e592d1afd3f9c1e5b139027ea41bad1a6a4979d6a​​

Jar file
​ ​ ​ ​ ​​ ​ ​ ​


​

Link to download the JAR file:​​ ​https://releases.data.world/dwcc/2.290/dwcc-2.290.zip

  • ​​Sha256:​​ 724e96e1fb936c2106730e11b55d6aa4eb96b08d25dd4f2d95fd0f7a632d1b11

​New features and changes​

  • A ​new collector​​, the ​​OpenAPI collector, is now available in ​public preview​​. It supports harvesting metadata from APIs described using OpenAPI v3.0, enabling documentation and cataloging of API assets.
  • ​​Snowflake, Redshift, Databricks, Denodo, Oracle, PostgreSQL, Teradata, MySQL, Db2, Netezza, SQL Server, SAP HANA collectors:​
    • Added a ​Sensitive Data Classification​ option to allow classification using a ​hosted private-ai instance.​​
    • Now include ​column statistics support​ for ​Date, Timestamp, and Boolean data types, enhancing profiling depth​​ across supported databases.
  • ​​Postgres collector:​ Supports ​AWS IAM authentication​ via ​secret and access key parameters​​, offering more secure and flexible credential management.

​​Bug Fixes​​
​

  • Oracle collector: Fixed an issue in the ​table index feature​ that previously caused permission errors or max open cursor issues by updating the query logic to use ​DBA_​​ views when available.
  • SSIS Collector: Now harvests ​deeply nested control flow executables​​, ensuring complete control flow visibility.
  • Snowflake, Redshift, Databricks, Denodo, Oracle, PostgreSQL, Teradata, MySQL, Db2, Netezza, SQL Server SAP HANA collectors: Improved sampling behavior for environments where ​TABLESAMPLE​ is unsupported by falling back to ​LIMIT​ or ​TOP​​ clauses to compute statistics more reliably.
    ​

​
​ ​​ ​ ​ ​ ​​