Wednesday, 26th June 2019
Logo

Databricks and Informatica partner

Databricks and Informatica have formed a partnership to accelerate the development of intelligent data pipelines. As a result of the partnership, the companies introduced product integrations that provide rapid and efficient data ingestion, simplified creation of high-volume data pipelines, and integrated data governance for intelligent data discovery and end-to-end lineage. The partnership is being announced on the keynote stage by CEOs Ali Ghodsi and Anil Chakravarthy at Informatica World 2019, taking place now in Las Vegas.

Today data engineering and data science teams depend on many hybrid data sources that make finding the right datasets and tracing the lineage of data through pipeline processing impossible. Bringing the Informatica capabilities for discovery, lineage, ingestion and preparation together with Databricks’ Unified Analytics Platform provides an analytics solution for intelligent data pipelines that leverages the correct datasets and provides end-to-end data lineage for analytics and machine learning implementations.

The Informatica and Databricks partnership introduces product integrations that allow faster development and complete governance for data engineering workloads:

  • Informatica’s Cloud Data Integration and Databricks’ Unified Analytics Platform enable data teams to quickly ingest data directly into a managed data lake from hundreds of hybrid data sources.
  • Informatica’s Big Data Management with Databricks’ Unified Analytics Platform allows data teams to easily create performant, scalable data pipelines for big data. Using Informatica’s visual drag and drop workflows, data teams can define their data pipelines to run on highly optimized Apache Spark™ clusters in Databricks to provide high performance at scale.
  • Informatica’s Enterprise Data Catalog provides support for tracking data lineage of pipelines with Databricks’ Unified Analytics Platform, and makes Databricks tables available as part of the data catalog.

Informatica is also announcing support for Delta Lake, the new open source project from Databricks, to provide an analytics-ready place to store massive amounts of data. Delta Lake provides ACID transactions and schema enforcement that brings reliability at scale to data lakes and makes high quality datasets ready for downstream analytics.

Alteryx predicts that England will lose to India and Australia will take home the trophy.
A new report from Dun & Bradstreet reveals businesses are missing revenue opportunities and losing c...
Following the success of Snowflake’s sold-out inaugural user conference, Snowflake Summit on June 3-...
In a joint pilot project, the Finnish Olympic Committee, Polar and Tieto are testing a new system fo...
GridGain Systems has launched the GridGain Data Lake Accelerator, an in-memory solution for digital...
WHISHWORKS reveals results of 2019 Big Data Survey with in-depth look at trends and challenges.
Chief data officers (CDOs) and their data and analytics (DA) teams are focusing on the right priorit...
Protecting and improving the nation’s vital infrastructure – such as energy, transport and digital c...