The comprehensive support enables data ingestion into lakehouse environments where data warehouse management features are combined with low-cost storage. The additional support for Delta Lake combined with the enhanced integration and integrity capabilities in Talend Data Fabric enable the fast ingest and optimal processing of reliable, high-quality data for Databricks users to inform machine learning workloads and quickly unlock insights for their business.
Talend also announced the availability of Talend’s Stitch Data Loader in Databricks’ recently announced Data Ingestion Network of partners. This network, showcasing select Databricks partners, brings data teams closer to building the new data management paradigm, lakehouse, which combines the best elements of data lakes and data warehouses, enabling business intelligence and machine learning on all of a business’s data.
“Talend is an important addition to our new partner ecosystem, which was built to speed data ingestion access for our customers,” said Michael Hoff, SVP Business Development and Partners at Databricks. “Talend provides both a powerful integration platform for data engineers and a simple-to-use data ingestion tool for business analysts. This not only helps our customers get started fast, but also gives them a path forward for enterprise data management.”
In addition to extended support for Delta Lake in its recent Winter ’20 release, Talend Data Fabric supports Apache Spark™ 2.4 and Databricks Runtime 5.5 Long Term Support for optimal product performance. Talend also adds major advancements for Spark Dataset, which enables users to take advantage of performance enhancements for optimal Apache Spark processing.
“Working closely with Databricks, our joint customers can achieve higher performance and innovate faster by using Talend Data Fabric to move workloads to Databricks,” said Mike Pickett, SVP Business Development and Ecosystem at Talend. “We look forward to continued collaboration with one of the leading cloud data and AI platforms in the industry.”
These updates enable Talend to provide Databricks users with comprehensive data quality and governance features to support machine learning and advanced analytics, natively supporting the full power of Apache Spark and Delta Lake. Through this integration, users can access the scale and cloud benefits through a drag and drop interface, instead of manually coding data engineering jobs. Talend is integrated with both Azure Databricks and Databricks for AWS.