
Cloudera Introduces Cloudera DataFlow for the Public Cloud

A new data service on Cloudera Data Platform to automate and manage cloud-native data flows, increasing operational efficiency and reducing cloud costs.

  • Tuesday, 17th August 2021, by Phil Alsop

Cloudera has launched Cloudera DataFlow for the Public Cloud, a cloud-native service for data flows to process hybrid streaming workloads on the Cloudera Data Platform (CDP). With Cloudera DataFlow for the Public Cloud, users can now automate complex data flow operations, boost the operational efficiency of streaming data flows with auto-scaling capabilities, and cut down on cloud costs by eliminating infrastructure sizing guesswork. 

Data-in-Motion volumes are expected to grow exponentially, up to 79 ZB, across all industries, according to IDC’s Worldwide Global DataSphere IoT Device and Data Forecast, 2021–2025. Many organizations already use Apache NiFi to capture and process data across hybrid cloud architectures by visually designing no-code data flows. While the cloud provides an easy outlet for processing or storing massive volumes, several challenges remain. Deploying dozens of sophisticated data flows into a single cluster complicates operations and monitoring. When multiple NiFi flows compete for the same resources, performance can suffer. IT administrators often choose larger infrastructure sizes out of caution, leading to underutilization and high costs. Finally, companies want a pay-as-you-go model to avoid paying for resources not in use.

“Cloudera DataFlow automates and manages cloud-native data flows on Kubernetes - and it is something only we offer,” said Dinesh Chandrasekhar, Head of Product Marketing, Data-in-Motion at Cloudera. “Now it is easy for our customers to boost the operational efficiency of their streaming workloads and save on infrastructure costs in the public cloud.” 

“Companies are doing the balancing act between efficiency and performance on one side and cost control as they scale up their streaming workloads,” said Maribel Lopez, Founder & Principal Analyst at Lopez Research. “The adoption of hybrid cloud architectures only escalates that challenge. Tech leaders need intelligent tools that help them streamline the process of running and managing workloads in the cloud.” 

Cloudera DataFlow for the Public Cloud is a powerful cloud-native service for NiFi on Kubernetes. It includes key operational and monitoring capabilities that address these challenges and that basic data flow services typically lack:

●   Central Flow Catalog for manageability, discovery, and version control

●   Central dashboard for monitoring, troubleshooting, and performance tuning of data flows across multiple cloud clusters

●   Simple deployment wizard and robust APIs for auto-scaling flows on Kubernetes managed by CDP

●   Pre-built flows called “ReadyFlows” for some of the common streaming use cases

