The Pipelines accelerator enables developers, business analysts, and data scientists to quickly derive insights from data without having to worry about infrastructure and integration.

Pipelines benefits

Integrated with all data

Pipelines provide connectors to relational databases, flat files, mainframes, cloud services, NoSQL, and more.

Increased flexibility

Pipelines increase flexibility through portability across on-premises and public cloud environments.

Reduced complexity

Pipelines reduce complexity through a graphical interface, code-free transformations, and reusable templates.

Improved data trustworthiness

Pipelines improve data trustworthiness through data quality libraries, metadata and lineage capture, and audit logging.

Pipeline features

Connector ecosystem

Built-in connectors to a variety of cloud and on-premises systems, both modern and legacy; public APIs for building custom connectors

Visual data pipeline designer

Build data pipelines by stitching together sources, transformations, sinks, and other nodes using a simple, point-and-click pipeline studio UI
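
As a rough illustration of the source, transformation, and sink model described above, here is a minimal Python sketch. It is not the CDAP plugin API: the names `source`, `clean`, and `run_pipeline` and the sample records are hypothetical, chosen only to show how nodes chain together.

```python
from typing import Callable, Iterable, Iterator, List

Record = dict

def source() -> Iterator[Record]:
    """Hypothetical source node: yields raw records (sample data for illustration)."""
    yield {"name": " Ada ", "age": "36"}
    yield {"name": "Grace", "age": "45"}

def clean(records: Iterable[Record]) -> Iterator[Record]:
    """Hypothetical transformation node: trims names and casts ages to int."""
    for record in records:
        yield {"name": record["name"].strip(), "age": int(record["age"])}

def run_pipeline(
    src: Callable[[], Iterable[Record]],
    transforms: List[Callable[[Iterable[Record]], Iterable[Record]]],
    sink: Callable[[Record], None],
) -> None:
    """Wire the nodes together: source -> each transformation in order -> sink."""
    stream = src()
    for transform in transforms:
        stream = transform(stream)
    for record in stream:
        sink(record)

# Use a plain list as a stand-in sink so the result can be inspected.
collected: List[Record] = []
run_pipeline(source, [clean], collected.append)
```

In the pipeline studio the same wiring is done visually; the sketch only mirrors the idea that each node consumes the output of the previous one.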

Data discovery and governance

Data discovery based on technical, operational, and business metadata; lineage for root-cause and impact analysis

Comprehensive integration toolkit

Conditionals and pre- and post-processing actions; alerting and notifications; error processing

Interactive, code-free transformations

Wide variety of built-in data transformation plugins; Interactive data transformation interface with feedback at each step

Seamless operations

Time- and process-based scheduling; monitoring via logs, metrics, dashboards, and reporting

Plugins

Pipelines support plugins for connecting to a variety of cloud and on-premises systems, as well as for performing data transformations.

Google Cloud
Microsoft Azure
Amazon Web Services
Hadoop

Videos

Learn CDAP: Event Based Triggers for CDAP Pipelines

Learn CDAP: [SCREENCAST] How to create workflows from your existing Spark applications

Learn CDAP: Spark Interpreter Demo - Writing custom Spark code in a CDAP Pipeline

Learn CDAP: EDW Optimization - Offloading data from Oracle to HBase

Learn CDAP: Data Ingest from Azure Blob Store and ADLS using CDAP Pipelines

Learn CDAP: Realtime IOT data Ingestion Using Azure Event Hub and CDAP Pipelines

Learn CDAP: EDW Optimization with Hadoop and CDAP (CDC)

#BDAM: EDW Optimization with Hadoop and CDAP, by Sagar Kapare from Cask

Learn CDAP: Legacy Spark programs in CDAP

Learn CDAP: Transfer data from Oracle DB to CDAP Table

Learn CDAP: Preview for Batch Data Pipelines

Learn CDAP: Preview for Realtime Data Pipelines

Learn CDAP: Hive Export and Import using CDAP Pipelines

Rapid Time to Value with CDAP: From Data File to Apache Kudu in Under 5 Minutes