LICENSE: Apache 2.0


CDAP is an open source framework for building data analytics applications.

It provides developers with data and application abstractions to simplify and accelerate application development, address a broader range of real-time and batch use cases, and deploy applications into production more quickly while satisfying enterprise requirements.

CDAP runs on Apache Hadoop® distributions such as Cloudera Enterprise Data Hub, the Hortonworks® Data Platform, or the MapR Distribution. It also supports all the major public clouds, such as AWS, Microsoft Azure, and Google Cloud Platform.

CDAP exposes developer APIs (Application Programming Interfaces) for creating applications and accessing core CDAP services. CDAP defines and implements a diverse collection of services that support applications and data on existing Hadoop infrastructure such as HBase, HDFS, YARN, MapReduce, Hive, and Spark.

CDAP also provides built-in applications that solve vertical industry use cases such as data preparation, integration, metadata management, and machine learning.

Getting Started

You can download the latest CDAP Sandbox in its various forms from the sidebar. Follow the steps listed on the Getting Started page to get started with CDAP.

Once you’ve had a look at the CDAP SDK, take a look at:

Latest News

  • October 12, 2018: CDAP 5.1.0 released. View the Release Notes.
  • August 29, 2018: Talk on CDAP 5.0, which adds support for running your applications anywhere, on premises as well as in the cloud. More info.
  • July 31, 2018: CDAP 5.0 is now GA. View the Release Notes.


CDAP is Apache-licensed open source software with a rapidly growing user community. The CDAP team welcomes all types of contributions, whether they are bug reports, feature requests, documentation, or code patches.

Reporting Issues

To report bugs or request new features, please open an issue in the CDAP JIRA. You can also use the cdap-dev mailing list for general questions or discussions.

Contributing Code

We have a simple pull-based development model with a consensus-building phase, similar to Apache’s voting process. If you would like to contribute to CDAP, whether by adding new features, enhancing existing features, or fixing bugs, you can follow the guidelines here.

Thanks for helping to improve CDAP and its ecosystem!