CDAP is an open source framework for building data analytic applications.
It provides developers with data and application abstractions to simplify and accelerate application development, address a broader range of real-time and batch use cases, and deploy applications into production quicker while satisfying enterprise requirements.
CDAP runs of Apache Hadoop® distributions such as Cloudera Enterprise Data Hub, the Hortonworks® Data Platform, or the MapR Distribution. It also supports all the major public clouds such as AWS, Microsoft Azure and Google Cloud Platform.
CDAP exposes developer APIs (Application Programming Interfaces) for creating applications and accessing core CDAP services. CDAP defines and implements a diverse collection of services that support applications and data on existing Hadoop infrastructure such as HBase, HDFS, YARN, MapReduce, Hive, and Spark.
In addition, CDAP also provides built-in applications that solve vertical industry use-cases such data preparation, integration, metadata management and machine learning.
You can download the latest CDAP Sandbox in its various forms from the sidebar. Follow the steps listed on the Getting Started page to get started with CDAP.
Once that you’ve had a look at the CDAP SDK, take a look at:
- Examples, located in the /examples directory of the CDAP SDK;
- Selected Examples (demonstrating basic features of the CDAP) are located on-line;
- Tutorials and user guides for example use-cases;
- Developers’ Manual for guides on using CDAP APIs to build applications and plugins;
- Javadocs for public APIs;
- If you are ready to install CDAP in a distributed environment, refer to the installation guide for distributed mode
- October 12, 2018: CDAP 5.1.0 released. View the Release Notes.
- August 29, 2018: Talk on CDAP 5.0, that provides support to run your application anywhere, including on premises as well as in the cloud. More info.
- July 31, 2018: CDAP 5.0 is now GA. View the Release Notes.
CDAP is an Apache licensed open source software, with a rapidly growing user community. The CDAP team welcome all types of contributions, whether they are bug reports, feature requests, documentation, or code patches.
We have a simple pull-based development model with a consensus-building phase, similar to Apache’s voting process. If you would like to contribute to CDAP whether adding new features, enhancing existing features, or fixing bugs, you can follow the guidelines here.
Thanks for helping to improve CDAP and its ecosystem!