Wrangler allows you to visually and interactively cleanse and prepare raw data, with the aim of making it consumable for further processing. It provides a standardized UI driven interactive flow that takes the pain out of preprocessing tasks for data engineering, data science and data analysis.

Go to Documentation

Wrangler features

Code free transformations

Interactive, code-free transformations with feedback at each step using a powerful graphical UI

Extensible, comprehensive transformation library

Comprehensive library with over 1000+ built-in transformations; Extensible API for adding more transformations

Comprehensive data source support

Built-in connections to popular cloud and on-prem data sources such as relational databases, file systems, object stores such as AWS S3 and Cloud Storage, Kafka, NoSQL stores

Operationalization using pipelines

One-click pipeline creation for creating scalable and reliable pipelines for mission critical environments

Automatic data quality and profiling

Data quality indicators for determining data quality; data quality library for improving trust and quality; profiling to understand data distribution and column relationships after every transformation

Videos

Learn CDAP: CDAP Data Prep and Pipelines Tutorial

Learn CDAP: [SCREENCAST] Quantize a column - Digitize

Learn CDAP: [SCREENCAST] Parse as Avro using Schema Registry

Learn CDAP: [SCREENCAST] Parse AVRO Binary and Protobuf Records in DataPrep

Learn CDAP: [SCREENCAST] New CDAP Connections

Learn CDAP: [SCREENCAST] DataPrep - Restricted Directives