• Build complex data processing applications with any JVM language
  • Develop on desktop and deploy at scale to any Hadoop cluster
  • Integrate with and test against multiple external data stores

Cascading is a Java application framework that enables developers to quickly and easily build rich enterprise grade Data Processing and Machine Learning applications that can be deployed and managed across private or cloud-based Apache Hadoop clusters and API compatible distributions.

Community Cascading Site ->


Cascading 2.1

Lots of new features have been added that allow applications to be more testable and portable, and data to be manipulated in many more ways.
Learn more →

Features

Cascading has a rich set of capabilities that enable developers to quickly conduct advanced data analytics using familiar relational data management constructs.
Learn more →

Getting Started

It’s easy to get started. Learn from our documentation and sample applications how to quickly create simple data pipes and flows in the context of real-world scenarios.
Learn more →

Concurrent Tweets Concurrent Tweets