
- Build complex data processing applications with any JVM language
- Develop on desktop and deploy at scale to any Hadoop cluster
- Integrate with and test against multiple external data stores
Cascading is a Java application framework that enables developers to quickly and easily build rich enterprise grade Data Processing and Machine Learning applications that can be deployed and managed across private or cloud-based Apache Hadoop clusters and API compatible distributions.
Cascading 2.1
Lots of new features have been added that allow applications to be more testable and portable, and data to be manipulated in many more ways.
Learn more →
Features
Cascading has a rich set of capabilities that enable developers to quickly conduct advanced data analytics using familiar relational data management constructs.
Learn more →
Getting Started
It’s easy to get started. Learn from our documentation and sample applications how to quickly create simple data pipes and flows in the context of real-world scenarios.
Learn more →
Concurrent Tweets