The Data Economy: Meet the hybrid data scientist-application developer

The Data Economy: Meet the hybrid data scientist-application developer

Apr 25, 2014
Jeffrey Kelly

http://siliconangle.com/blog/2014/04/25/the-data-economy-meet-the-hybrid-data-scientist-application-developer/

Hadoop meets application development tooling

On the tools set side of the equation, Hortonworks recently expanded its partnership with Concurrent, which sells support services for the open source Cascading application development framework. When I spoke with the company last fall, Concurrent Founder and CTO Chris Wensel described Cascading as a Java library used by application developers to quickly create complex, data oriented applications. Concurrent’s Cascading SDK abstract’s away the complexity of dealing with things like MapReduce and Pig, allowing developers to integrate data sources via APIs and easily migrate predictive models into Hadoop. (You can explore sample Cascading-based apps on GitHub here.)

HDP with Cascading (Source: Hortonworks)
HDP with Cascading (Source: Hortonworks)

As part of the expanded partnership, Hortonworks said it will ensure ongoing compatibility of Cascading-based apps with the Hortonworks Data Platform and will provide level 1 and level 2 Cascading support for customers (Concurrent will still handle level 3 support.) This compatibility includes the ability to execute Cascading-based apps on Apache Tez, a recently developed Hadoop-based execution engine for real-time Big Data workloads. While Concurrent itself is still in its early days, open source Cascading is quite popular with application developers, garnering over 90,000 downloads per month.