News & Events

New Case Study Posted - UpStream

UpStream provides a marketing performance management cloud service. They chose Cascading to streamline data manipulation, allow development of reusable components, and make development quick and easy for a faster time to market. See the full case study here.


New Online Class, An Introduction to Cascading

Our partner, Scale Unlimited, will be offering their online course, Introduction to Cascading, this November 18th.


New Case Study Posted - Solusi247

Solusi247 is an integrated service provider for information technology. They purchased an OEM license and Level 3 support for Cascading and are embedding it in their GRID247 framework for ETL and analytical data processes. See the full case study here.


New Case Study Posted - Trulia

Trulia is the fastest-growing online real estate resource. They use Cascading as part of their main data processing pipeline, where it provides an easy, higher-level abstraction for running multiple MapReduce jobs. See the full case study here.


New Case Study Posted - TeleNav

TeleNav used Cascading to build a sophisticated machine learning framework that supports the company’s award-winning GPS navigation and mobile workforce management service. See the full case study here.


New Case Study Posted - Bixo Labs

Bixo Labs uses Cascading to provide efficient, reliable and scalable workflow systems for big data processing. See the full case study here.


True Ventures Announces Investment in Concurrent

True Ventures invests in Concurrent, which is helping to define and lead the market for products around data processing and management that give context to growing volumes of data. Read the full article here.


Concurrent, Inc. featured in GigaOM article

The article highlights Concurrent’s seed round of funding and describes current and future plans of the company. Read the full article here.


New Case Study Posted - Etsy

Etsy is using Cascading to provide advanced features on their community and marketplace sites. See the full case study here.


Concurrent announces partnership with MapR

The two companies partner to expand usage of Hadoop in the enterprise. Read the full press release here.


Concurrent, Inc. is now a MapR Advantage partner

MapR has announced its new partner program to expand usage of Hadoop. Read the MapR press release (with a quote from Concurrent’s Chris Wensel).


Chris Wensel Speaking at BigDataCamp 2011

Chris is doing a lightning talk at BigDataCamp in Santa Clara. His talk: “Concurrent is Hiring!”, June 28th at 6:30 PM.


Chris Wensel Speaking at UberConf 2011

Chris Wensel will be speaking at 3:15 on July 13. Topic: “Hadoop Architecture - In Depth”.


New Case Study Posted - Ion Flux

Ion Flux is using Cascading and Amazon Elastic MapReduce to analyze DNA sequence data. See the full case study on AWS here: http://aws.amazon.com/solutions/case-studies/ion-flux/


Concurrent is Hiring

We’re hiring Java and Web developers to take lead positions on our product team. See more here: http://www.concurrentinc.com/careers/


Speaking at Berlin Buzzwords 2011, June 6

Berlin, Germany
6th-7th June, 2011

Topic: Common MapReduce Patterns. In this talk Chris Wensel will introduce the MapReduce model and discuss in some depth the most common patterns seen in Hadoop MapReduce applications including Joins, Secondary Sorting, and Partial Aggregations. The session is Monday (6 June) from 16:20 - 17:00, in Urania Humboldtsaal (Pink). For more on the conference: http://berlinbuzzwords.de/


Cascading 1.2 Released

We are happy to announce that Cascading 1.2 is now publicly available for download.

This release features many performance and usability enhancements while remaining backwards compatible with 1.0 and 1.1.

Specifically:

  • Performance optimizations during grouping (StreamComparator)
  • Composable map-side partial aggregations (AggregateBy)
  • Native Riffle support for non-Cascading (or nested iterative Cascading) processes (ProcessFlow and Riffle)

For a detailed list of changes see: CHANGES.txt

We are also happy to announce that Cascading and its extensions have their own Maven/Ivy Jar repository, Conjars. Conjars is a public repository: any developer wishing to publish Cascading libraries and extensions can register their public key and push artifacts. Conjars is a simple fork of the Clojars repo code.

Along with this release are a number of extensions created by the Cascading user community.

Among these extensions are:

  • Cascading.Avro - Cascading Scheme for the Apache Avro data serialization format.
  • Cascading.Memcached - Integration with Memcached, Membase, and ElasticSearch.
  • Bixo - a web mining toolkit
  • DBMigrate - a tool for migrating data to/from RDBMSs into Hadoop
  • Apache HBase, Amazon SimpleDB, and JDBC integration
  • JRuby and Clojure based scripting languages for Cascading
  • Cascalog - a robust interactive extensible query language

This release will run against Hadoop 0.19.x and 0.20.x, including Amazon Elastic MapReduce.


BigDataCamp 2010 - Videos Now Online

BigDataCamp 2010 was a huge success this year, with well over 250 registrants in attendance, making this unconference nearly the same size as the first Hadoop Summit in 2008.

All of the BigDataCamp workshop videos are now online. In particular, our founder, Chris K Wensel, presented on Cascading. Check it out if you missed it live.


BigDataCamp 2010

Concurrent, Inc. will be one of the sponsors of this year’s BigDataCamp, held the night before the Hadoop Summit.

BigDataCamp is an unconference for data engineers, enterprise architects, developers, analysts, and data mining and business intelligence professionals working with or interested in learning more about Hadoop. Amazon Web Services is also sponsoring the event and providing free Amazon Web Services credits to be used during the workshop part of the event. BigDataCamp is designed for users of Hadoop, MapReduce, and related technologies to exchange ideas in a loosely defined format and will take place on June 28, 2010 in Santa Clara, CA, the evening prior to the annual Hadoop Summit.

BigDataCamp will be led by Dave Nielsen, co-organizer of the popular CloudCamp series of unconferences. Pre-defined topics will include best practices in application development and advanced analytics and will be presented in the form of a workshop with free Amazon Web Services credits for use with Amazon Elastic MapReduce. Other topics will be determined by conference attendees through majority-vote rule.

BigDataCamp is free but limited to 150 attendees. BigDataCamp attendees will receive a 30% discount on registration for the Hadoop Summit. More information and registration details can be found at http://www.bigdatacamp.org.


Cascading 1.1.0 Now Available

We are happy to announce that Cascading 1.1.0 is now publicly available for download.

This release features many performance and usability enhancements while remaining backwards compatible with 1.0.

Specifically:

  • Performance optimizations with all join types
  • Numerous job planner optimizations
  • Dynamic optimizations when running in Amazon Elastic MapReduce and S3
  • API usability improvements
  • Support for TSV, CSV, and custom delimited text files
  • Support for manipulating and serializing non-Comparable custom Java types
  • Debug levels supported by the job planner

For a detailed list of changes see: CHANGES.txt

Along with this release are a number of extensions created by the Cascading user community.

Among these extensions are:

  • Bixo - a web mining toolkit
  • DBMigrate - a tool for migrating data to/from RDBMSs into Hadoop
  • Apache HBase, Amazon SimpleDB, and JDBC integration
  • JRuby and Clojure based scripting languages for Cascading
  • Cascalog - a robust interactive extensible query language

This release will run against Hadoop 0.18.3, 0.19.x, and 0.20.x, including Amazon Elastic MapReduce.

Note that the tests will not compile or run against Hadoop 0.18.3 due to package changes since that version.
