Category Archives: Events

MeetUp | Simplifying Application Development on Hadoop – May 26, 2014

Sign-up here: http://meetu.ps/2kSwC4

When:
Monday, May 26, 2014
7:00 PM (Central European Time Zone)

Where:
SoundCloud
Greifswalder Str. 2112-213
Berlin, Germany

Simplifying Application Development on Hadoop

Hadoop is an integral part of the big data ecosystem and can process huge datasets in a reliable, scalable and distributed manner. But the MapReduce programming model is cumbersome, complex and messy to develop and maintain. Is there a way to harness the power of Hadoop in an easy to use, clean framework and extend it seamlessly for predictive modeling?

This session is about simplifying application development on Hadoop by using Cascading, an open source Java application framework that provides higher level data processing abstractions. This talk focuses on our experience in building complex data flow applications, that are enterprise-grade, through test driven development. The talk will also be showcasing, through live demo and code, how an application developed with Cascading can be extended to integrate with predictive models built with analytics tools, like R, and scaled-out on a Hadoop cluster.

Our Speaker: Vinoth Kannan is a Software Developer and Big Data Engineer at WidasConcepts.

Our Hosts: SoundCloud, based in Berlin, is leveraging Scalding to create the world’s leading social sound platform.

For more information about Cascading: Visit cascading.org

Webinar | Accelerate Big Data Application Development with Cascading and HDP – Apr 22, 2014

Join Hortonworks and Concurrent to learn how to accelerate your big data application development with the popular Cascading framework and Hortonworks Data Platform.

Date: Tuesday, April 22, 2014
Time: 10am Pacific / 1pm Eastern

In this webinar, we will describe how developers can create future proof, data-driven applications built on Apache Hadoop. We will show how organizations can take advantage of the latest Hadoop processing frameworks like YARN and Tez. And we’ll finish it off with a demonstration on application development with Java-based Cascading middleware on the Hortonworks Sandbox.

Register at: http://bit.ly/1qYX9BW

MeetUp | How AdMobius Uses Cascading to Connect Mobile Data to its Users – Apr 7, 2014

Sign-up here: http://meetu.ps/2h3XFn

When:
Monday, Apr 7, 2014
6:00 PM (PT)

Where:
Etsy
20 California St, Floor 3
San Francisco, CA 94111

How AdMobius Uses Cascading to Connect Mobile Data to its Users

AdMobius is the industry’s first Mobile Audience Management Platform (MAMP) and enables publishers and advertisers to discover and target relevant audiences at scale. By organizing and interpreting unique demographic and interest-based information, AdMobius unlocks the rich value of mobile data.

Come and join Jyotirmoy Sundi, Data Engineer at AdMobius, to learn more about how by adopting Cascading, AdMobius was able to create their “Device Graph” data application that aggregates data and pairs that information to mobile devices. Also, learn how AdMobius created their own custom Tap for data interoperability between their application and their existing infrastructure.

This MeetUp will be hosted at Etsy in San Francisco, CA. Food and drinks will be provided.

Event | Concurrent, Inc. to Present at Big Data TechCon 2014

Alexis Roos to Deliver Two Sessions on How to Easily Develop Big Data Applications on Apache Hadoop™

SAN FRANCISCO – March 25, 2014Concurrent, Inc., the enterprise Big Data application platform company, today announced that Alexis Roos, senior solutions architect, will deliver two sessions at the third annual Big Data TechCon 2014, taking place March 31 – April 2 in Boston. This three-day event is the how-to Big Data training conference for professionals implementing and analyzing Big Data, and will feature practical tutorials for IT and Big Data professionals.

Concurrent Presentations At-A-Glance

What:  “How to Build Enterprise Data Apps with Cascading
Who: Alexis Roos, senior solutions architect, Concurrent, Inc.
When: Wednesday, April 2 at 2 p.m. EST
How: Register at http://bigdatatechcon.com/registrationdetails.html

Session Description

Cascading is the most popular application development framework for building enterprise-grade data applications on Apache Hadoop. This open source development framework allows developers to leverage their existing skillsets, such as Java and SQL, to create reliable applications without having to think in MapReduce. In this presentation, Alexis will give an introduction to Cascading, how it works and then dive into building applications with Cascading. Attendees will learn what types of use cases exist for data-driven businesses, how to approach them with Cascading and its vast ecosystems, and the best practices for Cascading application development.

What:  “Cascading Lingual Shows How SQL Can Save Your On-and-Off Relationship with Hadoop
Who: Alexis Roos, senior solutions architect, Concurrent, Inc.
When: Wednesday, April 2 at 3:45 p.m. EST
How: Register at http://bigdatatechcon.com/registrationdetails.html

Session Description

With Hadoop, life is complicated. It’s a give-and-take relationship where you constantly want to migrate workloads onto Hadoop for processing but also want to get data off for reporting and analysis. These scenarios can be tricky and will often complicate your relationship with Hadoop. However, SQL is the therapy that will help. This class will introduce Cascading Lingual, an open-source project that provides ANSI-compatible SQL, enabling fast and simple Big Data application development on Hadoop. Alexis will demonstrate how Cascading Lingual can be used with various tools like R and desktop applications to drive improved execution on Big Data strategies by leveraging existing in-house resources, skills sets and product investments. Attendees will learn how data scientists and developers can now easily work with data stored on Hadoop using their favorite BI tool.

About the Speaker

Alexis Roos is a senior solutions architect focusing on Big Data solutions at Concurrent, Inc. He has more than 18 years of experience in software and sales engineering, helping both Fortune 500 firms and startups build new products that leverage Big Data, application infrastructure, security, databases and mobile technologies. Prior, Alexis worked for Sun Microsystems and Oracle for more than 13 years, as well as Couchbase and several large systems integrators in Europe.

Supporting Resources

About Concurrent, Inc.

Concurrent, Inc. delivers the #1 application development platform for Big Data applications. Concurrent builds application infrastructure products that are designed to help enterprises create, deploy, run and manage data applications at scale on Apache Hadoop™.

Concurrent is the team behind Cascading™, the most widely used and deployed technology for Big Data applications with more than 150,000+ user downloads a month. Used by thousands of businesses including Twitter, eBay, The Climate Corp and Etsy, Cascading is the de-facto standard in open source application infrastructure technology.

Concurrent is headquartered in San Francisco and online at http://concurrentinc.com.

Media Contact
Danielle Salvato-Earl
Kulesa Faul for Concurrent, Inc.
(650) 922-7287
concurrent@kulesafaul.com

Event | Concurrent, Inc. to Present at DeveloperWeek San Francisco 2014

Concurrent’s Alexis Roos to Deliver Session on “Pattern: An Open Source Project for Creating Complex Machine Learning Applications”

SAN FRANCISCO – Feb. 13, 2014Concurrent, Inc., the enterprise Big Data application platform company, today announced that Alexis Roos, senior solutions architect, will deliver a talk, titled “Pattern: An Open Source Project for Creating Complex Machine Learning Applications,” at DeveloperWeek San Francisco. Taking place Feb. 15-21, the conference is the first developer event that brings together thousands of developers to explore, learn and build new skills, applications, startups and product features.

Details At-A-Glance

What:  “Pattern: An Open Source Project for Creating Complex Machine Learning Applications”
Who: Alexis Roos, senior solutions architect, Concurrent, Inc.
When: Tuesday, Feb. 18 at 10 a.m. PST
Where: Workshop Room 2, Terra Gallery, 511 Harrison St., San Francisco
How: Register at http://developerweek.com/register/

Session Description

Cascading Pattern is an open source project that takes models trained in popular analytics frameworks, such as SAS, Microstrategy, SQL Server, etc., and runs them at scale on Apache Hadoop. With Pattern, developers can use a Java API to create complex machine learning applications, such as recommenders or fraud detection. Pattern effectively lowers the barrier of adoption to Apache Hadoop for developers because developers can use existing skill sets to immediately begin building these complex applications.

In this presentation, Concurrent, Inc.’s Alexis Roos, will provide sample code that will show applications using predictive models built in SAS and R, such as anti-fraud classifiers. Additionally, Alexis will compare variations of models for enterprise-class customer experiments.

Extended abstract available at: http://sched.co/1fuIriN

About the Speaker

Alexis Roos is a senior solutions architect focusing on Big Data solutions at Concurrent, Inc. He has more than 18 years of experience in software and sales engineering, helping both Fortune 500 firms and start-ups build new products that leverage Big Data, application infrastructure, security, databases and mobile technologies. Prior, Alexis worked for Sun Microsystems and Oracle for more than 13 years, and has also spent time at Couchbase and several large systems integrators over in Europe. Alexis has spoken at dozens of conferences as well as university courses and holds a Master’s Degree in computer science with a cognitive science emphasis.

Supporting Resources

About Concurrent, Inc.

Concurrent, Inc. delivers the #1 application development platform for Big Data applications. Concurrent builds application infrastructure products that are designed to help enterprises create, deploy, run and manage data applications at scale on Apache Hadoop™.

Concurrent is the team behind Cascading™, the most widely used and deployed technology for Big Data applications with more than 130,000+ user downloads a month. Used by thousands of businesses including Twitter, eBay, The Climate Corp and Etsy, Cascading is the de-facto standard in open source application infrastructure technology.

Concurrent is headquartered in San Francisco and online at http://concurrentinc.com.

Media Contact
Danielle Salvato-Earl
Kulesa Faul for Concurrent, Inc.
(650) 922-7287
concurrent@kulesafaul.com

MeetUp | Etsy’s journey: JRuby to Scalding – Feb 25, 2014

Sign-up here: http://meetu.ps/28L6C0

When:
Tuesday, Feb 25, 2014
6:00 PM (PT)

Where:
Etsy
20 California St, Floor 3
San Francisco, CA 94111

Etsy’s journey: JRuby to Scalding. What happens when a technology chooses you

Come hear Dan McKinley, Principal Engineer at Etsy, talk about his journey from JRuby to Scalding.

After 3 years building features and analytics infrastructure in cascading.jruby, the framework (similar in most ways to Pig) was entrenched. Etsy had just finished migrating their EMR pipeline onto new internal hardware, and standardizing the development environment for their 150+ person product development team. It was at this moment that the Scalding grenade hit. Introduced using guerrilla tactics, within a few months Scalding had been widely adopted. Within a year, cascading.jruby was deprecated. The talk will cover Etsy’s story, the technical problems that precipitated it, and the general unease implied when your technology chooses you.

MeetUp | Cascading Office Hours in San Francisco – Nov 13, 2013

Sign up here: http://meetu.ps/22ZnsY

When:
Wed, Nov 13 6:00 PM (PT)

Where:
Mikkeller Bar
34 Mason Street
San Francisco, CA 94102

Come and greet the Cascading team on Wednesday, Nov 13 in San Francisco!

Cascading gurus, Chris Wensel and Alexis Roos, will be hosting informal office hours and will be there to answer your pertinent questions around Cascading and the related projects you’re working on. Whether you’re a veteran of Cascading and its extensions or simply new to the Cascading framework– this is the right place to get your questions answered by experts!

MeetUp | Cascading meetup in Portland – Jul 25, 2013

Event Info – http://www.pdx-hadoop.eventbrite.com

When:
Thu, Jul 25 6:30-9:30 PM (PDT)

Where
Widmer Brothers Brewery – GreatRoom (Gasthaus)
955 North Russell Street
Portland, OR 97227

Organized/Sponsored By: Aaron Betik, NIKE, Inc
Global Technology Director, Consumer and Digital Analytics & BI

Doors are open at 6:15, Talk starts at 7pm. Light snacks, beverages and brews will be available and will have a social hour following the talk by Paco Nathan.

Cascading is an open source workflow abstraction atop Hadoop and other Big Data frameworks, with a 5+ year history of large-scale Enterprise deployments. For example, half of Twitter’s total compute uses this API, along with other large use cases at eBay, Etsy, Airbnb, LinkedIn, Apple, Climate, Nokia, Factual, Telefonica, etc. Cascading leverages some aspects of functional programming so that developers can create large-scale data pipelines which are robust and easier to operationalize. There are popular DSLs in Scala (Scalding) and Clojure (Cascalog), plus Jython, JRuby, etc. Recent support also implements DSLs for ANSI SQL (Lingual) and PMML (Pattern).