Category Archives: blog

Mistaking a Data Library for a Data Lake: 7 best practices for developing your Hadoop data strategy

Originally published in InsideBigData on October 2, 2015, by Daniel Gutierrez In this special guest feature, Supreet Oberoi of Driven, Inc (formerly Concurrent). talks about how companies should change their perspective on their data strategies, and look at the process as building a data library as opposed to a data lake. Supreet is the vice […]

READ MORE

Reducing Hospital Readmissions with Big Data Predicative Analytics

By: Michael Covert, CEO of Analytics Inside Hospital readmission is an event that health care providers are attempting to reduce, and it is a primary target of new regulation from the US Affordable Care Act. A readmission is defined as ANY reentry to a hospital 30 days or less from a prior discharge. A financial […]

READ MORE

3 Things to Change to Keep Your Big Data Apps Running Smoothly

It doesn’t matter if you have 5 or 500 Big Data applications in production on your Hadoop cluster(s), the operational challenge is the same (though scale comes into the picture at some point) and understanding how your applications are actually behaving is key.  There are a number of tools available to help you manage and […]

READ MORE

Solving Hadoop Problems, For Fun and Profit

Originally published in datanami on July 6, 2015 by Alex Woodie Things move quickly in the Hadoop world, and keeping up can be hard to do. Just ask Chris Wensel, the creator of the popular open source development tool Cascading and CTO at Concurrent. While Wensel spends many hours keeping Cascading current with every Hadoop […]

READ MORE

Your Hadoop App Broke. Now What?

Traditionally, a lot of time is spent collecting and preparing data, and then, eventually, you get around and build an app that makes use of the data. You create the right views and get the insights you need. Life is awesome – all that data you collected has value, and the Hadoop project is a […]

READ MORE

11 Tools for a Healthy Hadoop Relationship

Supreet Oberoi May 8th 2015 http://thenextweb.com/dd/2015/05/08/11-tools-for-a-healthy-hadoop-relationship/ I’m often asked which Hadoop tools are “best.” The answer, of course, is that it depends, and what it depends on is the stage of the Hadoop journey you’re trying to navigate. Would you show up with a diamond ring on a first date? Would you arrange dinner with […]

READ MORE

Now, We’re All Data Miners

Imagine finding out that your headquarters is sitting on a diamond mine. But you’re an architectural firm, oil company, or a commercial real estate company — what do you know about diamonds? Data is like that. Simply put, no matter what kind of company you are, you’re in the data business and you’re sitting on […]

READ MORE

Boosting Hadoop Performance through Dev and Ops Collaboration

When an enterprise Hadoop app fails to perform up to expectations, the finger-pointing can get ugly. Developers blame the operators, convinced that adding hardware, inspecting the data, or improving cluster utilization is the answer. Operators, in turn, throw the problem back on developers, arguing that code optimization is what’s needed. Even in forward-thinking teams with […]

READ MORE

Conquering Your Hadoop Fear in 5 Easy Steps

For longer than I’d like to admit, I put off learning Hadoop the way I put off making dentist appointments. I had years of experience building data-centric applications and APIs, but with Hadoop there were all these intimidating new questions. Which distribution? What cluster size? MapReduce? How do I test? What do Hive, Pig, Phoenix, […]

READ MORE

How to Escape the Dark Valley of Your Hadoop Journey

It happens to the best of us. You know your business is bursting with useful data, and you’ve only begun to scratch the surface. So you strike out to build an analytical platform, using all the great open-source tools you’ve been hearing so much about. First, you have to capture all the data that’s coming […]

READ MORE