Hadoop: Future Of Enterprise Data Warehousing? Are You Kidding?

Blog post info and actions

Blog post body

James Kobielus

I kid you not.

What’s clear is that Hadoop has already proven its initial footprint in the enterprise data warehousing (EDW) arena: as a petabyte-scalable staging cloud for unstructured content and embedded execution of advanced analytics. As noted in a recent blog post, this is in fact the dominant use case for which Hadoop has been deployed in production environments.

Yes, traditional (Hadoop-less) EDWs can in fact address this specific use case reasonably well — from an architectural standpoint. But given that the most cutting-edge cloud analytics is happening in Hadoop clusters, it’s just a matter of time — one to two years, tops — before all EDW vendors bring Hadoop into their heart of their architectures. For those EDW vendors who haven’t yet fully committed to full Hadoop integration, the growing real-world adoption of this open-source approach will force their hands.

Where the next-generation EDW is concerned, the petabyte staging cloud is merely Hadoop’s initial footprint. Enterprises are moving rapidly toward the EDW as the hub for all advanced analytics. Forrester strongly expects vendors to incorporate the core Hadoop technologies — especially MapReduce, Hadoop Distributed File System, Hive, and Pig — into their core architectures. Again, the impressive growth in MapReduce as a lingua franca for predictive modeling, data mining, and content analytics will practically compel EDW vendors to optimize their platforms for MapReduce, alongside high-performance support for SAS, SPSS, R, and other statistical modeling languages and formats. We see clear signs that this is already happening, as with EMC Greenplum’s recent announcement of a Hadoop product family and indications from some of that company’s competitors that they have similar near-term road maps.

Read more

Predictions And Plans For Business Analytics In 2011

Blog post info and actions

Blog post body

James Kobielus

I love reporters. As someone with an M.A. in journalism who then evolved into an analyst, I recognize that both professions occupy approximately the same tier in the industry food chain. In fact, many IT industry analysts were trade press reporters at one point in their careers, and it’s not uncommon for analysts to go back into media institutions later on.

When great longtime IT reporters, such as Computerworld’s Jaikumar Vijayan, call me up to get my thoughts, I’m just as interested in their take on what’s important. Jai recently published an excellent article with my predictions, plus those of another analyst, on the year ahead in analytics. To the jaded reader, these sorts of year-end look-ahead articles may feel like perfunctory rehashes of stuff we’ve been telling them for quite some time, perhaps with a trendy new buzzword thrown in to keep it remotely glance-worthy.

I try not to repeat myself too much. Rather than regurgitate the statements I made in the phone interview with Jai, I’ll highlight how I’m addressing the principal business-analytics trends that I discussed with him — self-service, pervasive, social, scalable, cloud, and real-time—in our 2011 Forrester research agenda:

Read more

Social Media Analytics vs Social Network Analysis: Is There A Real Difference Or Are You Seeing Double?

Blog post info and actions

Blog post body

James Kobielus

As an industry analyst, I’m part of the professional class that delights in defining standard marketplace terminology. More than that, many of us spend our working lives coaxing industry to march under marketing banners aligned with our pet definitions.

Yes, indeed, each analyst likes to feel that his or her marketecture terminology should rule school. Last month I did a Forrester podcast on a topic that’s extremely hot right now: leveraging the power of social media and social networks to manage your brand, drive marketing and sales campaigns, and manage ongoing customer relationships. In that session, I discussed the role of analytics in social media for multichannel customer relationship management (CRM).

My initial impetus for the podcast was to spell out the chief distinctions between two terms that, on first glance, appear almost synonymous: social media analytics and social network analysis. During the podcast I also trucked in another related closely related term—social media monitoring—and even alluded to social intelligence and other phrases that have gained currency.

What follows, for those of you who don’t listen to podcasts, or can’t find them, is the gist of what I said on this topic: 

Read more

Number of People Using Advanced Analytics

Blog post info and actions

Blog post body

James Kobielus

Guesstimates are often essential for market sizing and trending. To be useful, especially where primary data are lacking, they demand a valid conceptual framework. 

Like you, I’m looking forward to the responses to Boris Evelson’s quick Web-based survey, which you can access from his most recent blogpost.It’s always a challenge to assess how truly pervasive BI is—and pervasive it could potentially become.

To generate a valid first approximation, Boris scoped his blog comments and quick survey to “traditional BI” applications (i.e., historical reporting, query, dashboarding). He scoped his estimate only to large enterprise and midmarket firms (i.e., those with 100 or more employees) and only to BI usage in the US.

In order to keep this task manageable, Boris excluded some use cases that are often included in the “traditional BI” category: spreadsheets and other “homegrown” analytics apps; BI embedded in line-of-business apps; and non-interactive, static, published BI outputs. He leveraged both public and Forrester-gathered primary data to gauge how many actual and potential BI users there might be.

Scoping it as he did, Boris estimated that slightly more than 1.5 million people in the US are using traditional BI applications, which is between 2-3 percent of the employees of BI-implementing firms. He suspects the actual percentage might be as high as 6-8 percent of employees, but he’s not sure. That’s why he’s running the Web-based quick survey.

Read more

Social, Spatial, & Temporal: The Coordinates of Community in the Cloud

Blog post info and actions

Blog post body

James Kobielus

Social networks have their foundations in the space-time continuum—you know, the funky coordinate system  that Einstein was so keen about.

Social network analysis is all about looking for patterns of “proximity” among people, considered in their cultural capacities as influencers and followers, innovators and imitators, first-movers and late adopters. Down deep, I consider social network analysis an important new branch of decision support systems as a discipline. The core question is: What unique situational chemistry causes various people, individually or collectively, to make various decisions at various places and times?

That’s where space and time enter the social network analysis equation. It’s not enough that I look up to your shining example and take my lead from what you say and do. It’s just as important that we be in the same city, neighborhood, or room. More than that, it’s important that you and I actually cross paths in order for you to actively influence me to buy that latte, or for you to calm me down and thereby stop me from storming out the door and severing my relationship with a retailer who has ignored my complaints one time too many.

Read more

How Much Intelligence Can You Pack Into a Tweet?

Blog post info and actions

Blog post body

James Kobielus

In the analytics wars, one of the quasi-metaphysical topics I try to avoid is debating the distinctions between “information,” “intelligence,” and “insight.”

Read more

Next Best Models: The Process Agility Equation

Blog post info and actions

Blog post body

James Kobielus

You never know what’s coming at you next, which is why process agility is so important. Your organization must have a ready response for anything. And you must make sure that every process participant can identify, at their level, what that response might be, so they can take appropriate action.

Read more

Social Network Analysis: The Fuse Igniting Enterprise Data Warehouse Growth. It’s Planet Petabyte or Bust!

Blog post info and actions

Blog post body

James Kobielus

Social networks have always been with us, of course, but now they’ve gained concrete reality in the online fabric of modern life.

Social network analysis has, in a real sense, been with us almost as long as we’ve been doing predictive analytics. Customer churn analysis is the killer app for predictive analytics, and it is inherently social. It’s long been known that individual customers don’t always churn themselves—i.e., decide to renew and/or bolt to the competition—in isolation. As they run the continual calculus called loyalty in their heads and hearts, they’re receiving fresh feeds of opinion from their friends and families, following the leads of peers and influencers, and keeping their fingers to the cultural breeze. You could also make a strong case for social networking—i.e., individual behaviors spurred, shaped, and encouraged within communities—as a key independent variable driving cross-sell, up-sell, fraud, and other phenomena for which we’ve long built predictive models.

The other day, a Forrester client was asking me for educated guesses on how fast the average enterprise data warehouse (EDW) is likely to grow over the next several years, and as I was working through the analysis, I couldn’t avoid the conclusion that social network analysis—for predictive and other uses—will be an important growth driver (though not the entire story). I’d like to lay out my key points.

Read more

Advanced Analytics Predictions For 2010

Blog post info and actions

Blog post body

James Kobielus

James G. Kobielus By James Kobielus

As we bid adieu to one decade and move into the next, it’s important to catch our collective breath and to take a quick look ahead. Here are some quick thoughts on the trends that will shape advanced analytics in the year to come. These trends will set the stage for thoroughgoing transformation of business intelligence (BI), data warehousing (DW), predictive analytics (PA), data mining (DM), business activity monitoring (BAM), complex event processing (CEP), and other key analytics technologies in the Teens:

  • Self-service operational BI puts information workers in driver’s seat: Enterprises have begun to adopt self-service BI to cut costs, unclog the analytics development backlog, and improve the velocity of practical insights. Users are demanding tools to do interactive, deeply dimensional exploration of information pulled from enterprise data warehouses, data marts, transactional applications, and other systems.
Read more

Social Network Analysis: Going to Become Too Ubiquitous for Its Own Good

Blog post info and actions

Blog post body

James Kobielus

James G. Kobielus By James Kobielus

Social networks are the future of online life, whether we like it or not. Before the end of the coming decade, relationships with everyone –including family, friends, colleagues, employers, merchants, suppliers, and government agencies—will hinge on your access to these parties, and theirs to you, through online communities of all shapes and sizes.


Social networks are becoming much more pervasive than today’s mass-market communities—such as Facebook, Twitter, and LinkedIn—would lead you to believe. Before long, many will be embedded in the full range of business and personal applications. In ten years’ time, today’s social networks will have evolved into a powerful, seamless worldwide infrastructure for collaboration, sharing, interaction, and transactions. Many will be integral features of the mobile, broadband, and streaming media services that shape business and consumer life.

Read more