Data Scientist: What Skills Does It Require?

Blog post info and actions

Blog post body

James Kobielus

Data scientists are a curious breed. The term encompasses a wide range of specialties, all of which rely on statistical algorithms and interactive exploration tools to uncover nonobvious patterns in observational data.

Who belongs in this category? Clearly, the “quants” are fundamental. Anybody who builds multivariate statistical models, regardless of the tool they use, might call themselves a data scientist. Likewise, data mining specialists who look for hidden patterns in historical data sets — structured, unstructured, or some blend of diverse data types — may certainly use the term. Furthermore, a predictive modeler or any analyst who builds fact-based what-if simulations is a data scientist par excellence. We should also include anybody who specializes in constraint-based optimization, natural language processing, behavioral analytics, operations research, semantic analysis, sentiment analysis, and social network analysis.

But these jobs are only one-half of the data-science equation. The “suits” are also fundamental. Any business domain specialist who works with any of the tools and approaches listed above may consider him- or herself a data scientist. In fact, if one and the same person is a black belt in SAS, SPSS, R, or other statistical tools, and also an expert in marketing, customer service, finance, supply chain, or other business specialties, they are a data scientist par excellence.

Both of these skill sets are fundamental to high-quality data science. Lacking statistical expertise, you can’t understand which are the most appropriate algorithms and approaches to make the foundation of your statistical models. Lacking business domain expertise, you can’t identify the most valid variables and appropriate data sets to build into your models around.

Read more

Social CRM—Don’t Make It a Silo in Your Multichannel Strategy

Blog post info and actions

Blog post body

James Kobielus

Customer relationship management (CRM) has always been a multichannel discipline. You should connect and bond with your customers through whatever mix of channels they use or prefer. That has traditionally meant that existing channels be supplemented and extended by whatever new technologies come along. Hence, any enterprise serious about multichannel CRM has begun to add social media to a strategy that includes point of sale (PoS), direct postal mail, agent-assisted telephony, interactive voice response, e-mail, portals, and text messaging—at the very least. 

Social CRM is the newest craze in this arena. It refers generally to the convergence of social media with CRM. However, in the minds of some observers it seems to imply that social media will somehow become the most important CRM channel of all, marginalizing or obsoleting others. I take issue with that perspective, which threatens to turn social media, in all their billowing multiplicity, into a big new overstuffed silo in the CRM world. 

I don’t deny that social-networking interfaces are all the rage in the CRM space. One obvious case in point is salesforce.com’s Chatter collaboration platform, which looks and feels so Facebook-y that, navigating through it, I half-expect my cousins to be posting new vacation pictures to the community. 

Read more

Self-Service Predictive Modeling: Vendors Still Have Far to Go

Blog post info and actions

Blog post body

James Kobielus

Self-service analytics is one of my core coverage focus areas. It applies not just to business intelligence (BI) but also to advanced analytics. 

When, a few months ago, I uttered the immortal phrase “roll over rocket scientists,” I was referring more specifically to the need for pervasive self-service tools for predictive analytics and data mining (PA/DM). Considering that my recently published Forrester Wave on PA/DM Solutions primarily addressed the traditional requirements of “rocket scientist” experts in statistical analysis, I did not put a huge emphasis on data mining features geared to business analysts, subject matter experts, and other “non-technical” information workers. 

As I’ve stated in that blogpost and the follow-on podcast, the core problem with today’s PA/DM offerings is that many of them are power tools, not solutions that have been designed for the mass business market. Vendors such as SAS Institute, IBM/SPSS, KXEN, Oracle, Portrait Software, Angoss, FICO, and TIBCO Spotfire provide data mining specialists with feature-rich algorithm-powered solutions for modeling, scoring, regression, and other core PA/DM functions. Their core, traditional user base consists of statisticians, mathematicians, and other highly educated analytics professionals. 

Read more

Social, Spatial, & Temporal: The Coordinates of Community in the Cloud

Blog post info and actions

Blog post body

James Kobielus

Social networks have their foundations in the space-time continuum—you know, the funky coordinate system  that Einstein was so keen about.

Social network analysis is all about looking for patterns of “proximity” among people, considered in their cultural capacities as influencers and followers, innovators and imitators, first-movers and late adopters. Down deep, I consider social network analysis an important new branch of decision support systems as a discipline. The core question is: What unique situational chemistry causes various people, individually or collectively, to make various decisions at various places and times?

That’s where space and time enter the social network analysis equation. It’s not enough that I look up to your shining example and take my lead from what you say and do. It’s just as important that we be in the same city, neighborhood, or room. More than that, it’s important that you and I actually cross paths in order for you to actively influence me to buy that latte, or for you to calm me down and thereby stop me from storming out the door and severing my relationship with a retailer who has ignored my complaints one time too many.

Read more

UpLeveling the Conversation: Social CRM Summit Brings Experts and Social Business Insights Together

Blog post info and actions

Blog post body

Natalie Petouhoff

#SCRM (the hash our group uses to communicate on Twitter) group embodies the very essence of what social media is about: genuine authentic, direct and real conversations. Being a participant and a practitioner, I thought I would share my observations and thoughts... not just at this conference, but what I have seen in the actions and behaviors of this group over the past year or more... And these foreshadow a world that is being created right now as you are reading this...

Read more

Advanced Analytics Predictions For 2010

Blog post info and actions

Blog post body

James Kobielus

James G. Kobielus By James Kobielus

As we bid adieu to one decade and move into the next, it’s important to catch our collective breath and to take a quick look ahead. Here are some quick thoughts on the trends that will shape advanced analytics in the year to come. These trends will set the stage for thoroughgoing transformation of business intelligence (BI), data warehousing (DW), predictive analytics (PA), data mining (DM), business activity monitoring (BAM), complex event processing (CEP), and other key analytics technologies in the Teens:

  • Self-service operational BI puts information workers in driver’s seat: Enterprises have begun to adopt self-service BI to cut costs, unclog the analytics development backlog, and improve the velocity of practical insights. Users are demanding tools to do interactive, deeply dimensional exploration of information pulled from enterprise data warehouses, data marts, transactional applications, and other systems.
Read more

Podcast: Instrumenting Your Enterprise For Maximum Predictive Power

Blog post info and actions

Blog post body

James Kobielus

Our latest featured podcast is Jim Kobielus'"Instrumenting Your Enterprise For Maximum Predictive Power".

In this podcast, BP&A Senior Analyst Jim Kobielus discusses how best to leverage your company’s predictive investments. He also lays out a high level framework to assess your predictive analytics maturity.







We look forward to your questions and comments.



---

Read more

Social Network Analysis: Going to Become Too Ubiquitous for Its Own Good

Blog post info and actions

Blog post body

James Kobielus

James G. Kobielus By James Kobielus

Social networks are the future of online life, whether we like it or not. Before the end of the coming decade, relationships with everyone –including family, friends, colleagues, employers, merchants, suppliers, and government agencies—will hinge on your access to these parties, and theirs to you, through online communities of all shapes and sizes.


Social networks are becoming much more pervasive than today’s mass-market communities—such as Facebook, Twitter, and LinkedIn—would lead you to believe. Before long, many will be embedded in the full range of business and personal applications. In ten years’ time, today’s social networks will have evolved into a powerful, seamless worldwide infrastructure for collaboration, sharing, interaction, and transactions. Many will be integral features of the mobile, broadband, and streaming media services that shape business and consumer life.

Read more

Instrumenting Your Enterprise for Maximum Predictive Power

Blog post info and actions

Blog post body

James Kobielus

James G. Kobielus By James Kobielus

Business is all about placing bets and knowing if the odds are in your favor.

As I noted in my most recent Forrester report, business success depends on your company being able to visualize likely futures and take appropriate actions as soon as possible. You must be able to predict future scenarios well enough to prepare plans and deploy resources so that you can seize opportunities, neutralize threats, and mitigate risks.

Read more

How To Differentiate Advanced Data Visualisation Solutions

Blog post info and actions

Blog post body

Boris Evelson

Boris Evelson By Boris Evelson

I get many inquiries from clients on how to select a data visualization vendor / solution. The criteria that my clients often site are

  • Thick and thin client
  • Dynamic visualizations, not just static charts 
  • Ability to pull data from multiple sources
  • OLAP-like functionality

All these criteria are pretty much a commodity these days. The real differentiation will come once you start looking at advanced (key word "advanced") visualization features such as

Read more