The Forrester Blog For Information & Knowledge Management Professionals

Business intelligence

June 09, 2009

BI Mashup Maturity Model? Oxymoron? Au Contraire Mon Frère!

By James Kobielus

In one of my recent tweets, I commented that Forrester has developed a maturity model for enterprise adoption of mashup-style, self-service development of business intelligence (BI) applications. Indeed, we have, and it will appear in my forthcoming Forrester report, “Mighty Mashups: Do-It-Yourself Business Intelligence for the New Economy.”

Another tweeter--an astute, but sadly, non-Forrester BI analyst--scoffed that “BI mashup maturity model” is an oxymoron. Respectfully, I must disagree. Enterprises are adopting self-service BI approaches for many reasons--principally, to cut costs in a tight economy, to unclog the development backlog, and to speed delivery of actionable, targeted intelligence to decision makers. Also, companies are providing users with BI tools to do interactive, deeply dimensional exploration of information pulled from enterprise data warehouses (EDW), marts, cubes, transactional applications, and other systems. Furthermore, organizations everywhere have adopted browser-oriented BI environments that leverage the full Web 2.0 interactivity and collaboration.

Sitting at the convergence of those trends is BI mashup, which Forrester sees as the new paradigm for truly pervasive decision-support systems. What throws off some people is the term “mashup,” which sometimes gets pigeonholed as simply referring to using, say, Google Maps to display geocoded performance metrics and sundry Internet-sourced data in a browser-based dashboard. Yes, BI mashup encompasses that approach to presenting and integrating diverse data, but its application is much broader.

Just as important, BI mashup is not bleeding-edge. Rather, BI mashup leverages the in-memory BI clients, semantic virtualization layers, data federation middleware, automated data discovery, and other next-generation BI tools and platforms.

No one vendor or user has yet put together an end-to-end BI environment that is entirely focused on mashup-style self-service development. However, Forrester sees the BI industry converging toward as mashup-oriented architecture over the coming 2-3 years. With that in mind, we sketched out a BI maturity model that encompasses the following four levels (the first 3 of which are represented in case studies in the upcoming report):

  • Level 1: Lightweight presentation mashup against transactional applications: This basic maturity level is for companies that have no prior BI or EDW; have little in-house BI expertise; and are comfortable with allowing casual users to use their browsers to customize parameterized reports from data from packaged business applications.                                                                
  • Level 2: Deep presentation mashup against EDW: This level is for organization that do have prior BI and centralized EDWs, but have an understaffed BI development group and/or  power users and data modelers urgently require the ability to mashup and explore historical and current data within sophisticated BI workspaces.
  • Level 3: Full BI mashup in federated environment: This level is for organizations that have decentralized, dynamic data management environments, and have the expertise to design reusable, composite data services to seamlessly mashup internal and external information.
  • Level 4: Full collaborative mashup with IT governance: This level is for organizations that want to encourage subject  matter experts and operational users to collaborate on analytics created through mashup, but who are also concerned that all mashups be controlled, governed, and monitored in accordance with enterprise policies and best practices.

As I said, it will take a few years before we see a substantial number of enterprise case studies that implement the pinnacle of collaborative mashup with tight governance. Nevertheless, when you follow the evolution of next-generation solution portfolios from leading BI vendors such as SAP, IBM, Microsoft, and others, it’s clear that self-service user-centric mashup, to varying degrees, is a core theme.

BI mashup has such a strong business case that we’re confident it’s more than simply a “down economy” theme. It will almost certainly grow in importance for information and knowledge management professionals as the economy improves.

May 26, 2009

Database Religions Dissolve Into The Big Billowing Virtual Data Cloud

James-Kobielus By James Kobielus

Virtualization is a venerable old computing concept that has achieved new life in recent years.

Virtualization brings to life a new world of more flexible service provisioning while cleverly emulating the old world that is being replaced. Virtualization refers to any approach that abstracts the external interface from the internal implementation of some service, functionality, or other resource.

The promise of virtualization is that, no matter how scattered and diverse, all pooled resources behave as if they were a single unified resource, both for usage and administration. In a sense, this is the practical magic that Arthur C. Clarke identified with advanced technology. The external interface may conceal various facts about the implementations of the underlying resources. The virtualized resources may:

•    run on diverse operating and application platforms;
•    have been deployed on nodes in diverse locations;
•    have been aggregated across diverse hosting platforms (or partitioned within a single hosting platform, either through virtual machine software, separate CPUs, or separate blade servers); and have been provisioned dynamically in response to a client request.

When Noel Yuhanna and I presented on enterprise database virtualization last week at Forrester IT Forum, we took pains to point out that is not a radically new paradigm. In fact, database administrators (DBAs) have been doing virtualization for a long time and not realizing it. We’re all familiar with such database virtualization approaches as policy-based server clustering, massive parallel processing database grids, and enterprise information integration. In these environments, you can identify the virtualization layer as “single system image,” “semantic abstraction,” or some other approach.

What all these approaches share is that they make two or more repositories behave as if they were a single database for unified access, query, reporting, predictive analytics, and other applications. If you wish, I could drill down further into the layers of database virtualizationdata virtualization, transaction virtualization, and platform virtualizationbut that would be too much for a mere blog post.

One twist that I didn’t have time to explore in depth last week is the notion that the traditional hub-and-spoke enterprise data warehousing (EDW) architecture is itself a form of database virtualization. The hub-and-spoke model transforms analytic data to a common “spoke-side” semantic access model, such as star schema or columnar. As such, this approach abstracts from the data models (usually 3NF relational) implemented at the EDW hub tier, the staging tier (perhaps file-based), and OLTP sources (perhaps hierarchical, XML, or what have you).

When you realize that each data-persistence approach has its optimal deployment sphere, you’re thinking database virtualization. At that point, you start to realize that the various database religionsrelational is supreme, columnar is king, and so forthare not absolute truths. They’re simply sectarian texts in a tradition of longer vintage: the evolution of truly all-encompassing data virtualization clouds.

Yes, I’m using “cloud” in this context because it best describes this new paradigm. Cloud-based virtualization is beginning to seep into analytic infrastructures. To support flexible mixed-workload analytics, the EDW, over the coming five to 10 years, will evolve into a virtualized, cloud-based, and supremely scalable distributed platform.

What are the outlines of this new paradigm? The virtualized EDW will allow data to be transparently persisted in diverse physical and logical formats to an abstract, seamless grid of interconnected memory and disk resources and to be delivered with sub-second delay to consuming applications. EDW application service levels will be ensured through an end-to-end, policy-driven, latency-agile, distributed-caching and dynamic query-optimization memory grid, within an information-as-a-service (IaaS) environment. Analytic applications will migrate to the EDW platform and leverage its full parallel-processing, partitioning, scalability, and optimization functionality. At the same time, DBAs will need to make sure that cloud-based DW offerings meet their organizations’ most stringent security, performance, availability, and other service-level requirements.

I won’t opine here and now on how much enterprise data will be persisted in public clouds vs. private environments that incorporate many of the same platform virtualization technologies. I’ll save that discussion for the upcoming Forrester reports that Noel and I are developing in virtualization of transactional and analytic databases, respectively.

Expect those in Q3 or thereabouts. Thanks everybody who attended our preso last week in Vegas!

May 16, 2009

Information Post-Discovery - Latest BI Trend

Boris-Evelson By Boris Evelson

I just came back from an exciting week in Orlando, FL, shuttling between SAP SAPPHIRE and IBM Cognos Forum conferences. Thank you, my friends at SAP and IBM for putting the two conferences right next to each other (time- and location-wise), and for saving me an extra trip!

Both conferences showed new and exciting products and both vendors are making great progress towards my vision of “next generation BI”: automated, pervasive, unified and limitless.  I track about 20 different trends under these four categories, but there’s a particular one that is especially catching my attention these days. It went largely under covers at both conferences, and I was struggling with how to verbalize it, until my good friend and peer, Mark Albala, of http://www.info-sight-partners.com, put it in excellent terms for me in an email earlier today: it’s all about “pre-discovery” vs. “post-discovery” of data.

We can debate endlessly the pros and cons of traditional row oriented RDBMS vs. newer DBMS architectures specifically designed for BI and OLAP (like columnar – Sybase IQ, Vertica, Paraccel,  inverted index – Microsoft FAST Search, Endeca, Attivio, tokenized – illuminate, and in-memory analytical DBMS – TIBCO Spotfire, QlikTech) – and I had lots of fun doing that on the DM Radio show last Thursday!  One thing, however, remains undisputed: traditional RDBMS and OLAP architectures require pre-discovery of data, aka data integration and data modeling. No matter how much flexibility and richness we think we built into out relational or multidimensional data models, they are still only as good as our initial design. If we did not anticipate the types of questions that would be asked of our application in the future, no fixed relational or multidimensional data models will be able to help us.

But the world moves way too fast on us. For example, the methodology behind economic capital calculation in the financial services industry, according to Basel II requirements, may change on a weekly, sometimes even on a daily, basis due to regulatory and competitive pressures. No traditional data models and BI tools can keep up with such furiously quickly changing requirements. As a result, one of our recent surveys found that more than half of the respondents did not have most of the information they were looking for in their BI applications, and close to two thirds relied on IT for new BI requests.

What’s the answer? There are many, but one partial answer is post-discovery, rather than pre-discovery of data. For example, an inverted index DBMS from Attivio, or a tokenized data store from illuminate, or in-memory models from TIBCO Spotfire and QlikTech just need you to index or loading data "as is", not really requiring any modeling up front. And because all of these technologies can indeed cross reference every attribute with every other attribute (it's an index!), a virtual data model is created on the fly simply by virtue of asking it a question. Gone are the days of having to analyze your requirements, document your requests, work with IT to make it happen - a process that often takes weeks or months.

Sounds good? It does, but this is obviously not a BI panacea. Yet. I do not think we will see a mass conversion to these analytical engines that allow for post-discovery of information and data models in the short term. Why?

  • Many challenges still remain with these technologies such as lack of operational BI (you typically need to reload entire model or rebuild entire index to make updates), administration (partitioning, and modular backup and restore are not easy tasks with index DBMS today), and other mission critical, production features of large enterprise DBMS.
  • All these new technologies still rely on someone else doing all of the upfront leg work to integrate, reconcile and cleanse the data. There is no magical work around the hard work of planning, designing and implementing data quality and master data management processes and applications.

How’s all this related to SAP and IBM, you ask? Simple.

  • SAP is heavily promoting its guided searchengine – Explorer – previously known as Polestar. It’s a similar index to Endeca and Microsoft FAST Search, capable of allowing BI users discovering answers to previously unplanned questions. Unlike Attivio, though, Explorer still requires an underlying data model, either in the form of Business Objects Universe or SAP BW, but because under the covers it uses the SAP TRex index engine, the right technology is there, and it’s a huge step in the right direction for SAP.
  • IBM is also leading the market here. While its Cognos GoSearch product allows for some guided exploration too, it’s the acquisition of Exeros a couple of weeks ago that gives IBM unique capabilities (with some competition from Composite Software and Sypherlink) to post-discoverdata, data relationships and continuously update data models and data content based on the newly discovered information. I can't wait till IBM comes up with a fully automated way to update the data model, Cognos Framework Manager, and Cognos reports and dashboards based on a source system change!

This is yet another proof – and I’ll never get tired of saying this – that BI market is as vibrant, exciting and far from commoditization as ever!

May 05, 2009

Self-Service Business Intelligence Depends on Automated Data Discovery

James-Kobielus  By James Kobielus

If you tuned into my Forrester teleconference yesterday, you heard me discuss the end-to-end infrastructure necessary to fully support mashup-style self-service business intelligence (BI).

One of the key features for BI mashup is automated source-data discovery, which spares information workers from having to find new data sources or fresh updates from existing sources. Instead, the user simply relies on the BI and back-end data virtualization infrastructure to perform these critical activities as ongoing background tasks. Once new sources and feeds are discovered, transformed to a common semantic model, and published to a BI-mashup registry, all the user needs to do is drag and drop them visually into their mashed-up reports, dashboards, and other analytics.

Automated discovery is not only key to BI mashup, but to trustworthy data as well, because it helps detect and remediate anomalies across disparate data sources. Only a few vendors on the market today provide strong features for automated source discovery. One of them is Composite Software, which recently released an appliance that performs these functions. Another is Exeros, which is the closest thing to an automated-data-discovery pure-play in the market today.

Or, rather, was the closest thing, until IBM announced this morning that it is acquiring Exeros. I’ve been following Exeros for several years and have long considered them a strong candidate for acquisition by a leading BI, data warehousing (DW), data integration (DI), or data quality (DQ) vendor. On IBM’s part, this acquisition makes great sense as a complement to its InfoSphere and Optim portfolios on the data management and governance side of the house.

It will also fit nicely with IBM’s Cognos portfolio as a key enabler, potentially, for BI self-service mashup. As I stated on my teleconference, some vendors are further ahead on putting together a completely mashup-enabling end-to-end BI solution, and Cognos is among them. You can download the teleconference slides from Forrester’s website, listen to my streaming audio, and/or wait for my forthcoming report for more in-depth thoughts on this topic.

Now the ball’s in IBM’s rivals’ courts regarding whether, when, and how they plan to add automated source discovery to their BI portfolios.

April 17, 2009

Free BI Is Still No Free Lunch

Boris-Evelson By Boris Evelson

In my recent BI Belt Tightening For Tough Economic Times document I explored a few low-cost alternatives to traditional, mainstream, and typically relatively expensive Business Intelligence (BI) tools. While some of these alternatives indeed were a fraction of a cost of a characteristic large enterprise BI software license, there were even fewer truly zero cost options. But there were some. For example, you can:

  • Leverage and use no-cost bundled BI software already in-house.Small departments and workgroups may be able to leverage BI software that comes bundled at no additional cost with BI appliances, database management systems (DBMSes), and application licenses. You can consider using these few free licenses from Actuate, IBM Cognos, Information Builders, Jaspersoft, Microsoft, MicroStrategy, Panorama, Pentaho, and SAP Business Objects for additional functions such as testing, QA, and prototyping. While these few free licenses are just a drop in the bucket in a typical large enterprise BI license requirements, do look around and don’t waste money on BI products you may already have.

  • Use free Open Source BI components if you are a developer, since the source code is freely and easily available. Access to source code that makes the software easy to embed in other software solutions, so Open Source (OS) is also popular with independent software vendors (ISVs). However, OS BI as an enterprise grade BI solution, is not truly free. Even though individual reporting, OLAP, and even ETL components are available for free, most open source vendors sell add-on products that tie all components into a more fully functional BI application, such as specific dbms and ERP application connectors, GUI administration utilities and others

Now MicroStrategy is taking the concept of free BI one step further. Today MicroStrategy announced that it is making its Release 9software available for up to 100 users/consumers of info (1 CPU, 2 developers, 2 power users, online support only) at no cost! Now that’s a whole different story. Now we are talking about not just individual components, but a complete BI solution. We are also not talking about a small use case for just testing groups or QA or prototyping. With 100 users you can roll a BI application out for free to a whole department in a large enterprise or to a small business. Is this the beginning of a trend? Between OS BI and MicroStrategy new direction, I can’t help but to ponder: is BI (and other enterprise applications) going to go the way of printers or cell phones that are being given away almost at cost just to be able to hook and reel in the customers for supplies and service?

Free lunch And is it really free? Unfortunately not - it won’t be that easy for a while. As you saw recently in my 80/20 rule blog, typical BI costs involve so much services and so many indirect costs, that I do not see a truly free BI lunch in the near future.

April 08, 2009

Dislocation Intelligence In A Brutal Economy

James-Kobielus By James Kobielus

It's painful to see the auto, newspaper, construction, financial services, and so many other formerly vibrant sectors of the world economy go down the proverbial tubes. One of the most nauseating realities is when millions of people lose their jobs, homes, and communities in a seeming blink.

I grew up in the perpetually recessionary Detroit area. I'm attuned to the regional dislocations that come from depending too much on an industry that has seen better days. Abandoned storefronts, dilapidated housing, vacant lots, tumbleweed-quiet city streets--all of it evidence of a growing ghost town, telltale signs of a marginal economy that depends on government programs, private charity, and low-wage service jobs. During my college years, I was a policy analyst with an urban coalition in downtown Detroit, and I could see that the slide was long-term and nigh irreversible.

Even in relatively well-off areas such as Washington DC, where I've spent close to a quarter-century, we’re not immune to serious economic dislocations. The National Capital Region, so reliant on federal spending, is likely to feel the brunt of whatever cuts Obama will almost certainly make to close this massive deficit. And even here you can’t escape dislocations in sectors that we all depend on, such as retailing, as evidenced by, for example, the ex-Circuit City big boxes that seem even bigger and boxier now that they’re totally empty.

In recent years, corporations have adopted location intelligence solutions to support their market-entry strategies, such as adding new stores and waging marketing campaigns. They also use these tools--essentially, geographic information systems coupled with predictive modeling--to optimize their footprint in existing markets. And to a lesser extent, they also use location intelligence to plan their exit strategy of closings and retrenchments. But when departures are hasty--such as any Chapter 11 proceeding--all we’re left with are vast tracts of vacant real estate, plus many formerly employed people who must find a way to survive amid ruins. Imagine how a General Motors or Chrysler bankruptcy will hit Detroit--it will be a liquidation to rival Hurricane Katrina in its devastation of a major city.

Dislocation intelligence is something that the leaders of the auto companies--and the Obama administration--should exercise when making the tough decisions to restructure this critical industry. People’s pain should be factored into decisions to close and relocate plants, so that, for example, whole cities--such as Flint, Toledo, and Janesville--can make a smooth exit from over-reliance on this one industry. As part of that effort, urban planners should consider the infrastructure that remains behind, so that, for example, whole regions of a city are not suddenly deprived of hospitals, grocery stores, and other basic amenities. Imagine you’re on welfare and have to somehow go 10 miles to buy a loaf of bread.

Mapping tools are just tools, not salvation. Detroit long ago passed a point of no return. The US auto industry will never recover the manufacturing jobs that attracted people from all over the world to southern Michigan, northern Ohio, and other regions.

But we as a country should try to smooth over these economic dislocations so that they don’t completely wipe some places off the map.

April 01, 2009

Inmon’s Vitriolic Slap At “Virtual Data Warehousing” Does Not Withstand Scrutiny

James-Kobielus By James Kobielus

In a recent article, Bill Inmon incinerates a straw man concept that he refers to as “virtual data warehousing (DW).” For those unfamiliar with Inmon, he is generally considered the founder of DW as a data management discipline, has been at it since the 70s, and has more published books and articles to his name than most mortals. So he clearly may be considered an authority on the topic of DW.

But methinks Mr. Inmon doth protest too much on this “virtual DW” bugaboo, however defined (we’ll get to that in a moment). Also, he attacks this concocted notion with such emotional vehemence that it’s clear he considers it a threat to the centralized EDW paradigm upon which he has built his career and reputation.

For starters, his definition of this concept is oddly vague and questionably narrow: “a virtual data warehouse occurs when a query runs around to a lot of databases and does a distributed query.” Essentially, Inmon defines “virtual DW” as the ability to a) farm out a query to be serviced in parallel by two or more distributed databases, b) aggregate and join results from those databases, and c) deliver a unified result set to the requester.

That’s an important query pattern, but not the only one that should be supported under (pick your quasi-synonym) data federation, data virtualization, or enterprise information integration (EII) architectures. Inmon’s definition excludes the many federated queries that may only hit on a single database, with no joins and results aggregation, and with the EII fabric handling the necessary on-demand transformation from that source’s schema to an abstract semantic model.

Per my data federation report from last fall, Forrester has a broader perspective on the topic than does Mr. Inmon. Data federation is any on-demand approach that queries information objects from one or more sources; applies various integration functions to the results; maps the results to a source-agnostic semantic-abstraction model; and delivers the results to requesters. Nothing in the scoping of data federation necessarily requires the multi-source aggregation and joining that Inmon puts at the heart of “virtual DW.”

Putting Inmon’s narrow scoping of “virtual DW” behind us for the moment, let’s consider his chief objections to this approach. First, it requires the “analyst to integrate data” (as if that’s something analysts are ill-suited for or regard as some inordinate burden). Second, it consumes resources, experiences suboptimal performance, and “shuffles a lot of data around the system that otherwise would not need to be moved” (as if centralized DWs don’t consume resources, experience performance bottlenecks, and move data). Third, it is “limited to the [historical] data found in the [source] databases.” Fourth, it suffers from “no reconcilability of data...[hence] no single version of the truth for the corporation.”

It’s a fairly straightforward matter to dispatch these objections:

First, data integration--through ETL, EII, and other approaches--is a core job function for DW professionals, not some alien function outside their core competency.

Second, data federation is often the optimal approach for low-latency BI (just check out the case studies in my data federation and really urgent analytics reports). Federated environments can be tuned to provide top-notch performance and minimize source-system impacts when “shuffling” data around in a decentralized fabric.

Third, the source databases in a federation environment often include DWs, which, per their core function, usually manage a considerable amount of historical data. Once again, see my data federation report with discussion of case studies for a) Federation of Local DWs via Centralized EII Infrastructure and b) Federation of Dispersed EDW and ODS Data Into Siloed BI Environments.

Fourth, data federation is not totally incompatible with data reconciliation. In fact, federation environments can be architected for single version of the truth, data governance, and master data management. However, it can indeed be tricky to manage data quality in federated environments (see Rob Karel’s coverage of MDM and DQ for a deep dive on that issue).

My basic objection to Inmon’s line of discussion is that he treats data federation as mutually exclusive from the enterprise DW (EDW), when in fact they are highly complementary approaches, not just in theory but in real-world deployments. Yes, data federation can be deployed as an alternative to traditional EDWs, providing direct interactive access to online transactional processing (OLTP) data stores. However, data federation can also coexist with, extend, virtualize, and enrich EDWs, as well as other data-persistence nodes such operational data stores (ODS) and online analytical processing (OLAP) data marts. The case studies in the cited reports bear that out.

Inmon’s arguments are worth consideration. The centralized EDW model he touts is useful for illuminating some traditional best practices. But by no means can it do justice to the stubbornly heterogeneous, distributed, mixed-latency BI and DW requirements of most enterprises.

March 22, 2009

After So Many Years Of Ballyhoo, Semantic Web Still Searching For Killer App

James-KobielusBy James Kobielus

Cynics might call Semantic Web a technology looking for a solution. And they might have a point.

Semantic Web refers to a long-running World Wide Web Consortium (W3C) initiative that is working toward an ambitious--some might say hopelessly Utopian--goal. At heart, it is a vision for how the World Wide Web should evolve to realize its full interoperability potential.

People vary widely in how they interpret the scope of the Semantic Web initiative. The tech industries are swarming with a wide range of projects, products, and tools that implement different variants of this vision. What vision is that? In the broadest sense, Semantic Web refers to an all-encompassing metadata, description, and policy layer that can, potentially, support universal, automatic, comprehensive end-to-end interoperability across every macro or micro entity—including data, components, services, applications, and services—on every conceivable level.

Whew!!! If that’s not the working definition of “pie in the sky” or “boil the ocean” (pick your metaphor), I don’t know what is. In fact, I’m hard-pressed to refer to Semantic Web as a definable market or solution segment. However, it’s not entirely vacuous.

For starters, organizations can implement W3C-developed semantic description standards—such as Resource Description Framework (RDF) and Web Ontology Language (OWL)--to make the meaning of content unambiguously comprehensible to services, applications, bots, and other automated components. Second, there is a reasonably robust market for “ontology” tools to support RDF/OWL-based modeling of application semantics. Finally, there is some incremental adoption of these tools and concepts in established IT segments, such as:

  • Enterprise content management (ECM): Semantic approaches can support more powerful discovery, indexing, search, classification, commentary, and navigation across heterogeneous stores of unstructured and semi-structured content. Semantic search—driven by concepts, not mere text strings--is regarded by some as a primary Semantic Web application. Indeed, many Semantic Web vendors are primarily implementing the technology in search engines that leverage ontology-based concepts to improve search accuracy and reduce spurious hits.

  • Enterprise information integration (EII):Semantic approaches enable consolidated viewing, query, and update of structured data that has been retrieved from diverse sources. Indeed, most commercial EII environments present an abstract semantic layer that mediates access to heterogeneous data, such as enterprise resource planning and customer relationship management applications, converging it all to a common presentation-side schema. A handful of those EII vendors have begun to support Semantic Web standards, primarily through third-party software plug-ins

  • Enterprise service bus (ESB):Semantic approaches can facilitate multilayered application, process, and service interoperability across disparate environments. To date, there has been little production implementation of Semantic Web standards in the ESB arena, though some vendors have adopted semantics, ontologies, and RDF to describe the conceptual models implemented by application endpoints, agents, and intermediary nodes within ESB-like middleware approaches such as event stream processing.

But Semantic Web approaches are still on the periphery of these markets. 10+ years into its inception, Semantic Web still has no clear killer app. It’s not clear if or when that app will emerge.

March 20, 2009

Lean Information Management Strategies for Lean Times

James-KobielusBy James Kobielus

When the going gets tough, the tough get lean, focused, and flexible. To help organizations survive the bad times and thrive in all climates, their information management initiatives must remain agile and adaptable.

If you feel your information management strategy is anything but lean, you’re not alone. Many organizations struggle to gain control over information infrastructures that have become too bloated, rigid, and slow to realign with new business drivers.

Lean information management practices are essential for corporate survival. They are far more than belt-tightening exercises. They also help you build analytic muscle for excelling in any business environment. Here are some basic pointers for keeping your information management strategy lean:

  • Trim your information infrastructure of excess cost. Lean means you should cut excessive, budget-busting overhead from your information management environment. Careful cuts are best, because they optimize your existing operations without gutting the core information, analytics, and applications that underpin your core competencies. Silo, server, database, and application consolidation should be your principal approaches. Also, you should re-evaluate vendor-sourcing strategies and renegotiate licenses at more favorable terms. And you should investigate lower-cost alternatives, such as software-as-a-service, to address business intelligence, business performance solutions, enterprise data warehousing, master data management, enterprise content management, and other information management requirements.
  • Fit information initiatives to key business imperatives. Lean also means you fit, focus, and fully align your information management initiatives to mission-critical business imperatives. Strategic alignment ensures that you leverage information assets across diverse application domains and business processes, rather than allow that intelligence to languish underutilized in silos. To sustain this approach, you should establish an information management framework, such as a Business Intelligence Solution Center, that enables ongoing collaboration between business and IT stakeholders. You should engage all key business and technical groups in information management planning discussions.
  • Flex information architectures to changing circumstances. Finally, lean means maintaining an approach that is flexible and adaptable, able to shift course as your needs and environment change. In yoga terms, lean is all about building, toning, and stretching analytical muscle to keep it from tearing when you need to transition rapidly from one strategic alignment to the next. You need the flexibility to swing between centralized information management infrastructures and decentralized or federated environments. For end-to-end data management environments, Forrester has developed an architecture decision support tool that helps information managers to determine which of several topologies is best suited to their needs: centralized enterprise data warehouse, hub-and-spoke, independent data marts, data federation, and information-as-a-service.

Considered as a comprehensive strategy, these lean practices are true bloat-busters and recession-beaters. They allow organizations to deliver practical insights that address all pain points, even--especially!!!--within strict budgets.

March 18, 2009

Are There BI Implications In The Rumored IBM/Sun Merger? You Betcha!

Boris-Evelson By Boris Evelson

I always predicted that Open Source BI has to reach critical mass before it becomes a viable alternative for large enterprise BI platforms. All the individual components (a mixture of Open Source BI projects and commercial vendor wrappers around them) are slowly but surely catching up to their bigger closed source BI brothers. Talend and Kettle (a Pentaho led project) offer data integration components like ETL, Mondrian and Palo (SourceForge projects) have OLAP servers, BIRT (an Eclipse project), Actuate, Jaspersoft and Pentaho have impressive reporting components, Infobright innovates with columnar dbms well suited for BI, and productized offerings from consulting companies like European based Engineering Ingegneria Informatica – SpagoBI – offer some Open Source BI component integration.

However, even large closed source BI vendors that acquired multiple BI components over the years still struggle with full, seamless component integration. So what chance do Open Source BI projects and vendors with independent leadership structure and often varying priorities have for integrating highly critical BI components such as metadata, data access layers, GUI, common prompting/sorting/ranking/filtering approaches, drill-throughs from one product to another, etc? Today, close to none. However, a potential consolidation of such products and technologies under one roof can indeed create a highly needed critical mass and give these individual components a chance to grow into large enterprise quality BI solutions.

Who are the potential consolidators? Red Hat with its JBoss and Metamatrix, critical BI integration components would make sense. And/or Sun with its GlassFish app server, NetBeans integration components, and MySQL, a small, but growing option for DW platform would probably make an even better acquirer. Now, the recent rumor that IBM may be in M&A talks with Sun is throwing a wrench into my well oiled engine of prediction logic. This would be an interesting twist with lots of implications for IBM such as reconciling its WebSphere line of products with Sun’s GlassFish and NetBeans, and reconciling its InfoSphere line of products with Sun’s MySQL

If, and only if, IBM decides that such two-pronged product strategy – open and closed source – makes sense for them, then I can theoretically see IBM becoming the consolidator for Open Source BI products. It can then potentially leverage its resources and subject matter expertise from Cognos to build up and position both open source and closed source BI offerings targeted at specific client bases. But the challenges of developing, marketing, positioning and selling two families of highly overlapping and competing BI products will be huge!

If such future is not in IBM plans, then my hopes for the bright Open Source BI future and bets are on Red Hat.

Enter your email address:

Delivered by FeedBurner

Search this blog