Target.com Crash Not A Site Failure But A Failure To Forecast

Blog post info and actions

Blog post body

Brian Walker

Much has been made over the last two days of the Target.com crash and availability issues following its Missoni promotion that launched on Tuesday. Coming on the heels of the site’s relaunch on IBM Websphere Commerce after many years on the Amazon enterprise solution led many to speculate that there was a serious mistake made in the building of the new site. But this is not a failure of the new site, this is a failure to forecast.

When Target's marketers planned the media blitz and direct marketing around this Missoni promotion, launching the collection with great fanfare — and at the same time online and in-store— they likely planned on a web traffic response and resulting sales of 2-3x normal. That would be a terrific outcome, no doubt justifying the expense and effort of the multichannel promotion. I am sure no one forecasted a 10x normal response, and peak traffic 3x Black Friday traffic peaks. And beyond just the click-through, the consumer behavior on the site after click-through was unusual. This was not a product they were promoting, but a collection, and consumers were placing many, many items in their cart, well beyond the typical 1-2 items per cart a B2C site like Target's would expect.

Read more

Lies, Damned Lies, And Statistics . . . And Benchmarks

Blog post info and actions

Blog post body

Richard Fichera

I have been working on a research document, to be published this quarter, on the impact of 8-socket x86 servers based on Intel’s new Xeon 7500 CPU. In a nutshell, these systems have the performance of the best-of-breed RISC/UNIX systems of three years ago, at a substantially better price, and their overall performance improvement trajectory has been steeper than competing technologies for the past decade.

This is probably not shocking news and is not the subject of this current post, although I would encourage you to read it when it is finally published. During the course of researching this document I spent time trying to prove or disprove my thesis that x86 system performance solidly overlapped that of RISC/UNIX with available benchmark results. The process highlighted for me the limitations of using standardized benchmarks for performance comparisons. There are now so many benchmarks available that system vendors are only performing each benchmark on selected subsets of their product lines, if at all. Additionally, most benchmarks suffer from several common flaws:

  • They are results from high-end configurations, in many cases far beyond the norm for any normal use cases, but results cannot be interpolated to smaller, more realistic configurations.
  • They are often the result of teams of very smart experts tuning the system configurations, application and system software parameters for optimal results. For a large benchmark such as SAP or TPC, it is probably reasonable to assume that there are over 1,000 variables involved in the tuning effort. This makes the results very much like EPA mileage figures — the consumer is guaranteed not to exceed these numbers.
Read more

Complex Event Processing And IT Automation

Blog post info and actions

Blog post body

Jean-Pierre Garbani

Events are, and have been for quite some time, the fundamental elements of IT infrastructure real-time monitoring. Any status changed, threshold crossed in device usage, or step performed in a process generates an event that needs to be reported, analyzed, and acted upon by IT operations.

Historically, the lower layers of IT infrastructure (i.e., network components and hardware platforms) have been regarded as the most prone to hardware and software failures and have therefore been the object of all attention and of most management software investments. In reality, today’s failures are much more likely to be coming from the application and the management of platform and application updates than from the hardware platforms. The increased infrastructure complexity has resulted in a multiplication of events reported on IT management consoles.

Over the years, several solutions have been developed to extract the truth from the clutter of event messages. Network management pioneered solutions such as rule engines and codebook. The idea was to determine, among a group of related events, the original straw that broke the camel’s back. We then moved on to more sophisticated statistical and pattern analysis: Using historical data we could determine what was normal at any given time for a group of parameters. This not only reduces the number of events, it eliminates false alerts and provides a predictive analysis based on parameters’ value evolution in time.

The next step, which has been used in industrial process control and in business activities and is now finding its way into IT management solutions, is complex event processing (CEP). 

Read more

Really end to end management: Gomez and Compuware

Blog post info and actions

Blog post body

Jean-Pierre Garbani

The marriage of Gomez and Compuware is starting to bear fruits. One of the key aspects of web application performance management is end user experience. This is approached largely from the data center standpoint, within the firewall. But the best solution to understand the real customer experience is to have an agent sitting on the customer side of the application, without the firewall, a possibility that is clearly out of bounds for most public facing applications. The Gomez-Compuware alliance is the first time that these two sides are brought together within the same management application, Compuware Vantage. What Vantage brings to the equation is the Application Performance Management (APM) view of IT Operations: response time collected from the network and correlated with infrastructure and application monitoring in the data center. But, it’s not the customer view. What Gomez brings with its recent version, the “Gomez Winter 2010 Platform Release” is a number of features that let IT understand what goes beyond the firewall: not only how the application content was delivered, but how the additional content from external providers was delivered and what was the actual performance at the end user level: the outside-in view of the application is now combined with the inside-out view of IT Operations provided by Vantage APM. And this is now spreading outside the pure desktop/laptop user group to reach out the increasing mobile and smart phone crowd. IT used to be able to answer the question of “is it the application or the infrastructure?” with Vantage. IT can now answer a broader set of questions: “is it the application, the internet service provider, the web services providers?’ for an increasingly broader range of use-case scenarios.

Read more