By Dana Gardner E-Commerce Times
09/27/07 4:00 AM PT
By adding real-time analytics to their systems management practices, IT operators can determine the normal state of how systems should be performing. Then, by measuring the characteristics of systems under many conditions over time, administrators can gain predictive insights into their entire operations, based on a business services level of performance and demand.
The movement of IT and systems management to the end-to-end business service value level has been a long time coming. Yet the need has never been higher. Enterprises and on-demand application providers alike need to predict how systems will behave under a variety of conditions.
Rather than losing control to ever-increasing complexity -- and gaining less and less insight into the root causes of problematic applications and services -- operators must gain the ability to predict and prevent threats to the performance of their applications and services. Firefighting against applications performance degradation in a dynamic service-oriented architecture (SOA) just won't cut it.
By adding real-time analytics to their systems management practices, IT operators can determine the normal state of how systems should be performing. Then, by measuring the characteristics of systems under many conditions over time, administrators can gain predictive insights into their entire operations, based on a business services level of performance and demand. They can stay ahead of complexity, and therefore contain the costs of ongoing high-performance applications delivery.
I recently had a podcast discussion with Mazda Marvasti, the CTO of Integrien, on managing complexity by leveraging probabilistic systems management and remediation. I learned that Integrien's Alive suite uses probabilistic analysis to predict IT systems problems before costly applications outages. Furthermore, I received some details on the next Alive 6.0 release in Q4 of this year.
Dana Gardner: Can you give us some sense of the direction that the major new offerings within the Alive product set will take?
Mazda Marvasti: Basically, we have three pillars that the product is based on. First is usability. That's a particular pet peeve of mine. I didn't find any of the applications out there very usable. We have spent a lot of time working with customers and working with different operations groups. ... The second piece is interoperability. The majority of the organizations that we go to already have a whole bunch of systems, whether it be data collection systems, event management systems, or configuration management databases, etc.
Our product absolutely needs to leverage those investments -- and they are leveragable. But even those investments in their silos don't produce as much benefit to the customer as a product like ours going in there and utilizing all of that data that they have in there, and bringing out the information that's locked within it.
The third piece is analytics. What we have in the product coming out is scalability to 100,000 servers. We've kind of gone wild on the scalability side, because we are designing for the future. Nobody that I know of right now has that kind of a scale, except maybe Google (Nasdaq: GOOG), but theirs' is basically the same thing replicated thousands of times over, which is different than the enterprises we deal with, like banks or healthcare organizations.
Automated Institutional Wisdom
A single four-processor Xeon box, with Alive installed on it, can run real-time analytics for up to 100,000 devices. That's the level of scale we're talking about. In terms of analytics, we've got three new pieces coming out, and basically every event we send out is a predictive event. It's going to tell you this event occurred, and then this other set of events have a certain probability within a certain time frame to occur.
Not only that, but then we can match it to what we call our "fingerprinting." Our fingerprinting is a pattern-matching technology that allows us to look at patterns of events and formulate a particular problem. It indicates particular problems and those become the predictive alerts to other problems.
Now, with SOA and virtualization moving into application-development and data center automation, there is a tremendous amount of complexity in the operations arena. You can't have the people who used to have the "tribal knowledge" in their head determining where the problems are coming from or what the issues are.
The problems and the complexity have gone beyond the capability of people just sitting there in front of screens of data, trying to make sense out of it. So, as we gained efficiency from application development, we need consistency of performance and availability, but all of this added to the complexity of managing the data center.
That's how the evolution of the data center went from being totally deterministic, meaning that you knew every variable, could measure it, and had very specific rules telling you if certain things happened, and what they were and what they meant -- all the way to a non-deterministic era, which we are in right now.
Change Is Inevitable
Now, you can't possibly know all the variables, and the rules that you come up with today may be invalid tomorrow, all just because of change that has gone on in your environment. So, you cannot use the same techniques that you used 10 or 15 years ago to manage your operations today. Yet that's what the current tools are doing. They are just more of the same, and that's not meeting the requirements of the operations center anymore.
I've been working on these types of problems for the past 18 years. Since graduate school, I've been analyzing data extraction of information from disparate data. I went to work for Ford and General Motors -- really large environments. Back then, it was client-servers and how those environments were being managed. I could see the impending complexity, because I saw the level of pressure that there was on application developers to develop more reusable code and to develop faster with higher quality.
The run book is missing that information. The run book only has the information on how to clean it up after an accident happens.
That's the missing piece in the operations arena. Part of the challenge for our company is getting the operations folks to start thinking in a different fashion. You can do it a little at a time. It doesn't have to be a complete shift in one fell swoop, but it does require that change in mentality. Now that I am actually forewarned about something, how do I prevent it, as opposed to cleaning up after it happens.
Dana Gardner is president and principal analyst at Interarbor Solutions, which tracks trends, delivers forecasts and interprets the competitive landscape of enterprise applications and software infrastructure markets for clients. He also produces
BriefingsDirect sponsored podcasts. Disclosure: Integrien is a sponsor of BriefingsDirect business productivity podcasts.
DVR Market Penetration: Riding a Provider-Powered Wave September 26, 2007
The use of digital video recorders is growing fast among American TV watchers -- half of all households are projected to have one by 2011, according to recent research. However, this growth isn't necessarily due to rabid user demand. Rather, it's the cable and satellite TV providers that are pushing much of the growth with their offers of discounted and subsidized boxes and services.
Related Stories
SOA for Dummies September 25, 2007
Implementing SOA involves organizational changes on multiple levels, both technological and cultural, and can take anywhere from six months to six years, said Miko Matsumura, vice president and deputy CTO at webMethods and committee chair at OASIS, the Organization for the Advancement of Structured Information Standards.
New MySQL Enterprise Suite Due Soon September 17, 2007
Any business that has been shaken by server outages knows the value of monitoring capabilities. In the early days of open source, business managers worried most about the risk of finding out too late that promises of support from open source vendors would be paper-thin. The monitor service being offered at MySQL continuously monitors MySQL servers and raises alerts of special problems.
Meeting SOA and Web Services Security Challenges, Part 2 August 31, 2007
"SOA compounds the security decision because many moving parts, as in modular programs, come together as the final application. IT and security staffs need to closely coordinate at the beginning of an SOA project in order to determine the appropriate mitigation of confidentiality, integrity and availability," CipherOptics vice president of product management Scott Palmquist said.
Related News Alerts
More by Dana Gardner
Nothing New Under the Business Commerce Cloud? November 22, 2009
Business commerce clouds are all about leveraging cloud architecture to go to the next level: a dynamic business-services environment that wells up around the needs of a business group or niche, and then subsides when lack of demand dictates. Is this the wave of the future, or are we really just pouring old "business webs" wine into new bottles?
Text Analysis and the Next Generation of BI November 15, 2009
External data has grown in both volume and importance across the Internet. Companies are figuring out ways to make the most of Web data services for business intelligence. Real-time text analytics fills out a framework of Web data services that can form a whole greater than the sum of the parts. However, any BI or any text analysis is no better than the data source behind it.
Pumping Up Performance in Densely Packed Data Centers November 08, 2009
Thanks to architectural advancements and better efficiencies, densely stuffed data centers can carry ever-greater loads, and that can certainly work to consolidate and ultimately reduce costs. However, having fewer data centers means all the information they handle will likely have to travel longer distances between server and user. Network services and Internet performance management may be the solution.