Mahout on Elastic MapReduce: Running k-means Clustering

Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented, on the Apache Lucene Mahout wiki, the steps that were necessary to get an example of k-means clustering up and running on Amazon’s Elastic MapReduce (EMR).

Mahout on Elastic MapReduce by Stephen Green
As a side note, there has been considerable [...]

CloudFront LogAnalyzer on Amazon Elastic MapReduce

The Amazon Elastic MapReduce team has a sample application, CloudFront LogAnalyzer, designed to analyze Amazon CloudFront access logs. This tool gives users the power of Amazon Elastic MapReduce to quickly turn access log data into actionable intelligence.
Access logs are activity records about all requests delivered through Amazon CloudFront and contain a valuable set [...]

KMeans Clustering Now Running on Elastic MapReduce

Stephen Green, blogger and principal investigator of the AURA project in Sun Labs, has moved the state of the art in analytics-as-a-service a few steps forward with the first documented working Mahout application on Amazon’s Elastic MapReduce (EMR).
EMR was announced on April 1st and on April 15th Stephen announced to the Mahout users group that he was going [...]

[ANNOUNCE] Apache Mahout 0.1 Released

The Apache Lucene project is pleased to announce the release of Apache Mahout 0.1. Apache Mahout is a subproject of Apache Lucene with the goal of delivering scalable machine learning algorithm implementations under the Apache license.  The first public release includes implementations for clustering, classification, collaborative filtering and evolutionary programming.
Highlights include:

Taste Collaborative Filtering
Several [...]

A Review of Zabbix - Zabbix Rules! (Part 2)

In A Review of Zabbix - Zabbix Rules! (Part 1) we provided a brief introduction to Zabbix in the context of network and security management.  In this post I will discuss Zabbix as an event processing platform.
Zabbix is like most event processing platforms. Zabbix provides both agent-initiated events and server-requested events.  [...]

A Review of Zabbix - Zabbix Rules! (Part 1)

It was 1992, the Internet was still regarded as an uncertain “dream technology” by scientists, and I was working at SprintLink. My role was lead systems engineer, setting up the network and security management center for the SprintLink and Managed Router Network (MRN) team.
We used HP OpenView Network Node Manager (NNM) for our core network [...]

Trend Prediction in Network Monitoring Systems

Following up on Real-Time Predictive Analytics for Web Servers I thought we should “move up a level” and look at various open network monitoring platforms with trend prediction capabilities.
Our web server management team picked Zabbix to monitor a busy production server, and afterwards we started to look into adding predictive analytics. Alberto recommended we [...]

Real-Time Predictive Analytics for Web Servers

We recently made the decision to move to Zabbix to monitor one of our busy production Apache web servers.  One of the things we need to do in the future is try to predict system outages and take corrective actions before the system actually goes down.
For example, a busy server recently experienced an outage that [...]

Copyright © 2007-2008, The CEP Blog, All Rights Reserved.