Orwellian Event Processing

Recently we completed the installation and training of an open source Bayesian classifier to replace a rule-based approach to manage forum spam.  In a nutshell, we found the rule-based approach was highly prone to both false positives and false negatives; however, a statistical approach using a Bayesian approach has turned out to be far superior. [...]

What’s Really Happening in the World of CEP

There is quite a lot is happening in the world of complex event processing.  Interestingly enough, the people and the companies advancing processing complex events are not calling what they are doing “CEP the Buzzword” we read about in the press or that a handful of “pioneers” claim make them modern-day CEP “experts”.
The companies processing [...]

Announcing Amazon CloudWatch, AutoScaling and Load Balancing

AWS has announced the public beta of three new features for Amazon EC2: Amazon CloudWatch for monitoring AWS cloud resources, Auto Scaling for automatically growing and shrinking capacity based on demand, and Elastic Load Balancing for distributing incoming traffic across Amazon EC2 compute instances.

Mahout on Elastic MapReduce: Running k-means Clustering

Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented the steps that was necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR) on the Apache Lucene Mahout wiki.

Mahout on Elastic MapReduce by Stephen Green
As a side note, there has been considerable [...]

CloudFront LogAnalyzer on Amazon Elastic MapReduce

The Amazon Elastic MapReduce team has a sample application, CloudFront LogAnalyzer, designed to analyze Amazon CloudFront access logs. This tool provided users with the power of Amazon Elastic MapReduce to quickly turn access log data into actionable intelligence.
Access logs are activity records about all requests delivered through Amazon CloudFront and contains a valuable set [...]

[ANNOUNCE] Apache Mahout 0.1 Released

The Apache Lucene project is pleased to announce the release of Apache Mahout 0.1. Apache Mahout is a subproject of Apache Lucene with the goal of delivering scalable machine learning algorithm implementations under the Apache license.  The first public release includes implementations for clustering, classification, collaborative filtering and evolutionary programming.
Highlights include:

Taste Collaborative Filtering
Several [...]

Real CEP News: Amazon Announces Elastic MapReduce

Yesterday Amazon announced the public beta of Amazon Elastic MapReduce, a web-based service that enables businesses, researchers, data analysts, and developers to easily and cost-effectively process vast amounts of data.  Amazon Elastic MapReduce utilizes a hosted Hadoop framework running on the web-scale infrastructure of Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Simple Storage [...]

Early Event Detection – A Prototype Implementation

In my earlier post today, A Review of Zabbix - Zabbix Rules! (Part 2) I used the term “event precursor.”   Afterwords I thought “this is a nice event processing term, I wonder who used it before me?”
So, I did a bit of Googleing around and came up with this excellent paper where the term “event [...]

Copyright © 2007-2008, The CEP Blog, All Rights Reserved.