CEP Engines, Zen and Hollow Brain Theory

For the sake of discussion, let’s assume that the company you work for has a number of very challenging problems that your organization has spent millions of dollars trying to solve over the course of many years.  For many enterprises. these challenges include a number of near real-time challenges including fraud detection, intrusion detection, cyberattack [...]

Announcing Amazon EC2 with IBM by the Hour

Amazon EC2 running IBM now offers Amazon EC2 instances combined with popular IBM applications that you can pay for by the hour with no need for licenses or long term upfront commitments. You can now flexibly scale your IBM applications up and down and only pay for what you use. If you already have an [...]

Software: Buy At Your Own Risk!

If public corporations announced their earnings with the same inflated claims as software marketing folks they would be liable for criminal fraud.
If car companies claimed their cars met certain safety standards which they did not meet, they would also liable for criminal fraud.
On the other hand, software companies seem to have been given a regulatory [...]

KMeans Clustering Now Running on Elastic MapReduce

Stephen Green, blogger and principal investigator of the AURA project in Sun Labs, has moved the state-of-the-art of analytics-as-a-service a few steps forward with the first documented working Mahout application on Amazon’s Elastic MapReduce (EMR).
EMR was announced on April 1st and on April 15th Stephen announced to the Mahout users group that he was going [...]

The Promises and Perils of Twitter

One year ago I penned Event Processing in Twitter Space, and today parts of the net are buzzing about Twitter.
In a nutshell, Twitter is a one-to-many communications service that uses short messages (140 chars or less). Following on the heels of the blogging phenomena, Twitter has been primarily used for microblogging and group communications.
Twitter, and [...]

[ANNOUNCE] Apache Mahout 0.1 Released

The Apache Lucene project is pleased to announce the release of Apache Mahout 0.1. Apache Mahout is a subproject of Apache Lucene with the goal of delivering scalable machine learning algorithm implementations under the Apache license.  The first public release includes implementations for clustering, classification, collaborative filtering and evolutionary programming.
Highlights include:

Taste Collaborative Filtering
Several [...]

Predictive Model Markup Language (PMML)

Predictive Model Markup Language (PMML) is an XML-based language developed by the Data Mining Group (DMG).  PMML provides a standard XML schema for applications to define statistical and data mining models as well as share these models between PMML compliant applications.
PMML identifies a number of models including Association Rules, Cluster Models, General Regression, Naive Bayes, [...]

A New Archaeological Find

After having dug to a depth of 10 feet last year, New York scientists found traces of copper wire dating back 100 years and came to the conclusion that their ancestors already had a telephone network more than 100 years ago.
Not to be outdone by the New Yorkers, in the weeks that followed, a California [...]

Copyright © 2007-2008, The CEP Blog, All Rights Reserved.