Archive: April, 2009

CEP Engines, Zen and Hollow Brain Theory

Posted on 04/27/09 8 Comments

For the sake of discussion, let’s assume that the company you work for has a number of very challenging problems that your organization has spent millions of dollars trying to solve over the course of many years.  For many enterprises. these challenges include a number of near real-time challenges including fraud detection, intrusion detection, cyberattack [...]

Read more

Announcing Amazon EC2 with IBM by the Hour

Posted on 04/24/09 No Comments

Amazon EC2 running IBM now offers Amazon EC2 instances combined with popular IBM applications that you can pay for by the hour with no need for licenses or long term upfront commitments. You can now flexibly scale your IBM applications up and down and only pay for what you use. If you already have an [...]

Read more

Software: Buy At Your Own Risk!

Posted on 04/20/09 5 Comments

If public corporations announced their earnings with the same inflated claims as software marketing folks they would be liable for criminal fraud. If car companies claimed their cars met certain safety standards which they did not meet, they would also liable for criminal fraud. On the other hand, software companies seem to have been given [...]

Read more

KMeans Clustering Now Running on Elastic MapReduce

Posted on 04/19/09 1 Comment

Stephen Green, blogger and principal investigator of the AURA project in Sun Labs, has moved the state-of-the-art of analytics-as-a-service a few steps forward with the first documented working Mahout application on Amazon’s Elastic MapReduce (EMR). EMR was announced on April 1st and on April 15th Stephen announced to the Mahout users group that he was [...]

Read more

The Promises and Perils of Twitter

Posted on 04/19/09 1 Comment

One year ago I penned Event Processing in Twitter Space, and today parts of the net are buzzing about Twitter. In a nutshell, Twitter is a one-to-many communications service that uses short messages (140 chars or less). Following on the heels of the blogging phenomena, Twitter has been primarily used for microblogging and group communications. [...]

Read more

[ANNOUNCE] Apache Mahout 0.1 Released

Posted on 04/08/09 1 Comment

The Apache Lucene project is pleased to announce the release of Apache Mahout 0.1. Apache Mahout is a subproject of Apache Lucene with the goal of delivering scalable machine learning algorithm implementations under the Apache license.  The first public release includes implementations for clustering, classification, collaborative filtering and evolutionary programming. Highlights include: Taste Collaborative Filtering [...]

Read more

Predictive Model Markup Language (PMML)

Posted on 04/05/09 No Comments

Predictive Model Markup Language (PMML) is an XML-based language developed by the Data Mining Group (DMG).  PMML provides a standard XML schema for applications to define statistical and data mining models as well as share these models between PMML compliant applications. PMML identifies a number of models including Association Rules, Cluster Models, General Regression, Naive [...]

Read more

A New Archaeological Find

Posted on 04/04/09 No Comments

After having dug to a depth of 10 feet last year, New York scientists found traces of copper wire dating back 100 years and came to the conclusion that their ancestors already had a telephone network more than 100 years ago. Not to be outdone by the New Yorkers, in the weeks that followed, a [...]

Read more

Real CEP News: Amazon Announces Elastic MapReduce

Posted on 04/02/09 4 Comments

Yesterday Amazon announced the public beta of Amazon Elastic MapReduce, a web-based service that enables businesses, researchers, data analysts, and developers to easily and cost-effectively process vast amounts of data.  Amazon Elastic MapReduce utilizes a hosted Hadoop framework running on the web-scale infrastructure of Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Simple Storage Service [...]

Read more