Friday, May 10, 2013

The Zementis Partnership with FICO


Stuart Wells, FICO CTO, announced the strategic partnership between Zementis and FICO at FICO World on May 2, 2013. FICO clients will now benefit from the outstanding Zementis scoring technology.

How? The Zementis ADAPA scoring engine provides a highly scalable framework to deploy, integrate, and execute complex data mining and predictive models based on the PMML standard. Models built in most commercial and open source data mining tools, such as FICO Model Builder or R, can now instantly be deployed in the FICO Anaytic Cloud. 

Customers, application developers and FICO partners will be able to extract value and insight from their predictive models and data immediately, using ADAPA and PMML. This will result in quicker time to innovation and value on their analytic applications.

Read the press release!

Predictive Analytics Deployment

Zementis offers software solutions that enable scalable, real-time execution of predictive analytics across a variety of platforms based on the PMML standard. These include:

ADAPA Scoring EngineOur solution for real-time scoring. ADAPA is available for on-site deployment as a traditional license or as a service in the Amazon Elastic Compute Cloud (EC2) and IBM SmartCloud Enterprise. And now, with our FICO partnership, ADAPA will also be available in the FICO Analytic Cloud.

UPPI, the Universal PMML Plug-in: The leading solution for Big Data, UPPI provides scoring in-database and for Hadoop. It is available for EMC Greenplum, IBM Netezza, SAP Sybase IQ, Teradata/Aster as well as Hadoop/Hive and Datameer. 

Tuesday, May 7, 2013

KDD 2013 PMML Workshop (August 11, 2013)

Come and join us for the KDD PMML Workshop to be held in Chicago on August 11. Organized by the Data Mining Group (DMG), this workshop will feature invited talks and presentations of selected papers. 
KDD PMML Worshop
What: A half-day workshop on the Predictive Model Markup Language (PMML)
When: Sunday, August 11, 2013 - Time TBD
Where: Chicago, IL - Chicago Sheraton

Call For Papers
  • Abstracts due: May 14, 2013, 23:59pm CT
  • Papers due: May 24, 2013, 23:59pm CT
  • Acceptance notification: May 31, 2013
  • Final Camera Ready Paper Due: June 7, 2013
  • CFP Website

Thursday, April 11, 2013

Predictive Model Markup Language (PMML) Workshop at KDD 2013 in Chicago

Please join us for a Predictive Model Markup Language (PMML) workshop at KDD 2013 in Chicago on August 11, 2013, to exchange exciting new developments, leading practices, and high impact applications in big data, knowledge discovery and data mining which utilize the PMML standard. 

The annual ACM SIGKDD conference on Knowledge Discovery and Data Mining (KDD) is the premier international forum for data mining and big data researchers and practitioners from academia, industry, and government to share their ideas, research results and experiences. We invite submission of papers describing implementations of the Predictive Model Markup Language (PMML). Submitted papers will go through a competitive peer review process. Please consult the workshop website for full details regarding paper preparation and submission guidelines. 

PMML workshop website 
http://kdd13pmml.wordpress.com/ 

KDD conference web site 
http://www.kdd.org/kdd2013/

Wednesday, March 27, 2013

The Zementis Partnership with Infocom


It is our pleasure to announce a strategic partnership with Infocom. If you missed out on our press release, here is the headline:

Zementis and Infocom partner to deliver predictive analytic solutions in Japan.

Partnership

Dedicated to the Japanese market, Infocom combines strong expertise in data mining and predictive analytics with extensive delivery and consulting capabilities.

Zementis offers software solutions that enable scalable, real-time execution of predictive analytics across a variety of platforms based on the PMML standard. These include the ADAPA Scoring Engine available for on-site deployment or in the cloud, and UPPI, the Universal PMML Plug-in for in-database scoring and Hadoop (available for IBM Netezza, Teradata/Aster, EMC Greenplum, SAP Sybase IQ as well as Hadoop and Datameer).

Infocom will market, distribute and support Zementis's predictive analytics software in Japan.

To take a look at the press release, click HERE.

Additional Online Resources 

Friday, March 8, 2013

Making the Case for PMML and ADAPA

If you are not familiar with PMML, the Predictive Model Markup Language, you may be wondering what all the fuss is about ...

PMML is the de facto standard to represent data mining and predictive analytic solutions. With PMML, one can easily share a predictive solution among PMML-compliant applications and systems  For example, you can build your model in R, export it in PMML, and use ADAPA, the Zementis Scoring Engine, to deploy it in production.

Many data mining models are a one-time affair. You use historical data to build the model and use it to analyze ... historical data. Wait! That sounds more like descriptive analytics, not predictive analytics. Well, that is sort of true. To be truly predictive, a data mining model needs to be applied to new data. These are the models that need to be operationally deployed and, from my point of view, these are the solutions that are truly revolutionizing the way we do business and live in the Big Data world.

If you want then to use your data mining model to make predictions when presented with new data, it needs to be a dynamic asset. It cannot be static. You need to be able to build it and instantly put it to use. And, that's where PMML and ADAPA come in handy.

Obviously, a few data mining tools try to lock you in. You happily build the model using tool A, just to realize that you need the same tool to execute it. In this case, you are missing out. Here are some of the benefits of moving your predictive model to ADAPA:
  • Overcome speed/memory limitations
  • Dramatically lower your infrastructure cost
  • Tap into all the advantages of cloud computing with ADAPA on the Cloud (IBM SmartCloud or Amazon EC2)
  • Produce scores in real-time (using Web Services or Java API), on-demand, or batch-mode
  • Execute your models directly from Excel, by using the ADAPA Add-in for Excel
  • Benefit from using a set of PMML-compliant model development tools (best of breed)
  • Deploy your models in minutes
  • Manage models via Web Services or a Web console
  • Upload one or many models into ADAPA at once
  • Benefit from the seamless integration of business rules and predictive models (yes, for those who need it, ADAPA comes with a business rules engine)
PMML and ADAPA allow you to use best of breed tools (not the same old tool) for the job at hand. Also, you can leverage the expertise from a diverse group of data scientists. That means, not all your data scientists need to be experts on a single tool. They can use different tools that share one thing in common, the PMML standard. And, once represented in PMML, models can be easily understood by all team members. PMML allows for transparency and, in doing so, fosters best practices.



Why not benefit from: 1) an open standard to represent data mining models; and 2) a proven scoring engine that consumes any version of PMML and make it available for execution right away, in real-time?

Keep also in mind that ADAPA's sister product, the Universal PMML Plug-in (UPPI), allows you to move the same PMML file in-database or Hadoop. UPPI is currently available for EMC Greenplum, SAP Sybase IQ, IBM Netezza, and Teradata/Aster. With UPPI for in-database scoring, there is no need to move your data outside the database. Data and models reside inside it and so there is minimal data movement and maximum scoring speed. UPPI is also available for Datameer and will soon be available for Hadoop/Hive.

Making a model operational in minutes has never been easier! And, it is all because of PMML and scoring tools such as ADAPA and UPPI.

Thursday, February 28, 2013

In-database scoring with Teradata and Teradata Aster

The partnership between Zementis and Teradata allows customers with a variety of data mining tools to efficiently deploy predictive models based on the Predictive Model Markup Language (PMML) standard.  Focused on Big Data applications, the Universal PMML Plug-in (UPPI) for Teradata enables scalable execution of standards-based predictive analytics directly within the Teradata data warehouse.

To read more about the benefits of running your predictive solutions inside Teradata and Teradata Aster, please visit:

http://www.teradata.com/templates/Partners/PartnerProfile.aspx?id=12884902321


PMML Scoring

Zementis offers a range of products that make possible the deployment of predictive solutions and data mining models built in all the top commercial and open-source data mining vendors. Our products include the ADAPA Scoring Engine for real-time scoring and UPPI, which is currently available for a host of database platforms as well as Hadoop/Datameer. For a list of available platforms, please visit our in-database products page.

Rationale

Not all analytic tasks are born the same. If one is confronted with massive volumes of data that need to be scored on a regular basis, in-database scoring sounds like the logical thing to do. In all likelihood, the data in this case is already stored in a database and, with in-database scoring, there is no data movement. Data and models reside together hence scores and predictions flow on an accelerated pace




Monday, December 17, 2012

Zementis, enabling Big Data and real-time scoring with PMML


Zementis, Inc. is a company that makes software for the operational deployment and integration of predictive analytics and data-mining solutions. Its main products are the ADAPA Decision Engine, a platform for statistics and data processing, and the Universal PMML Plug-in for Hadoop and in-database scoring.



The name Zementis, symbolizing "concrete thoughts", is derived from the German word Zement (cement, concrete) and the Latin word Mentis (thought, intellect) and relates to the company's core competence in machine learning and AI.

Road to ADAPA

Founded in 2004 with the goal of providing predictive analytics to the marketplace, Zementis is composed of two main divisions, analytics and engineering. Although it started as a company focused on building predictive models, Zementis scientists soon realized that their models needed a platform in which they could be easily deployed and managed. From this need, the ADAPA Decision Engine came to be.

ADAPA initially supported only neural networks, but it soon became a platform for the deployment of a myriad of statistical techniques as well as data processing (download the ADAPA Product Datasheet for a list of supported techniques). From its inception, ADAPA has been based on open-standards, including PMML, the Predictive Model Markup Language. As a member of the Data Mining Group (DMG), the committee defining PMML, Zementis has helped shaped the standard as it becomes the necessary vehicle for the sharing of predictive solutions between applications.

In 2008, ADAPA was launched as a service on the Amazon Elastic Compute Cloud (Amazon EC2) and is currently being used worldwide by companies and individuals who want to execute their predictive models and decision logic.

In 2012, ADAPA cloud offering was extended to the IBM SmartCloud. In this way, IBM provides companies around the world predictive decisions when and where they are needed.

Universal PMML Scoring Engine - UPPI

Building on the heritage of its ADAPA Decision Engine, Zementis launched the Universal PMML Plug-in (UPPI), a highly optimized, in-database scoring engine for predictive models, fully supporting the PMML standard. With PMML, UPPI delivers a wide range of predictive analytics for high performance scoring. It shortens time to market for predictive models and empowers users through instant deployment of predictive models. UPPI is available for the following DB platforms:
The Universal PMML Scoring Engine is also available for Datameer for scoring in Hadoop.

Zementis Locations

Zementis HQ is located in San Diego in California. It also has an office in Hong Kong for servicing clients in the Asia-Pacific region.

References

  • R. Nisbet, J. Elder, and G. Miner. Handbook of Statistical Analysis and Data Mining Applications. Academic Press, 2009.

Tuesday, December 11, 2012

Big Data and Real-time Scoring with ADAPA, the Universal PMML Scoring Engine

When first released, ADAPA (Adaptive Decision And Predictive Analytics) was purely a scoring engine, used to produce scores out of data mining models expressed in PMML (Predictive Model Markup Language) format. More recently, however, with the addition of a rules engine to its core, ADAPA is able to seamlessly combine rules and predictive models, which enables businesses to manage and design automated decisioning systems. In this way, ADAPA allows for the concretization of Enterprise Decision Management (EDM) solutions.

PMML Support and Predictive Analytics


Predictive analytics comprises a series of modeling techniques which can be used to extract relevant patterns present in large amounts of data to better predict the future.

ADAPA is able to generate scores out of a variety of predictive modeling techniques expressed in PMML. PMML provides a standard way for the expression of predictive models. In this way, proprietary issues and incompatibilities are no longer a barrier to the exchange of models between applications.

Currently, ADAPA supports the following PMML elements:
    • Multinomial Logistic
    • General Linear
    • Ordinal Multinomial
    • Simple Regression
    • Generalized linear model
    • Cox Regression Models
  • Multiple Models: Model Composition, Ensembles, Segmentation, and Chaining (including Random Forest Models).
as well as a variety of elements involved in data pre- and post-processing:
  • Built-in Functions (logic and arithmetic operators as well as conditional logic)
  • Normalization
  • Discretization
  • Value Mapping
  • Custom Functions
  • Targets/Scaling
  • Outputs (including business decisions and thresholds)
  • Model Verification (which in ADAPA can also take the form of a CSV file)
Once a model is uploaded in ADAPA, it can be executed in batch and real-time. ADAPA is a PMML consumer, therefore it is able to execute PMML code exported from tools such as R, IBM SPSS, SAS, KNIME, KXEN, STATISTICA, BigML, RapidMiner, etc.

Besides offering a web-based console to manage models and rule sets, ADAPA includes capabilities to test these under its decision and validation framework.

Business rules


Business rules allow for business process and logic to be expressed outside of programming code. With ADAPA, the integration of predictive analytics and rules is seamless. Simply put, ADAPA allows both data-driven and expert knowledge to be combined into a single and concise solution, executed in real-time or in batch-mode.


ADAPA allows for business knowledge to be expressed in simple tabular format. In ADAPA, rules can be used to manage the execution of different predictive models depending on the business context. They can also incorporate scores generated by different predictive models throughout the business process. The acting together of the two technologies has the potential to significantly extend the precision of any decision logic.

ADAPA rules leverage the power of the leading Java open-source rules engine Drools which is supported by a strong community of developers and JBoss, a division of Red Hat. This fast, highly efficient rules engine has proven its excellence in numerous commercial installations.

All decisions in ADAPA are readily available by the use of Web Services.

ADAPA To Go


PMML Conversion


ADAPA provides its users with the ability to automatically convert older PMML models (versions 2.0, 2.1, 3.0, 3.1, 3.2, 4.0) to version 4.1. Besides schema validation, the conversion process also corrects known issues with PMML code from several sources/vendors. The aim is to successfully validate code in older versions of PMML and convert them to PMML 4.1. 

Transformations Generator


PMML provides a variety of data transformations, including value mapping, normalization, and discretization. It also offers several built-in functions as well as arithmetic and logical operators which can be combined to represent complex pre-processing steps. With the Transformations Generator tool, one can graphically design a transformation and obtain the respective PMML code. This can then be pasted into an existing PMML file and uploaded in ADAPA.

Software as a Service on the Cloud (Amazon EC2 and IBM SmartCloud)


ADAPA predictive analytics is available through the Amazon Elastic Computing Cloud (Amazon EC2) and the IBM SmartCloud. It provides the first SaaS (Software as a Service) predictive decisioning platform. The user can upload and manage several rule sets as well as models expressed in PMML and score data in real-time through the use of web-service calls (ADAPA will automatically convert older versions of PMML to version 4.1 and correct any known issues from different vendors). ADAPA as a Service empowers people, since it allows for anyone anywhere to deploy and use state of the art data mining models.

ADAPA Add-in for Microsoft Office Excel


To make the process of executing predictive models even simpler, Zementis also offers the ADAPA add-in for Excel 2007 and 2010 (available for free). With the add-in, anyone in the enterprise is able to score data in Excel by executing models previously deployed in the Cloud.

ADAPA allows for real-time data scoring at any time a new event occurs since it can be used from inside any application via Web Service Calls. Excel is just one such application which happens to be a very well known tool (used by many). This is remarkable, since it frees users from having to deal with all the technology required for scoring their data whenever necessary. With the Excel add-in, all one has to do is to select which data records to score (or the columns and rows containing the relevant data) and pressing on the “Score” button in Excel … et voila’ … new predictions are generated automatically for all selected records.

ADAPA Flavors


ADAPA is currently being offered in three ways:
  • On the Amazon Cloud: launch your own private instances of ADAPA on Amazon EC2.
  • On the IBM SmartCloud: launch your own private instances of ADAPA on the IBM SmartCloud.
  • On Site: ADAPA is also available for deployment on site or on your private cloud. 

In-Database Scoring


Built on the heritage of the ADAPA Decision Engine, the Universal PMML Plug-in (UPPI) is a highly optimized, in-database scoring engine for predictive models, fully supporting the PMML standard. With PMML, UPPI delivers a wide range of predictive analytics for high performance scoring. It shortens time to market for predictive models and empowers users through instant deployment of predictive models.


UPPI is available for the following platforms:

Scoring for Hadoop


Zementis and Datameer have partnered to deliver first-ever standards-based execution of predictive analytics on a massive parallel scale. This joint solution combines the Zementis Universal PMML Scoring Engine for real-time execution of predictive models with the power and scale of Datameer, an end-to-end BI solution that includes data source integration, an analytics engine, visualization and dashboarding.

UPPI for Datameer brings together essential technologies, offering the best combination of open standards and scalability for the application of predictive analytics. The Plug-in fully supports the Predictive Model Markup Language (PMML), the de facto standard for data mining applications, which enables the integration of predictive models from IBM SPSS, SAS, R, and many more.

References

Resources

  • Zementis Support - Help desk and support forums providing support information for PMML, ADAPA, and the Universal PMML Plug-in (UPPI).
  • ADAPA product page - contains information about ADAPA on the Cloud, on Site, and the add-in for Excel.
  • Deploy! Newsletter - monthly newsletter containing the latest news on ADAPA and predictive analytics.
  • PMML - PMML resources page including examples.
  • PMML Tools - The Transformations Generator.
  • Videos - webinars and on-line video tutorials about model deployment, ADAPA, Excel add-in, PMML, ...
  • Data Mining Group (DMG) - describes PMML, the Predictive Modeling Markup Language, as well as gives information on all the companies currently supporting the standard.
  • Drools homepage
  • PMML 4.1 is here! - gives a short summary of the new features of the latest release of PMML.
  • PMML in Action (2nd Edition) - PMML book available on Amazon (Paperback and Kindle).
  • PMML Presentation - video of Dr. Alex Guazzelli's PMML presentation for the ACM Data Mining Group at LinkedIn.





Copyright © 2009 Zementis Incorporated. All rights reserved.

Privacy - Terms Of Use - Contact Us