Machine Learning - Azure vs AWS

By @SrinivasanSunda

The importance of machine learning

Machine learning, the process of predicting future patterns and incidents from models built on past data, is definitely the most important factor in the success of the Internet of Things in the enterprise and consumer space. The main reason is that without machine learning the entire backbone of the Internet of Things - event acquisition, event processing, event storage and event reporting - is merely a live display of events happening elsewhere and will not provide any value to its consumers. Think of a smart monitor in an oil well that tracks various climatic conditions and other factors that can cause a failure; unless the monitor can predict a failure and correct itself, the usefulness of such a solution is quite limited.

MLPaaS - Azure vs AWS
In that context, Machine Learning Platform as a Service (MLPaaS) has become a major component of the leading cloud platforms. Both Azure and AWS offer equivalent services; the comparison below walks through the major building blocks of a machine learning service and how each cloud provider handles them.

Machine Learning Component | Azure | Amazon AWS

Training Data Enablement: Machine learning falls into two major categories, supervised learning and unsupervised learning, and proper training data is one of the most important aspects of a successful machine learning experiment. How well an MLPaaS facilitates the availability and use of training data is therefore a key factor.

Azure: Azure ML has extensive options for data input and manipulation. The data sources can be Hive, Azure SQL, Blob Storage or web-based data feeds, and data can even be entered manually.

Input data can rarely be used directly as training data, so Azure ML offers an array of transformation functions such as Filter, Data Manipulation, Split and Reduce.

Used effectively, these options make Azure ML an effective means of integrating training data into the machine learning process.
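As a rough illustration of what a Split-style transformation does, the sketch below shuffles a dataset and divides it into training and test portions in plain Python. It is not Azure ML code; the function name `split_rows`, the 70/30 ratio and the sample readings are assumptions chosen only for illustration.

```python
import random


def split_rows(rows, train_fraction=0.7, seed=42):
    """Shuffle rows reproducibly and split them into (train, test) lists."""
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be between 0 and 1")
    shuffled = list(rows)  # copy so the caller's data is left untouched
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical sensor readings: (temperature, pressure, failed)
readings = [(20 + i, 100 - i, i % 4 == 0) for i in range(10)]
train, test = split_rows(readings)
print(len(train), "training rows,", len(test), "test rows")
```

Fixing the seed keeps the split repeatable, which makes it easier to compare models trained on the same data.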

Amazon AWS: Amazon Machine Learning (Amazon ML) also supports multiple data sources within its ecosystem.

Amazon Simple Storage Service (Amazon S3) is the storage service for the AWS cloud platform. Amazon ML uses Amazon S3 as a primary data repository.

Amazon ML allows you to create a datasource object from data residing in Amazon Redshift, the data warehouse platform as a service.

Amazon ML also allows you to create a datasource object from data stored in a MySQL database in Amazon Relational Database Service (Amazon RDS).

In addition, Amazon ML provides a rich set of data transformation functions, such as N-gram transformation and Orthogonal Sparse Bigram transformation.
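To make the two text transformations named above concrete, here is a minimal sketch in plain Python. It is a simplified illustration, not Amazon ML's implementation: the function names `ngrams` and `osb` are hypothetical, and tokenization is reduced to lowercasing and splitting on whitespace, whereas the service's own handling of punctuation and case may differ.

```python
def ngrams(text, n):
    """Return all word n-grams of sizes 1..n, shortest first."""
    words = text.lower().split()
    return [
        " ".join(words[i:i + size])
        for size in range(1, n + 1)
        for i in range(len(words) - size + 1)
    ]


def osb(text, window):
    """Orthogonal sparse bigrams: pair the first word of each sliding
    window with every later word in that window, joined by one
    underscore per position of distance between the two words."""
    words = text.lower().split()
    pairs = []
    for i, first in enumerate(words):
        for distance in range(1, window):
            if i + distance < len(words):
                pairs.append(first + "_" * distance + words[i + distance])
    return pairs


print(ngrams("quick brown fox", 2))
print(osb("quick brown fox", 3))
```

The n-gram output keeps only adjacent words, while the sparse bigrams also capture word pairs that are separated by other words, which is why the two are often used as alternatives for text features.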

Support for Machine Learning Life Cycle: Developing and consuming a machine learning model for an enterprise use case is an ecosystem in itself. Multiple players are involved, such as data scientists, data analysts, ETL developers, visualization engineers and business users, and each plays an important role. Hence any machine learning service should support this life cycle workflow.

Azure: One of the key success factors of Azure ML is Azure ML Studio, whose user-friendly graphical interface and supporting workflows make the machine learning process highly collaborative and interactive.

The concept of a workspace allows for separation of duties as well as seamless integration with the rest of the Azure ecosystem, such as storage. Typically a data scientist first creates models and trains them with various parameters and data combinations. Rich visualization features also help the data scientist test the results easily.

Once a model is trained successfully, Azure provides easy options to create a scoring experiment, which can ultimately be published as a web service to be consumed by client applications.

Amazon AWS: The graphical interface of Amazon ML provides a very similar experience and features for creating and training models.

While there is no separation between a training and a scoring experiment, Amazon ML provides many options for model evaluation and interpretation.

When you evaluate an ML model, Amazon ML provides an industry-standard metric and a number of insights to review the predictive accuracy of the model.
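The text above does not name the metric. As an assumption for illustration, the sketch below computes two widely used ones in plain Python: area under the ROC curve (AUC) for binary classification and root mean square error (RMSE) for regression. Whether these match what a given service reports should be checked against its documentation; the function names and sample values are hypothetical.

```python
import math


def auc(labels, scores):
    """AUC as the probability that a randomly chosen positive example
    receives a higher score than a randomly chosen negative one
    (ties count as half a win)."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


print(auc([1, 0, 1, 0], [0.9, 0.3, 0.4, 0.6]))
print(rmse([3.0, 5.0], [2.0, 7.0]))
```

An AUC near 1.0 means the model ranks positives above negatives almost always, while 0.5 is no better than guessing; for RMSE, lower is better.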

Algorithm Support: This is probably the most important criterion in evaluating a machine learning service, as different algorithms apply to different situations.

Almost all machine learning solutions fall under three major categories, namely clustering, classification and regression, depending on whether supervised or unsupervised machine learning is needed.

However, the real challenge can be choosing the particular algorithm that suits each of these three categories.

Azure: Azure Machine Learning supports a whole array of algorithms, including Decision Trees, Logistic Regression, Bayes Point Machine, Neural Networks and K-Means, to name just a few.

One important aspect of Azure Machine Learning is its democratization of these advanced algorithms: even without programming knowledge of machine learning languages like R, we can effectively deploy them for given use cases.

Amazon AWS: Amazon ML supports three types of ML models: binary classification, multiclass classification and regression.

As the name indicates, binary classification is used to predict one of two possible outcomes.

Multiclass classification is used to predict one of three or more possible outcomes.

Regression is used to predict a continuous numeric value.

However, according to the documentation, there does not seem to be an option within Amazon ML to select individual algorithms, such as K-Means, as part of evaluating the model.
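For readers unfamiliar with K-Means, the clustering algorithm mentioned above, the following is a minimal sketch of the standard iterative procedure (Lloyd's algorithm) in plain Python. It is neither Azure's nor Amazon's implementation; the function name `kmeans`, the fixed iteration count and the sample points are assumptions made for illustration.

```python
import random


def kmeans(points, k, iterations=20, seed=0):
    """Cluster 2-D points; returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(
                range(k),
                key=lambda c: (p[0] - centroids[c][0]) ** 2
                + (p[1] - centroids[c][1]) ** 2,
            )
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = (
                    sum(p[0] for p in members) / len(members),
                    sum(p[1] for p in members) / len(members),
                )
    return centroids, labels


# Two visibly separate groups of hypothetical measurements.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 9.0), (8.5, 9.5)]
centroids, labels = kmeans(points, k=2)
print(labels)
```

No labels are supplied to the algorithm, which is what makes it an unsupervised technique: the grouping emerges from the distances between points alone.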

Consumer Applications: Once the model is trained it has to be put into practice, and the most natural use is for the results of machine learning to feed a consumer application, which in today's context is mostly mobile-based. So a robust machine learning service should support multiple consumer applications too.

Azure: Azure Machine Learning provides ready-to-go client-side code for the web services that are published. It supports clients for the request-response model as well as batch execution. Azure Machine Learning also produces sample client-side code in C#, Python and R, and provides an easy interface for testing the request and response parameters. For batch execution, it provides APIs for submitting and starting a job, again with sample code in C#, Python and R. With this support Azure Machine Learning is well suited for developing client-side applications.
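A request-response client of the kind described above might look roughly like the sketch below, which only builds the HTTP request and does not send it. The endpoint URL, API key, column names and the JSON payload shape (`Inputs`, `input1`, `ColumnNames`, `Values`, `GlobalParameters`) are assumptions for illustration; the sample code generated for a published service is the authoritative reference for its actual format.

```python
import json
import urllib.request


def build_scoring_request(url, api_key, column_names, rows):
    """Build (but do not send) an HTTP POST for a published scoring service."""
    payload = {
        # Assumed payload shape; check the sample code for your own service.
        "Inputs": {"input1": {"ColumnNames": column_names, "Values": rows}},
        "GlobalParameters": {},
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": "Bearer " + api_key,
        },
        method="POST",
    )


request = build_scoring_request(
    "https://example.invalid/score",  # placeholder endpoint
    "YOUR_API_KEY",                   # placeholder key
    ["temperature", "pressure"],
    [["71.5", "101.2"]],
)
print(request.get_method(), request.full_url)
# To call a real service: urllib.request.urlopen(request).read()
```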

Amazon AWS: Amazon ML supports both batch predictions and real-time predictions, with an API for each task.

The Amazon ML API has batch prediction operations such as Create, Update and Delete, which can be used to build batch applications.

Similarly, real-time prediction API samples are available for platforms like Java, Python and Scala.
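As a sketch of what a real-time prediction call could look like from Python, the helper below wraps the `predict` operation of a boto3-style Amazon ML client. The model ID, endpoint URL and record are placeholders, and a stand-in client is used so the example runs without AWS credentials; the shape of the fake response is an assumption modeled on the service's documented output.

```python
def predict_realtime(ml_client, model_id, endpoint_url, record):
    """Request one real-time prediction through a boto3-style client."""
    response = ml_client.predict(
        MLModelId=model_id,
        # Attribute values are sent as strings.
        Record={name: str(value) for name, value in record.items()},
        PredictEndpoint=endpoint_url,
    )
    return response["Prediction"]


class FakeClient:
    """Stand-in so the sketch runs offline; records what it was sent."""

    def predict(self, MLModelId, Record, PredictEndpoint):
        self.last_record = Record
        return {"Prediction": {"predictedLabel": "1"}}  # assumed response shape


client = FakeClient()
prediction = predict_realtime(
    client,
    "ml-EXAMPLE",               # placeholder model ID
    "https://example.invalid",  # placeholder real-time endpoint
    {"temperature": 71.5, "pressure": 101.2},
)
print(prediction["predictedLabel"])
# With real credentials: client = boto3.client("machinelearning")
```

Passing the client in as a parameter keeps the helper easy to test and leaves credential handling to the caller.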

Pricing is not discussed in the table because PaaS solutions like machine learning are charged per usage, with pricing either per prediction or per prediction hour, and enterprises typically care more about the capabilities of the platform when choosing a machine learning service.

Also, without conducting significant machine learning case studies we cannot comment in depth on the algorithms and their support; however, a higher-level view indicates that Azure Machine Learning supports more algorithms and an individual choice of algorithms within a category such as clustering or classification, which may be of interest to seasoned data scientists. Most data scientists also predict that the future of machine learning lies in unsupervised learning, which is well supported by Azure in the form of clustering algorithms, especially the K-Means algorithm.

More Stories By Srinivasan Sundara Rajan

Highly passionate about utilizing Digital Technologies to enable next generation enterprise. Believes in enterprise transformation through the Natives (Cloud Native & Mobile Native).
