IBM Cloud Authors: Elizabeth White, Yeshim Deniz, Pat Romanski, Liz McMillan, Stefan Bernbo

Blog Feed Post

R and Data Week 2013

by Joseph Rickert Data Week 2013 is being held this week in sunny San Francisco at the Fort Mason conference center overlooking the Bay. Holding a Bay Area R User Group Meeting (BARUG) at Data Week helped to raise the R consciousness among the hip conference crowd attracted by the intoxicating mix of blue skies, big data hype, startups and visionaries. The BARUG members, on the other hand, came mostly for the free beer and lightning talks. There were six, 12 minute talks with themes that ranged from basic R applications to using R to replace SAS in a big-league manufacturing process. Timothy Sweetser began the evening by showing the regression model he used to analyze BART fares. This was an elementary, but clever analysis of an everyday kind of question, the sort that briefly floats through your mind while you are buying a ticket: “How come this trip costs this much, but I paid a different amount last week for what seemed like a similar trip”. The plot below shows the strata in fares by distance as well as Timothy’s regression model. Utham Kamath described Mathpak, a new cloud based, platform for building collaborative analytical applications, marketing and monetizing them, and showed how R based applications would fit nicely into this scheme. It seemed to me that Utham and his fellow developers are envisioning an new “pick up game” kind of collaboration where developers from around the world will undertake serious projects that anyone of them alone would not have the resources to even contemplate. Clark Fitzgerald spoke about the favorable economics of running R in Amazon cloud (EC2) virtual machines. He compared serious computational hardware to tractors from the point of view that most people just rent tractors when they need to do the heavy lifting. He went on to make the case that the economics of cloud based computing are favorable for even relatively small projects involving teaching and automation. You don’t necessarily have to be working on some high performance computing project to see the benefits. Elaine Jones showed how her IBM tape storage manufacturing group achieved some serious cost cutting by replacing an expensive ($150K) SAS group license with R to do a number of ETL tasks that are fundamental to the production workflow. Critical tasks such as extracting raw data from DB2, summarizing it, formatting it and loading it into a different DB2 databases that used to take 30 or so SAS programs are now handled by R scripts. The following graph shows the production workflow and where R replaced SAS. For someone who blogs about R, it was really encouraging to hear that Elaine first heard about R in from reading the 2009 NY TImes article about R published in an internal INM webpage. Mathias Brandewinder talked about the new F# to R type provider, a kind of “bridge mechanism” for sharing data and resources between the two languages. Types enable R to be expressed as an F# resource. Now, F# users can call R from within the F# environment, and R developers can make use of F# in production code. Mathias gave very convincing live demo where working from his F# IDE he seemed to be mixing F# and R code on the fly to achieve an impressive level of integration. It was like watching a musician switch between instruments. Harrison Decker finished up the evening by describing how reproducible research tools in R are evolving to meet the needs of scientists and researchers. Reproducible research: Allows authors to reproduce the results and figures in their research publications Aids verification of results by other researchers  Allows researchers to learn from and build on the work of others Builds community Harrison very eloquently articulated one of the major strengths of R when he said, almost in passing: “R grows because people are building and sharing”. The slides from all of the presenters will be posted on the BARUG meetup website. Other R related activities include a well-attended R Bootcamp that was held on Tuesday, "The R Summit" a series of talks by Tess Nesbit of Data Song, Uday Tennety of Revolution Analytics, Ryan Walker of Blue Shield of California and Ryan White of A9, and a panel discussion "R means: Business", led by David Smith. The talks and panel discussion are taking place today.

Read the original blog entry...

More Stories By David Smith

David Smith is Vice President of Marketing and Community at Revolution Analytics. He has a long history with the R and statistics communities. After graduating with a degree in Statistics from the University of Adelaide, South Australia, he spent four years researching statistical methodology at Lancaster University in the United Kingdom, where he also developed a number of packages for the S-PLUS statistical modeling environment. He continued his association with S-PLUS at Insightful (now TIBCO Spotfire) overseeing the product management of S-PLUS and other statistical and data mining products.<

David smith is the co-author (with Bill Venables) of the popular tutorial manual, An Introduction to R, and one of the originating developers of the ESS: Emacs Speaks Statistics project. Today, he leads marketing for REvolution R, supports R communities worldwide, and is responsible for the Revolutions blog. Prior to joining Revolution Analytics, he served as vice president of product management at Zynchros, Inc. Follow him on twitter at @RevoDavid

IoT & Smart Cities Stories
To Really Work for Enterprises, MultiCloud Adoption Requires Far Better and Inclusive Cloud Monitoring and Cost Management … But How? Overwhelmingly, even as enterprises have adopted cloud computing and are expanding to multi-cloud computing, IT leaders remain concerned about how to monitor, manage and control costs across hybrid and multi-cloud deployments. It’s clear that traditional IT monitoring and management approaches, designed after all for on-premises data centers, are falling short in ...
DXWordEXPO New York 2018, colocated with CloudEXPO New York 2018 will be held November 11-13, 2018, in New York City and will bring together Cloud Computing, FinTech and Blockchain, Digital Transformation, Big Data, Internet of Things, DevOps, AI, Machine Learning and WebRTC to one location.
We are seeing a major migration of enterprises applications to the cloud. As cloud and business use of real time applications accelerate, legacy networks are no longer able to architecturally support cloud adoption and deliver the performance and security required by highly distributed enterprises. These outdated solutions have become more costly and complicated to implement, install, manage, and maintain.SD-WAN offers unlimited capabilities for accessing the benefits of the cloud and Internet. ...
Business professionals no longer wonder if they'll migrate to the cloud; it's now a matter of when. The cloud environment has proved to be a major force in transitioning to an agile business model that enables quick decisions and fast implementation that solidify customer relationships. And when the cloud is combined with the power of cognitive computing, it drives innovation and transformation that achieves astounding competitive advantage.
DXWorldEXPO LLC announced today that "IoT Now" was named media sponsor of CloudEXPO | DXWorldEXPO 2018 New York, which will take place on November 11-13, 2018 in New York City, NY. IoT Now explores the evolving opportunities and challenges facing CSPs, and it passes on some lessons learned from those who have taken the first steps in next-gen IoT services.
SYS-CON Events announced today that Silicon India has been named “Media Sponsor” of SYS-CON's 21st International Cloud Expo, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Published in Silicon Valley, Silicon India magazine is the premiere platform for CIOs to discuss their innovative enterprise solutions and allows IT vendors to learn about new solutions that can help grow their business.
In his general session at 19th Cloud Expo, Manish Dixit, VP of Product and Engineering at Dice, discussed how Dice leverages data insights and tools to help both tech professionals and recruiters better understand how skills relate to each other and which skills are in high demand using interactive visualizations and salary indicator tools to maximize earning potential. Manish Dixit is VP of Product and Engineering at Dice. As the leader of the Product, Engineering and Data Sciences team at D...
SYS-CON Events announced today that CrowdReviews.com has been named “Media Sponsor” of SYS-CON's 22nd International Cloud Expo, which will take place on June 5–7, 2018, at the Javits Center in New York City, NY. CrowdReviews.com is a transparent online platform for determining which products and services are the best based on the opinion of the crowd. The crowd consists of Internet users that have experienced products and services first-hand and have an interest in letting other potential buye...
Founded in 2000, Chetu Inc. is a global provider of customized software development solutions and IT staff augmentation services for software technology providers. By providing clients with unparalleled niche technology expertise and industry experience, Chetu has become the premiere long-term, back-end software development partner for start-ups, SMBs, and Fortune 500 companies. Chetu is headquartered in Plantation, Florida, with thirteen offices throughout the U.S. and abroad.
The standardization of container runtimes and images has sparked the creation of an almost overwhelming number of new open source projects that build on and otherwise work with these specifications. Of course, there's Kubernetes, which orchestrates and manages collections of containers. It was one of the first and best-known examples of projects that make containers truly useful for production use. However, more recently, the container ecosystem has truly exploded. A service mesh like Istio addr...