The Attack of Oracle Guest

Last October I published a post that identified the features that both JBoss Data Grid and Oracle Coherence provide (link). My goal was to establish a baseline for the features that a data grid should provide. It was not to state that one data grid was better than the other. Little did I know an Oracle employee would respond by attacking Red Hat, its engineers, and me.

Is it fear? Is it hostility? I don’t know.

I have engaged in discussions with competitors before. Roman and I engaged in a competitive discussion in response to one of my posts comparing IBM WebSphere and JBoss EAP (link). However, we both conducted ourselves in a professional manner. I’ve engaged in competitive discussions with Spring evangelists, but we focused on the technology.

I always enjoy reading the discussions between Cameron, Nikita, and Nati on TheServerSide. I find their discussions to be insightful. They conduct themselves in a professional manner. To me, it looks like they respect each other and they respect each other’s products.

To be fair, this is just a single anonymous visitor. If they had not left a comment while connected to Oracle’s network, I would not have known that they work for Oracle. However, I would have inferred it.

Let the show begin.

Oracle Guest

I’m also interested in this question too RK. JDG lack of references that could corroborate performance majority against Coherence. Coherence in the other hand has a lot of public cases that shows how scalable, reliable and fast it is. It is the word’s first in-memory computing platform of the world, so this blog doesn’t offer credibility at all, mostly because Shane is a marketing guy from Red Hat.

He’s just using a old marketing technique to improve reliability of their offers product, comparing it with another one which is leader in its industry, like Coherence. Comparing with Coherence would pass the idea of “JDG is so good just like Coherence, so instead of buying Coherence, buy from Red Hat” but in fact it is not true. JDG should implement A LOT OF features to be comparable with Coherence.


Me

I published the results of a performance test (JDG 6.0.1) last December (link). I have written a technical white paper that includes the results of a number of performance tests (JDG 6.0.1). However, it is awaiting publication. I expect it to be made available via the Red Hat Customer Portal. In addition, I will be publishing the results of a few performance tests (JDG 6.1) executed on better hardware to How to JBoss within the next two weeks.

I executed the performance tests with RadarGun (link), an open source project for data grid performance testing. When I published the results, I provided both the RadarGun and the JDG configuration files. The best way for an organization to select a data grid based on performance and reliability is to configure and execute their own performance tests based on their own requirements in their own controlled environment.

Oracle Guest

Those results compares JDG against Terracotta from Software AG, not with Coherence from Oracle. You cannot say at all that JDG is better than Coherence because you’ve never tested. Again, not reliable statements coming from you. You’ve tried to use a Terracotta comparison to generalize JDG performance results. Lets call Oracle, VMware, IBM, Gigaspaces and TIBCO to participate of the tests.

Me

I have never stated that JDG performs better than Oracle Coherence. Thus, I am unaware of these “not reliable statements”. Could you point them out? You are welcome to call Oracle, VMware, IBM, GigaSpaces, and TIBCO. They are welcome to use my RadarGun and JDG configuration files to configure and execute their own performance tests with RadarGun. This is essentially what I did after coming across the performance test results published by Terracotta. I simply used the parameters that they made available. As I mentioned previously, a number of organizations evaluating JDG are doing just that. They are executing performance tests against both JDG and Oracle Coherence using RadarGun or YCSB (Yahoo! Cloud Serving Benchmark).


Me

Tangosol Coherence was an innovative product in its day, but that day was several years ago. I do not question that it remains reliable. However, there have been a number of advancements in distributed systems over the past few years. JBoss Data Grid brings together the reliability of the previous generation of data grids and the innovation of the next generation of data grids.

Oracle Guest

For some unique features of Coherence like its non-blocking I/O TCP/IP network based on TCMP, which allow it to achive better results with distributed transactions, fail-over detection (the fastest of the industry) WAN replication with latency issues due geographical distribution and the HTTP Session offload from AppServers. Not mentioning integration with A LOT OF AppServers like WebLogic, GlassFish, Websphere, Tomcat, IIS, Resin and even your JBoss AS. JDG only gives support for which is from Red Hat. What a nice example of being “open” hãm ?! :)

Me

The TCMP features that you listed are not unique to TCMP. They are provided by JGroups as well. Those features include non-blocking I/O (NIO), failure detection, and cross site (WAN) replication. I would hope that Oracle Coherence*Web would support both Oracle WebLogic and Oracle GlassFish. Is that really a feature? If so, Red Hat provides it as well. JDG supports both JBoss EAP and JBoss EWS (Apache Tomcat). However, there is no reason for an organization to use JDG or Oracle Coherence*Web for session replication with IBM WebSphere.


Me

Red Hat public references include both Chicago Board Options Exchange (CBOE) and Cisco, and they have both presented at Red Hat Summit / JBoss World. I can’t think of an environment with higher demands for both performance and reliability than financial trading. The Pentaho BI Platform / Server includes a plugin for Infinispan (link). There is no Oracle Coherence plugin.

Oracle Guest

Only this? Coherence has thousands of customer references, including mission critical ones that for years NEVER, I mean, NEVER restarted their servers. Come on, you can do better than this. Red Hat (you) should be a little bit more humble when talking about leaders like Oracle. Someday Red Hat will be a huge company, I don’t doubt that, but that didn’t happened so far and will take some time.

Me

Can you point out a list that includes thousands of public customer references for Oracle Coherence? The only list that I found includes 39 customer references, and that list includes duplicates (link). You state that thousands of Oracle Coherence customers have NEVER restarted their servers. That is a bold claim. Do you have evidence to support such a claim? After all, servers may be restarted to upgrade the hardware and / or operating system. That, and enterprise software typically has a finite life cycle. Has not a single one of those thousands of Oracle Coherence customers ever upgraded their original version to the latest version? I suspect that you and I have different interpretations of “huge company”. I find it ironic that you demand humility while showing disrespect.


Me

Are you stating that because the company you work for (Oracle) productized (well, acquired) a data grid before the company I work for (Red Hat) and that my role is now in marketing, I lack credibility in the data grid domain? I would advise against such a statement. My technical knowledge of data grids is second to none, and it is not derived from my role in marketing. I have worked with a number of enterprise organizations in the financial, telecommunication, and media sectors in a developer / architect capacity in my previous role to integrate data grids in demanding environments.

Oracle Guest

Oh yes? Give me examples of data grid technologies you’ve worked with, scenarios of data partitioning and JVM tuning you’ve implemented for, entity domain versioning strategies you’ve designed it, hashCode algorithms strategies that you’ve proposed for a complex based key node, examples of KPIs that you retrieved from JMX and from the DG, and of course, examples of the following DG scenarios: average latency less than 600 microseconds, 5k TPS or higher considering a transaction with a minimum of 15KB of size, client applications both based on Java, C++, .NET and “the rest of world” that could be accessed with REST or SOAP, projects with more than 20K hours of duration (real one projects) instead of stupid POCs, usage of at least three serious data grids technologies including Coherence, GemFire, Websphere eXtreme Scale, Gigapaces, TIBCO ActiveSpaces, etc.

Me

I do not question your knowledge and experience, nor am I going to. I am dumbfounded as to why you feel justified in questioning mine.

I look at it like this. You have pilots, and you have mechanics. You have users, and you have engineers. A pilot knows how the controls work, a mechanic knows how the parts work. When it comes to JDG, I have been a full time pilot and a part time mechanic. However, the activities that you have mentioned are those of a user, not of an engineer. Further, they are not specific to data grids. It’s one thing to talk about metrics, latency, and throughput. It’s another to talk about concurrency, algorithms, and how distributed systems work.

JVM tuning. I have posted a handful of notes on both OS and JVM tuning (link / link / link). Instead of talking about JVM tuning, let’s talk about implementations of ConcurrentMap (link).

JMX. I have monitored and analyzed the performance of JDG with JBoss Operations Network, in-house tools, and BTrace. Here is a list of JMX attributes and operations for JDG (link).

Latency and throughput. An average latency of 600 microseconds is not particularly impressive in the financial trading industry. Nor is 5,000 transactions per second. Did I not mention that I collaborated with their engineers and co-presented with them at Red Hat Summit / JBoss World? Instead of talking about latency and throughput, let’s talk about data structures and eviction algorithms (link).

Heterogeneous clients. I’ll be honest, I have not worked on projects that required integration in a heterogeneous environment. Those that have, have done so with REST and memcached. Oracle Coherence doesn’t support the memcached protocol, does it? Instead of talking about REST and SOAP, let’s talk about local / remote transaction contexts and the number of remote procedure calls (RPC) required for optimistic / pessimistic locking.

Partitioning and hashing. JDG has implemented consistent hashing and virtual nodes. A modern solution. It uses an implementation of the excellent MurmurHash3 algorithm (link). It does not rely on a dated implementation based on centralized and / or manual hashing. Does Oracle Coherence? Instead of talking about hashing, let’s talk about vector clocks.
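For readers who have not met consistent hashing with virtual nodes, here is a minimal sketch of the idea in Java. It is not JDG or Infinispan code. The class name, the method names, and the node names are hypothetical, and the hash function is a stand-in: it takes String.hashCode() and runs it through the 32-bit finalizer mix used by MurmurHash3, not the full MurmurHash3 algorithm.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Illustrative only: a hash ring where each physical node owns many points.
public class ConsistentHashRing {
    // Ring position -> physical node that owns that position.
    private final TreeMap<Integer, String> ring = new TreeMap<>();
    private final int virtualNodesPerNode;

    public ConsistentHashRing(int virtualNodesPerNode) {
        this.virtualNodesPerNode = virtualNodesPerNode;
    }

    // Stand-in hash (assumption): String.hashCode() plus the MurmurHash3
    // 32-bit finalizer mix, so nearby strings spread out over the ring.
    static int hash(String s) {
        int h = s.hashCode();
        h ^= h >>> 16;
        h *= 0x85ebca6b;
        h ^= h >>> 13;
        h *= 0xc2b2ae35;
        h ^= h >>> 16;
        return h;
    }

    // Place several virtual nodes on the ring for one physical node.
    public void addNode(String node) {
        for (int i = 0; i < virtualNodesPerNode; i++) {
            ring.put(hash(node + "#" + i), node);
        }
    }

    // Remove every virtual node that belongs to the physical node.
    public void removeNode(String node) {
        ring.values().removeIf(node::equals);
    }

    // A key belongs to the first virtual node at or after its hash,
    // wrapping around to the start of the ring if there is none.
    public String ownerOf(String key) {
        if (ring.isEmpty()) {
            throw new IllegalStateException("ring has no nodes");
        }
        Map.Entry<Integer, String> entry = ring.ceilingEntry(hash(key));
        return (entry != null ? entry : ring.firstEntry()).getValue();
    }

    public static void main(String[] args) {
        ConsistentHashRing ring = new ConsistentHashRing(64);
        ring.addNode("node-a");
        ring.addNode("node-b");
        ring.addNode("node-c");

        // Count how many of 1,000 sample keys land on each node.
        Map<String, Integer> counts = new HashMap<>();
        for (int i = 0; i < 1000; i++) {
            counts.merge(ring.ownerOf("key-" + i), 1, Integer::sum);
        }
        System.out.println(counts);
    }
}
```

The point of the design is what happens when a node leaves. Only the keys that the departing node owned are reassigned, and every other key keeps its owner. That is what keeps rebalancing cheap compared with centralized or manual hashing.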

Let’s talk about rebalancing and push / pull implementations.

Of course, that is the benefit of open source software. Users can be engineers. They can understand the implementation by studying the code. That is exactly what I did. I studied the code, I modified the code, I created and submitted patches, and I engaged in discussions with Red Hat engineers on implementation details. With proprietary software, users can only be users.

Asking how fast someone has flown will not reveal how much they know about planes.

Are you familiar with all of the projects that I have been on? I ask because I’m uncertain as to why you would describe them as “stupid POCs”. I do not think that the engineers at CBOE or any of the other organizations that I have collaborated with would appreciate you calling the work that they put into production “stupid POCs”. I know I don’t.


Me

How am I “improving the reliability” of JBoss Data Grid by identifying the functionality that both JBoss Data Grid and Oracle Coherence provide? Do you not believe that JBoss Data Grid has implemented A LOT OF features? The functionality described in this post represents nearly all of the features and benefits listed in the Oracle Coherence data sheet (link). JBoss Data Grid lacks a few features provided by Oracle Coherence. Oracle Coherence lacks a few features provided by JBoss Data Grid. Would you say that Oracle Coherence has not implemented A LOT OF features because it lacks a few features provided by JBoss Data Grid?

Oracle Guest

No! It just had integrated a couple open-source existing technologies into a new ecosystem and productized in a minimum level to take some money from the customers with subscriptions. Nothing really new, innovated, creative or respectable. The type of thing Red Hat likes to do: take existing technologies, combine them and make some money.

Me

What are these open-source, existing technologies that you are referring to? Could they be Infinispan? Of course they exist, Red Hat created them. It would be hard to productize something that does not exist. I find it both disrespectful and insulting to Red Hat engineers to describe their work as not new, innovative, creative, or respectable.  You said “take existing technologies, combine them and make some money”. Interesting. Is that not what Oracle did with Tangosol Coherence? Oracle purchased their data grid. Red Hat created its data grid.

Oracle Guest

You really knows to play with words, starting with the usage of the word “nearly” :)

You forgot some key features that only Coherence has like: Elastic Data (off-heap and SSD storage of data), distributed GC against any type of storage and cache layout, ability to handle thousands of GB being able to handle even terabytes of data. Don’t came say to me that with on-heap allocation and regular JVM like HotSpot (or OpenJDK which is even worse) you could allocate terabytes of data. Native SDKs for C/C++ and .NET, Continuous Queries, support for many AppServers rather than only JBoss, integration with Java EE 6 using @Resource annotation, monitoring and management capabilities both integrated with the product and with other external tools like Enterprise Manager, integration with CEP world to enrich events and being the clustering enabled mechanism to handle fail-over scenarios, security features that could deal with scenarios of authentication, authorization, SSL and load-balancers (Eg: BigIP) integration. Pre-built filters and a powerful query language that could make easier for the developers to interact with the cache instead of force them to write Java code, support for Hibernate, Toplink, EclipseLink, GoldenGate, etc. Thousands of pre-implemented scenario patterns in the product and externally with the incubator strategy started by Tangosol and now owned by Oracle. Oh and of course: support for a high performance serialization strategy and a highly scalable TCP/IP implementation like TCMP. Not mentioning that support for InfiniBand based networks.

Me

I admit that off-heap storage is an interesting concept. However, I question how practical it is. I would not recommend storing a TB of data on a single node with or without off-heap storage. I recommend partitioning physical servers into multiple virtual servers. It increases node portability while reducing the effects (e.g. rebalancing) of adding or removing nodes. JDG supports Java EE integration with both @Resource and @Inject. Does Oracle Coherence not support @Inject? JDG includes management and monitoring as well. JDG clients are smart clients. It is not practical to load balance requests. Can you point me to a list of these “thousands of pre-implemented scenario patterns”? JDG supports both high performance serialization (JBoss Marshalling) and a highly scalable TCP / IP implementation (JGroups). However, JDG does not require developers to write additional code to use high performance serialization unlike Oracle Coherence and Portable Object Format (POF) (link). I will give you InfiniBand, but it may not matter for long (link).

Update: I thought when you referred to off-heap storage that you were referring to off-heap memory. I had not realized that “off-heap storage” is the new marketing term for “disk storage”. It turns out that Elastic Data is marketing for “overflow to disk” (link). This was a feature provided by Ehcache 10 years ago. It’s a feature provided by JDG. You mentioned 1TB of data. However, Elastic Data can only support up to 100GB per node. It is not a persistence solution. It does not support eviction. It should not be used with aggregation (i.e. map / reduce) or entry processors.

Oracle Guest

All of the “unique” features provided by JDG are not considered by real customers, independent analysts like Gartner, Forrester and IDC as really important. Are features that just align with the Red Hat strategy to force its entrance in the Big Data world, which on the other hand is a terrible strategy because to a real Big Data strategy Red Hat lacks A LOT OF technology stacks compared with real Big Data vendors like Oracle, EMC and IBM. Just an example, even Oracle does not consider Coherence as its Big Data strategy. When Oracle talk about Coherence, they’re talking about caching, grid and in-memory computing scenarios, which fits perfectly to elastic data grid technologies.

Me

I find it funny that you justify the lack of features by stating that they are not important to analysts. Will you go on record as stating that Oracle Coherence will never implement JDG features that it lacks? Software evolves. New features become standard features. Personally, I think data grids will continue to incorporate features provided by NoSQL implementations. Eventual consistency comes to mind. What is a “real customer”? Is there another kind of customer? What do these features have to do with Red Hat’s big data strategy? Did you see the Red Hat big data announcement (link)? It was quite clear on what is and what is not our big data strategy. Just as Coherence is not Oracle’s big data strategy, JDG is not Red Hat’s big data strategy. We too place our data grid in the context of in-memory distributed data and parallel processing. I would expect a data grid to “fit perfectly to elastic data grid technologies” as it is, after all, a data grid and one of the defining characteristics of a data grid is that it is elastic. However, there is some overlap between in-memory data grids, NoSQL, and big data platforms. They all distribute data and implement parallel processing. They provide data locality. As such, in-memory data grids fit perfectly inside of broader big data solutions.


More Stories By Daniel Thompson

I curate the content on this page, but the credit goes to my talented colleagues for the posts that you see here. Much of what you read on this page is the work of friends at How to JBoss, and I encourage you to drop by the site at http://www.howtojboss.com for some of the best JBoss technical and non-technical content for developers, architects and technology executives on the Web.
