Welcome!

IBM Cloud Authors: Elizabeth White, Pat Romanski, Liz McMillan, Yeshim Deniz, John Esposito

Blog Feed Post

MarkLogic 6: An introduction

By

The latest permeation of MarkLogic (version 6) offers ACID (atomicity, consistency, isolation, durability) transactions, horizontal scaling, real-time indexing, high availability, disaster recovery, government-grade security and built-in search.

MarkLogic has made application development easier with Java and REST APIs. They also added JSON support. This allows developers to use their language of choice, and eliminate the need to learn new production language. It also provides data visualization widgets. These widgets can display the shape and dimensions of data, identify trends or patterns and explore the data as a whole.

Version 6 also comes with built in database analytics. Integration with IBM Cognos and Tableau is included as well. This enables analysts to create reports, dashboards and the data. Lastly, version 6 adds the in-database MapReduce capabilities.

Find out more about MarkLogic version 6 here.

And here is more from their website:

Mission-critical Big Data Applications around the world are powered by MarkLogic. It is the only Enterprise NoSQL database that manages all types of data at scale in real time. It gives you the range of features you need to deliver value. It lets you leverage your existing tools, knowledge, and experience. And it provides a reliable, scalable, and secure platform for your important data.

New Feature Highlights

Business Intelligence Tools

Business Intelligence Tools
Big Data in the enterprise needs to be accessible to everyone who could benefit from the information. To make that easier, MarkLogic now includes out-of-the-box integration with Business Intelligence tools like IBM Cognos and Tableau to allow analysts to use familiar solutions for generating reports, dashboards and data exploration results from data stored in MarkLogic.

 

REST API

REST API
In order to enable developers to work in their language of choice, MarkLogic now includes a REST API that allows you to perform searches, create documents, read documents, update documents, and delete documents. The REST API allows you to build fully functional MarkLogic applications in any programming language. It also allows you to directly load JSON documents.

 

MarkLogic Java API

MarkLogic Java API
The new Java API allows you full-featured access to MarkLogic functionality with pure Java. The MarkLogic Java API is written on top of the REST API, and has all of its functionality such as paginated search with facets and snippets, full document CRUD operations, and more.

 

Enhanced JSON Support

Enhanced JSON Support
Our new JSON library makes it easy to store JSON documents as key-value stores, and to convert them back and forth between JSON and XML. The REST Client API and the MarkLogic Client API for Java make use of this functionality to make it easier to load and work with JSON documents.

 

Visualization Widgets

Visualization Widgets
We’ve also added Visualization Widgets so you can easily build powerful applications that help your users discover the shape and dimensions of data, quickly assess trends and patterns, and explore data more intuitively. You can access these widgets with MarkLogic Application Builder.

 

In-database MapReduce & User Defined Functions

In-database MapReduce & User Defined Functions
Many customers wanted more flexibility to develop complex, real-time analytics. We’ve extended MarkLogic 6 to let you create user-defined aggregate functions (UDFs) that take advantage of MarkLogic’s parallel-processing architecture. We call this In-database MapReduce, and it lets you create blindingly fast analytic functions with custom C++ code by writing “map” and “reduce” functions.

 

In-Database Analytic Functions

In-Database Analytic Functions
In order to enable customers to leverage the power of the MarkLogic platform to produce enterprise-grade analytics, we’ve included several built-in XQuery functions to perform analytic and statistical functions.

 

MarkLogic Content Pump (mlcp)

MarkLogic Content Pump
In order to speed loading and exporting of data between databases, we are introducing the MarkLogic Content Pump (mlcp). mlcp is a command-line tool for loading content into MarkLogic Server and for migrating content from one instance of MarkLogic to another, even if they are on different platforms. If you have a Hadoop cluster, mlcp takes advantage of Hadoop to parallelize the loading. mlcp takes much of the functionality of the open source projects Record Loader and xqsync and bundles them in a single package, and allows them to take advantage of Hadoop if it is available; Hadoop is not required to use mlcp, but is used if it is available.

 

FIPS 140-2 Cryptographic Compliance

FIPS 140-2 Cryptographic Compliance
MarkLogic 6 includes the OpenSSL Federal Information Processing Standard (FIPS) Object Module, which was evaluated by the National Institute of Standards (NIST) for FIPS 140-2 compliance. More details on the OpenSSL Object Module and on the FIPS 140-2 compliance.

 

Search API Enhancements

Search API Enhancements
When you’ve got a lot of data, great search is key. We’ve worked to make our search API even better with the following enhancements:

  • Structured Search
  • Extracting metadata at search time
  • Modify the unconstrained term behavior using <search:term>
  • Range constraints for path range indexes
  • Ability to return values from range indexes with search:values
  • JSON key support

 

Path Range Indexes

Path Range Indexes
In order to enable fine-grained range indexes while maintaining the advantages of using the lexicon functions and the range query constructors MarkLogic now includes support for range index specified by a path. You can specify a subset of XPath as the definition of what goes into an index. The search API can take advantage of path range indexes to create range constraints on them. Path range indexes are also useful when setting up SQL views on data stored in a MarkLogic database.

 

Synonym Search

Synonym Search
You now have the option to ensure that documents that contain multiple synonyms are scored appropriately, rather than unnaturally gaining points to the search score. Learn more about how the architecture of MarkLogic works and how you can deploy it.

Read the original blog entry...

More Stories By Bob Gourley

Bob Gourley writes on enterprise IT. He is a founder and partner at Cognitio Corp and publsher of CTOvision.com

@ThingsExpo Stories
With 15% of enterprises adopting a hybrid IT strategy, you need to set a plan to integrate hybrid cloud throughout your infrastructure. In his session at 18th Cloud Expo, Steven Dreher, Director of Solutions Architecture at Green House Data, discussed how to plan for shifting resource requirements, overcome challenges, and implement hybrid IT alongside your existing data center assets. Highlights included anticipating workload, cost and resource calculations, integrating services on both sides...
Big Data engines are powering a lot of service businesses right now. Data is collected from users from wearable technologies, web behaviors, purchase behavior as well as several arbitrary data points we’d never think of. The demand for faster and bigger engines to crunch and serve up the data to services is growing exponentially. You see a LOT of correlation between “Cloud” and “Big Data” but on Big Data and “Hybrid,” where hybrid hosting is the sanest approach to the Big Data Infrastructure pro...
"We are a well-established player in the application life cycle management market and we also have a very strong version control product," stated Flint Brenton, CEO of CollabNet,, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.
We all know the latest numbers: Gartner, Inc. forecasts that 6.4 billion connected things will be in use worldwide in 2016, up 30 percent from last year, and will reach 20.8 billion by 2020. We're rapidly approaching a data production of 40 zettabytes a day – more than we can every physically store, and exabytes and yottabytes are just around the corner. For many that’s a good sign, as data has been proven to equal money – IF it’s ingested, integrated, and analyzed fast enough. Without real-ti...
I wanted to gather all of my Internet of Things (IOT) blogs into a single blog (that I could later use with my University of San Francisco (USF) Big Data “MBA” course). However as I started to pull these blogs together, I realized that my IOT discussion lacked a vision; it lacked an end point towards which an organization could drive their IOT envisioning, proof of value, app dev, data engineering and data science efforts. And I think that the IOT end point is really quite simple…
A critical component of any IoT project is what to do with all the data being generated. This data needs to be captured, processed, structured, and stored in a way to facilitate different kinds of queries. Traditional data warehouse and analytical systems are mature technologies that can be used to handle certain kinds of queries, but they are not always well suited to many problems, particularly when there is a need for real-time insights.
We're entering the post-smartphone era, where wearable gadgets from watches and fitness bands to glasses and health aids will power the next technological revolution. With mass adoption of wearable devices comes a new data ecosystem that must be protected. Wearables open new pathways that facilitate the tracking, sharing and storing of consumers’ personal health, location and daily activity data. Consumers have some idea of the data these devices capture, but most don’t realize how revealing and...
Unless your company can spend a lot of money on new technology, re-engineering your environment and hiring a comprehensive cybersecurity team, you will most likely move to the cloud or seek external service partnerships. In his session at 18th Cloud Expo, Darren Guccione, CEO of Keeper Security, revealed what you need to know when it comes to encryption in the cloud.
You think you know what’s in your data. But do you? Most organizations are now aware of the business intelligence represented by their data. Data science stands to take this to a level you never thought of – literally. The techniques of data science, when used with the capabilities of Big Data technologies, can make connections you had not yet imagined, helping you discover new insights and ask new questions of your data. In his session at @ThingsExpo, Sarbjit Sarkaria, data science team lead ...
Extracting business value from Internet of Things (IoT) data doesn’t happen overnight. There are several requirements that must be satisfied, including IoT device enablement, data analysis, real-time detection of complex events and automated orchestration of actions. Unfortunately, too many companies fall short in achieving their business goals by implementing incomplete solutions or not focusing on tangible use cases. In his general session at @ThingsExpo, Dave McCarthy, Director of Products...
"delaPlex is a software development company. We do team-based outsourcing development," explained Mark Rivers, COO and Co-founder of delaPlex Software, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.
WebRTC is bringing significant change to the communications landscape that will bridge the worlds of web and telephony, making the Internet the new standard for communications. Cloud9 took the road less traveled and used WebRTC to create a downloadable enterprise-grade communications platform that is changing the communication dynamic in the financial sector. In his session at @ThingsExpo, Leo Papadopoulos, CTO of Cloud9, discussed the importance of WebRTC and how it enables companies to focus...
Is your aging software platform suffering from technical debt while the market changes and demands new solutions at a faster clip? It’s a bold move, but you might consider walking away from your core platform and starting fresh. ReadyTalk did exactly that. In his General Session at 19th Cloud Expo, Michael Chambliss, Head of Engineering at ReadyTalk, will discuss why and how ReadyTalk diverted from healthy revenue and over a decade of audio conferencing product development to start an innovati...
Early adopters of IoT viewed it mainly as a different term for machine-to-machine connectivity or M2M. This is understandable since a prerequisite for any IoT solution is the ability to collect and aggregate device data, which is most often presented in a dashboard. The problem is that viewing data in a dashboard requires a human to interpret the results and take manual action, which doesn’t scale to the needs of IoT.
SYS-CON Events announced today that 910Telecom will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Housed in the classic Denver Gas & Electric Building, 910 15th St., 910Telecom is a carrier-neutral telecom hotel located in the heart of Denver. Adjacent to CenturyLink, AT&T, and Denver Main, 910Telecom offers connectivity to all major carriers, Internet service providers, Internet backbones and ...
CenturyLink has announced that application server solutions from GENBAND are now available as part of CenturyLink’s Networx contracts. The General Services Administration (GSA)’s Networx program includes the largest telecommunications contract vehicles ever awarded by the federal government. CenturyLink recently secured an extension through spring 2020 of its offerings available to federal government agencies via GSA’s Networx Universal and Enterprise contracts. GENBAND’s EXPERiUS™ Application...
IoT generates lots of temporal data. But how do you unlock its value? You need to discover patterns that are repeatable in vast quantities of data, understand their meaning, and implement scalable monitoring across multiple data streams in order to monetize the discoveries and insights. Motif discovery and deep learning platforms are emerging to visualize sensor data, to search for patterns and to build application that can monitor real time streams efficiently. In his session at @ThingsExpo, ...
Verizon Communications Inc. (NYSE, Nasdaq: VZ) and Yahoo! Inc. (Nasdaq: YHOO) have entered into a definitive agreement under which Verizon will acquire Yahoo's operating business for approximately $4.83 billion in cash, subject to customary closing adjustments. Yahoo informs, connects and entertains a global audience of more than 1 billion monthly active users** -- including 600 million monthly active mobile users*** through its search, communications and digital content products. Yahoo also co...
"There's a growing demand from users for things to be faster. When you think about all the transactions or interactions users will have with your product and everything that is between those transactions and interactions - what drives us at Catchpoint Systems is the idea to measure that and to analyze it," explained Leo Vasiliou, Director of Web Performance Engineering at Catchpoint Systems, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York Ci...
"Tintri was started in 2008 with the express purpose of building a storage appliance that is ideal for virtualized environments. We support a lot of different hypervisor platforms from VMware to OpenStack to Hyper-V," explained Dan Florea, Director of Product Management at Tintri, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.