Mark Logic Extends its XML Server

by Bill Ives

MarkLogic Server allows organizations to store,  search, analyze and dynamically deliver XML content. I have written about one application before (see – US Army’s Battle Command Knowledge System (BCKS) Moves to XML-based Platform). Recently, they announced the release of MarkLogic Server 4.0. I spoke with John Kreisa, Director of Product marketing. This new release is their largest so far and contains a number of extensions that we went over. 

But first let me cover the basics. MarkLogic Server is an XML server which provides a software application development platform for creating XML-based content related applications. The XML basis provides greater granularity in database searches and more efficient document delivery than traditional means. It accommodates semi-structured data. John explained that they this is what is often called unstructured data such as narrative but that they prefer the term semi-structure as all data has some structure.  In the US Army application I wrote about earlier, both the speed of access and the granularity of search were key benefits of this approach.

The new release adds to these features in several ways. First, it adds new geospatial capabilities that enable organizations to build location-based applications that search and analyze content based on location information. The new release provides built-in support for popular geospatial data tagging formats such as GML, KML, and GeoRSS/Simple, as well as new geospatial query functions for point, radius, bounding box and polygon constraints. As information consumers and workers become more mobile, the delivery of information in the context of their physical location can greater improve relevance. For example, military personnel operating directly in the battlefield could search for background information relevant to their next mission, just as shoppers can see local places to eat after completing their mission. You can see MarkLogic geospatial bucketing below.

[photopress:geospatial_map.png,full,pp_image]

Release 4.0 also provides built-in support for entity identification and inline markup. This new text mining feature works in 11 languages and identifies 18 different types of entities, including person, organization, location, credit card number, email address, latitude/longitude, date, and time. For example, one query might be to distinguish between Paris Hilton the person and Paris Hilton the hotel to find which is relevant to the query. This works through tags. In this context, Mark Logic is also introducing the Open Enrichment Framework, an initiative created to speed integration with third-party entity extraction engines and other content enrichment tools.

One new feature that I especially liked is co-occurrence analytics that finds and counts pairs of entities in content. This can potentially expose previously unknown relationships and provide useful new insights. The system groups content based on pre-set parameters, such as treatments and side effects.

Co-occurrence would allow, in this example, a medical researcher to determine the common instances of pharmaceuticals and side-effects and then display the results graphically on a map based on frequency. You could also look for pairs of people or the pairing of a person and places.

There are also new alert capabilities. You can be notified of new content that contains the terms, places, etc. that you pre-set. This feature is very scalable. The modular documents feature allows you to re-use content more efficiently. Instead of cutting and pasting portions of a document. This feature has support for XPointer and XInclude, mechanisms for merging XML documents. You simply link back to it so it is store once, link many. This ensures a consistency of content that is especially useful for policies in regulated industries.

John went on to describe their new support for W3C XQuery 1.0. Mark Logic has hundreds of active XQuery-based deployments so this will smooth migration to XQuery 1.0 as the new release provides compatibility modes to ensure interoperability with applications developed with all previous versions. Finally he concluded with the enhancements to the administration functions including the automation of key management activities through scriptable administration and scheduled back-ups, as well as event logging and auditing support.

XML provides a more flexible foundation for content creation, storage, sharing, and searching. I think the new MarkLogic Server 4.0 takes greater advantage of the potential within this format and will even more useful for developing web 2.0 and enterprise 2.0 applications. 

Share:
  • e-mail
  • TwitThis
  • del.icio.us
  • StumbleUpon
  • Digg
  • Reddit
  • SphereIt
  • Facebook
  • Google Bookmarks


No comments yet »

Your comment

Used MarkLogic? Let us know about your experiences with it

HTML-Tags:
<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Additional comments powered by BackType





Custom Search
Online Database Reviews

Be sure to catch Bill Ives' ongoing review series in which he looks at online, sharable database apps. The focus of Bill's reviews: web-based business software that enables companies and individuals to better organize, track, and share information, as well as better manage projects, processes and workflows.

Among the Web-based tools he's reviewed: Zoho, QuickBase, and TrackVia.

Looking for apps that help you and your team get work done?

Check out the AppGap's Appopedia, an ever-expanding section with reviews of more than 150 of today's best tools to help you better manage projects and collaborate. Reviews are presented in a useful directory that breaks down tools by category and function, e.g., online crm, project management, human resources, security, etc. Check it out here.

The AppGap Webinar Series

The AppGap has hosted a series of discussions with leading thinkers and doers intended to illuminate how new apps and approaches are changing the way we work and help companies and individuals implement better collaboration, project management, and productivity practices and solutions. Access, via the links below, the recordings, each about an hour long, of the discussions.

- 5 Big Ideas for Getting All That Work Done
- Should Your Business be Friends with Facebook
- The Future of Work

Email Newsletter icon, E-mail Newsletter icon, Email List icon, E-mail List icon Sign up for our Email Newsletter

Recent Comments

  • hopenic: Oh Enterprise Backup Conundrum > RT @BillIves: @theappgap Perfect Search Addresses Issues in Enterprise Back...
  • BillIves: post on @theappgap Perfect Search Addresses Major Issues in Enterprise Back Up Search http://bit.ly/cIfv2d...
  • EitanSaban: Perfect Search Addresses Major Issues in Enterprise Back Up Search http://bit.ly/cpHFTp This comment was...
  • IdeatoEmpire: Perfect Search Addresses Major Issues in Enterprise Back Up Search http://bit.ly/cgaRSM This comment...
  • Mandar: It is really interesting to watch offers floating from all around to take Coghead’s customers away....
The AppGap is a blog and resource on the future of work and how new tools are addressing age-old challenges of organization, collaboration, and innovation. But it is also an idea: that there remains a gap between the toolset that exists and what's needed...

Can today's project management software be done better? What can online CRM help companies companies accomplish? Which development platform can help individuals and organizations build better online databases, Web based applications, and HR solutions? And what are the processes and best practices that help organizations large and small achieve success. Find out more.

About | Contributor Bios | Blog Policy | Contact us