by Bill IvesMay 17, 2011 at 3:24 am · Filed under Web 2.0

There is a growing market in the eDiscovery and compliance space as organizations amass a growing and vast amount of content in an increasing variety of formats. I recently spoke with Index Engines about their capabilities. They have developed the means to look into a variety of backup formats that have previously been difficult to deal with. Index Engines recently announced that the Index Engines Collection Engine now works with EMC Data Domain deduplication storage systems and leverages existing backup processes to automatically identify and extract specific files and email for regulatory, compliance and legal applications. They also work with a number of other formats.
As they pointed out, only a small subset of the data captured in the backup process is of value for long-term access. To filter down the volume of data that is archived, detailed knowledge of the backup images is required. Index Engines Collection Engine automatically indexes backup images, identifies the useful content, collects what is relevant and writes it back to Data Domain storage making it available for compliance and litigation purposes.
The Index Engines Collection Engine for Data Domain indexes the content of backup images so that they can be searched and analyzed for business relevance. These searches can be high-level metadata such as user mailboxes, or detailed queries based on file or email content, location and date ranges. Searches are saved as stored queries that run automatically once a new backup is executed.
The relevant set of data that is identified within the backup image is extracted into a Collection image and written back to the Data Domain system. This allows a small subset, typically less than 5% of the backup data, to be retained for long-term access and also takes advantage of the Data Domain deduplication technology. Specific user files or email can be extracted, keeping all metadata intact, for compliance with legal and regulatory requirements. Here is sample of the Index Engine interface.

This issue will only grow in importance as organizations are generating an ever-increasing amount of data. The introduction of more unstructured data through internal social media only makes it a more massive job. In addition, Index Engines indicated that courts and regulatory agencies are becoming more demanding as the technology for eDiscovery becomes more accessible. They used to give organizations more of break because the process was so expense. Now that progress has been made and costs are coming down this tolerance is tightening up. This is certainly a growth field that serves a very useful purpose.
Operating in real time, the Darwin Awareness Engineâ„¢ allows for the efficient scanning of content to find both breaking news and underlying casual patterns in the topics of your interest. Rather than using semantic technology to attempt to...
ContinueCoveo currently has a three-part product strategy. There is the core search platform, Coveo Enterprise Search 6.2, the company’ s enterprise -grade informatio n access solution, used by the likes of Lockheed Martin, GEICO and YUM! Brands; Coveo...
ContinueThe need for fast and more effective enterprise search continues to increase as data grows exponentia lly. I spoke with Ken Ebert, form Perfect Search, about this problem. According to Gartner, enterprise data growth over the next five years is...
ContinueDarwin Ecosystem offers an awareness engine for content discovery within the enterprise and on the web. In contrast to a search engine that goes after answers to known questions, an awareness engine aggregates and displays content themes in real...
ContinueCoveo offers enterprise search technology including their enterprise search modules and search-bas ed applicatio ns for such functions as call centers and litigation & compliance . It has now launched a free, entry-leve l enterprise search...
ContinueI have written about Attivio on the AppGap before (see Atti vio Tightly Integrates Structured Data and Unstructur ed Content for a New Approach to Informatio n Access and Attiv io on Some Potential Winners in our New Economic World). Recently,...
ContinueThe introducti on of Web 2.0 social media into the enterprise creates large amounts of potentiall y useful informatio n about the social side of business processes. Â The key is gaining awareness and access to this social intelligen ce. Paula...
ContinueEquivio recently introduced Equivio> ;Relevance â„¢, an expert-gui ded system that enhances the eDiscovery process through automated document prioritiza tion. Traditiona lly, people, such as attorneys, engaged in intense investigat ions use...
Continue I have written about Exalead before, see Exalead’ s CloudView Offers Integrated Search Capabiliti es. They have a web search tool like Google that is used in widely in France and other parts of Europe. The CloudView product looked inside the...
Continue Recommind recently announced MindServer Searchâ„¢ 6.0, the latest version of its flagship enterprise search product. It is built on Recommindâ €™s COREâ„¢ platform (Context Optimized Relevancy Engine). This latest version adds features such as...
ContinueAs more channels of informatio n open, search becomes more complex as the opportunit y for informatio n silos increases. I recently spoke with Paul Doscher, US CEO of Exalea d. Exalead has introduced its CloudView product line to address this...
ContinueKnowledge Plaza is a Web-based platform for enterprise search, social bookmarkin g, knowledge management , informatio n brokerage and expert identifica tion. It was developed by the Belgium based firm, Whatever. I recently spoke with Olivier...
ContinueI recently wrote about Attivio. It offers the Active Intelligen ce Engineâ„¢ (AIE) for Unified Informatio n Access, bringing together business intelligen ce and enterprise search capabiliti es. (see Attivio Tightly Integrates Structured Data and...
ContinueA number of search engines have attempted to tackle the semantic web concept with varying degrees of success, some came out before web 2.0 emerged. In each of the cases that I am familiar with, they used complex search algorithms to attempt to...
ContinueThe key word approach has dominated search in the consumer web. Google is my default launch page for the web. However, within the enterprise this approach is not always sufficient . People do not always know what they are looking for and, even if...
ContinueInsideView recently launched a new offering, SalesView, an on-demand Business Search and Intelligen ce applicatio n, designed to bring insight gained from subscripti on-based and user-gener ated sources to the enterprise . It integrates with many...
Continue