Over the past 10 years, there has been a proliferation of near-duplicate documents in enterprise environments. In live implementations, Equivio generally finds that 30-50% of all documents and emails are near-duplicates. This is in addition to the exact duplicates, which typically comprise 10-30% of any repository.

The proliferation of near-duplicates is a result of:

 
the falling cost of physical storage and the ease of information distribution
the widespread use of email and the corresponding ease of distribution
more complex workflows and processes, in which far more people review and correct documents
 

The proliferation of near-duplicates impacts key business processes. This is a problem of process efficiency, rather than data storage costs. Disk space cost does not drive the business case because storage is now very inexpensive. In any case, in most business processes, near-duplicates are not deleted.

So what is the value proposition?
The answer to this question is straightforward: Equivio's ability to group documents into sets of near-duplicates makes information much more compact and manageable. The near-duplicate sets make it much easier and quicker to find and review the documents you are interested in. Given the explosion of document volumes and the increasing costs of review, Equivio's near-duplicate detection technology is compelling in all human-centric, content-intensive situations where people need to make sense of large repositories of documents.
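
To make the idea of grouping documents into near-duplicate sets concrete, here is a minimal sketch in Python. It is not Equivio's algorithm, which is not described here. It assumes a generic textbook approach: compare documents by the overlap (Jaccard similarity) of their two-word sequences and place documents that exceed a threshold in the same set. The function names, the shingle size `k`, and the `threshold` value are all illustrative assumptions.

```python
from itertools import combinations


def shingles(text, k=2):
    """Return the set of k-word sequences (shingles) in text, lowercased."""
    words = text.lower().split()
    if len(words) < k:
        return {tuple(words)}
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets: size of intersection over size of union."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def group_near_duplicates(docs, threshold=0.5, k=2):
    """Group document indices whose shingle sets are similar enough.

    Illustrative only: compares every pair, so it suits small collections.
    """
    parent = list(range(len(docs)))

    def find(i):
        # Follow parent links to the representative of i's group.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    sets = [shingles(d, k) for d in docs]
    for i, j in combinations(range(len(docs)), 2):
        if jaccard(sets[i], sets[j]) >= threshold:
            parent[find(j)] = find(i)  # merge the two groups

    groups = {}
    for i in range(len(docs)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


docs = [
    "the quarterly report is due on friday at noon",
    "the quarterly report is due on monday at noon",
    "please book a meeting room for the design review",
]
print(group_near_duplicates(docs))
```

In this toy example the first two documents differ by a single word and land in the same set, while the third stays on its own. A reviewer would then need to read one representative per set rather than every document.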

   
   

© 2004-2008 Equivio. All rights reserved.