Products

SlipStream Disambiguation Engine

SlipStream is an entity resolution and disambiguation tool designed to be used with the Twister Data Framework.

The SlipStream entity resolution and disambiguation process locates and merges records that represent the same real-world entity. The result is a streamlined data store in which each real-world entity is represented by a single merged record.

An average-size text document can produce hundreds of entities. A collection of text documents can produce thousands. Often, many of these extracted entities refer to the same real-world entity or capture only a partial view of it. Standard data stores can quickly become overloaded with duplicate entities that are not useful on their own.
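
To make the idea of merging duplicate and partial entities concrete, here is a minimal Python sketch. It is not SlipStream's actual algorithm or API: the function names (normalize, match_and_merge), the record fields, and the matching rule (an exact match on a normalized name) are all assumptions chosen for illustration.

```python
from collections import defaultdict

def normalize(name):
    """Build a crude match key: lowercase and collapse whitespace (illustrative rule only)."""
    return " ".join(name.lower().split())

def match_and_merge(records):
    """Group records that share a match key and merge their fields into one entity."""
    merged = defaultdict(dict)
    for record in records:
        key = normalize(record["name"])
        for field, value in record.items():
            # Keep the first non-empty value seen for each field.
            if value and field not in merged[key]:
                merged[key][field] = value
    return list(merged.values())

# Hypothetical extracted entities: two partial views of one person, plus another person.
extracted = [
    {"name": "Jane Doe", "employer": "Acme Corp", "city": ""},
    {"name": "jane  doe", "employer": "", "city": "Denver"},
    {"name": "John Roe", "employer": "Initech", "city": "Austin"},
]
entities = match_and_merge(extracted)
```

In this sketch the two partial "Jane Doe" records collapse into one entity that carries both the employer and the city. A real resolver would likely use fuzzier matching and conflict-handling rules than this exact-key approach.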

By working with the Twister Data Framework, which uses software that partitions data across a parallel framework, multiple SlipStream engines can perform the complex match-and-merge process in parallel across hundreds of nodes. Because multiple databases run in parallel, there is no table-scan bottleneck, and new data sources can be added as the amount of data increases without an inherent slowdown over time.
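
The partition-then-merge pattern described above can be sketched in Python as follows. This is an assumption-laden illustration, not the Twister Data Framework or SlipStream interface: the function names, the CRC32-based partitioning, and the use of local threads as stand-ins for cluster nodes are all hypothetical.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor

def match_key(record):
    """Illustrative match key: lowercase name with collapsed whitespace."""
    return " ".join(record["name"].lower().split())

def partition(records, num_partitions):
    """Route each record to a partition by a stable hash of its match key,
    so that candidate duplicates always land in the same partition."""
    parts = [[] for _ in range(num_partitions)]
    for record in records:
        index = zlib.crc32(match_key(record).encode("utf-8")) % num_partitions
        parts[index].append(record)
    return parts

def merge_partition(records):
    """Match and merge within one partition; no other partition is consulted."""
    merged = {}
    for record in records:
        entity = merged.setdefault(match_key(record), {})
        for field, value in record.items():
            # Keep the first non-empty value seen for each field.
            if value and field not in entity:
                entity[field] = value
    return list(merged.values())

def resolve_in_parallel(records, num_partitions=4):
    """Run one merge worker per partition (threads stand in for nodes here)."""
    parts = partition(records, num_partitions)
    with ThreadPoolExecutor(max_workers=num_partitions) as pool:
        results = pool.map(merge_partition, parts)
    return [entity for part in results for entity in part]

# Hypothetical input: two partial views of one person, plus another person.
records = [
    {"name": "Jane Doe", "employer": "Acme Corp", "city": ""},
    {"name": "jane  doe", "employer": "", "city": "Denver"},
    {"name": "John Roe", "employer": "Initech", "city": "Austin"},
]
resolved = resolve_in_parallel(records)
```

Because records with the same key always hash to the same partition, each worker can merge its share without scanning the full data set, which is the property the paragraph above attributes to running multiple engines in parallel.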