Build it.
Synthesysâ„¢ is our Software Development Kit (SDK), which makes using knowledge derived from unstructured data as simple and reliable as using a conventional database for structured data.
Need to drop in an analytics engine? This is it.
Unlike most solutions that focus on returning a large batch of relevant documents, we focus on the entities within all of the documents ( i.e. the people, places, and things) and visualize the connections across all of the data. This allows analysts to see “patterns in haystacks” of data and allow them to seamlessly pivot between key actors and events across massive amounts of unstructured data.
We bring two technologies together to aid in discovery. The first is a top-down learning system that provides entity extraction and categorization. Entity extraction lets you designate categories (such as people and locations) and then automatically marks entities in the document that fit those categories. This isn’t by using lists, but by looking at how the terms are used in language. This means it will find People and Locations it has never seen before.
The second is a system that finds the semantic associations to a term. An association is a related term. It may be a variation of a person’s name -such as an alias. It may be a property of an entity or it may be a related entity. All together, this technology (covered by US Patent 7,249,117) gives the system a more human-like ability to learn what words mean based on how they are used. The process is fully automated and requires no a priori knowledge or models. Just give it data, and the system bootstraps the rest.
Together these technologies help to summarize massive amounts of data by resolving key entities, enabling Identity Intelligence solutions such as discovering implied social networks, and aiding analysts in making key inferences.
Discovery starts with a given concept and the system gives you the interactions with related entities and events. If you are interested in the Energy industry you could do a search for “nuclear power plants,†but what if the document you want has it written as “uranium reprocessing facility†or mentions “spent rodsâ€? Not only will Digital Reasoning find these hidden gems, it will summarize what they mean and show key connections between concepts – plants and technologies, plants and locations, etc.