~~Title: Text Search Engine~~

<html><font color=#990000 size="+2"><b>Text Search Engine</b></font></html>

The StreamScape Text Search Engine is based on [[wp> Apache Lucene]].  It is a shared-index implementation of the popular open source engine that lets users run massively parallel text search and index operations across nodes and machines, similar to the way Elasticsearch does.  StreamScape allows users to drive index and query operations using RPL script and DSQL queries, and to seamlessly blend structured data analysis with text search results.
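
To make the idea of indexed text search concrete, the following is a minimal sketch of an inverted index, the general data structure that Lucene-style engines build on. It is written in plain Python and is not StreamScape's RPL or DSQL syntax, nor Lucene's API; the names ''tokenize'', ''build_index'' and ''search'' and the sample documents are hypothetical and exist only for illustration.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents containing every term in the query."""
    terms = tokenize(query)
    if not terms:
        return set()
    # Start from the first term's postings and intersect with the rest.
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical sample documents, keyed by document id.
docs = {
    1: "Lucene is a text search library",
    2: "Structured data analysis with SQL",
    3: "Text search blended with structured data",
}
index = build_index(docs)
```

With this sketch, ''search(index, "text search")'' matches documents 1 and 3, while ''search(index, "text structured")'' matches only document 3. A real engine adds scoring, analyzers and on-disk segments on top of this basic term-to-document mapping.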

//
Data remains as raw text until it is mined and the information contained within it is harnessed. Mining data to make sense out of it has applications in varied fields of industry and academia. In this article, we explore the best open source tools that can aid us in data mining.


//
Data mining, also known as knowledge discovery in databases, is the process of analysing very large amounts of data and extracting information from it. Data mining can quickly answer business questions that would otherwise take a long time to resolve. Its applications include market segmentation (for example, identifying the characteristics of customers who buy a certain product from a certain brand), fraud detection (identifying transaction patterns that are likely to indicate online fraud), and market basket and trend analysis (finding which products or services are frequently purchased together). This article focuses on the various open source options available and their significance in different contexts.

