~~Title: Cognitive Autoation - Find Nearest~~

<html><font color=#990000 size="+2"><b>Cognitive Automation: FIND NEAREST</b></font></html>

StreamScape provides rich semantic analysis capabilities through the Smemantic Graoh Service using FIND NEAREST service call.

====Similarity Serach (Synonymity)====

The examples below use a Semantic Graph of news articles taken from the popular Google Workset.  It contaions several million articles that have ben converted into a Semantic Graph by the Semagraph Service:

<sxh rpl; gutter: false;>

admin@TestNode12_nlp:TXSemagraph.Generator> find nearest to 'dirty'
 
Word        Similarity          Word Frequency  Docs Count
----------  ------------------  --------------  ----------
filthy      0.8449857234954834  2606.0          2533
unclean     0.7724915742874146  382.0           381
grubby      0.7435568571090698  699.0           688
stained     0.7204205393791199  2548.0          2482
gross       0.7139873504638672  905.0           893
dusty       0.7084083557128906  823.0           815
disgusting  0.7057594060897827  2469.0          2427
moldy       0.703738808631897   421.0           420
soiled      0.7032114863395691  199.0           196
mouldy      0.6908437013626099  325.0           321
 
Time: 31 (ms).
 
admin@TestNode12_nlp:TXSemagraph.Generator> find nearest to 'clean'
 
Word          Similarity           Word Frequency  Docs Count
------------  -------------------  --------------  ----------
spotless      0.6618425250053406   5005.0          4990
immaculate    0.6572406888008118   2273.0          2270
cleanthe      0.475935161113739    19.0            19
compact       0.4754776060581207   1157.0          1149
cleanand      0.4696241021156311   11.0            11
cleanish      0.4605141878128052   29.0            29
stylish       0.46049776673316956  2965.0          2926
pristine      0.45799362659454346  325.0           324
comefortable  0.45054948329925537  2.0             2
superclean    0.44967395067214966  10.0            10
 
Time: 23 (ms).
 
admin@TestNode12_nlp:TXSemagraph.Generator> find nearest to 'stylish'
 
Word           Similarity          Word Frequency  Docs Count
-------------  ------------------  --------------  ----------
tasteful       0.8143267035484314  781.0           779
sleek          0.801754891872406   672.0           669
chic           0.7923199534416199  2056.0          2021
elegant        0.788985550403595   3278.0          3241
minimalistic   0.7847214937210083  125.0           125
classy         0.7688407897949219  1774.0          1749
contemporary   0.7668522000312805  1987.0          1970
sophisticated  0.7547444105148315  502.0           498
minimalist     0.7511155605316162  642.0           633
modern         0.7378312945365906  19027.0         18733
 
admin@TestNode12_nlp:TXSemagraph.Generator> find nearest to 'tasteful'
 
Word          Similarity          Word Frequency  Docs Count
------------  ------------------  --------------  ----------
contemporary  0.8135494589805603  1987.0          1970
minimalistic  0.8077958822250366  125.0           125
sleek         0.8075861930847168  672.0           669
minimalist    0.7911320328712463  642.0           633
stylish       0.7881219387054443  2965.0          2926
elegant       0.7720562219619751  3278.0          3241
simplistic    0.7708852887153625  28.0            28
modern        0.7693807482719421  19027.0         18733
styling       0.7643940448760986  174.0           173
topline       0.7609306573867798  2.0             2
 
</sxh>