首页| JavaScript| HTML/CSS| Matlab| PHP| Python| Java| C/C++/VC++| C#| ASP| 其他|
购买积分 购买会员 激活码充值

您现在的位置是:虫虫源码 > 其他 > sketchsort

sketchsort

  • 资源大小:8.71 MB
  • 上传时间:2021-06-30
  • 下载次数:0次
  • 浏览次数:0次
  • 资源积分:1积分
  • 标      签: SketchSort

资 源 简 介

Introduction SketchSort(1) is a software for all pairs similarity search. It takes as an input data points and outputs approximate neighbor pairs within a distance (1.0 - cosine similarity). First, the input data points are mapped to binary strings (sketches) by locality sensitive hashing, and then neighbor pairs of strings within a Hamming distance are enumerated by the multiple sorting method (2). Finally, the cosine distances for such neighbor pairs are calculated. If the cosine distance for a neighbor pair is no more than a user-specified threshold , the neighbor pair is output. One might worry about missed nearest neighbor pairs by SketchSort. A theoretical lower bound of the expectation of missing edge ratio is derived. It enables us to set parameters so as to limit the empirical missing edge ratio as small as possible. Quick Start To compile SketchSort , please type the followings: tar -xjvf sketchsort

文 件 列 表

sketchsort-0.0.8
.lock-wscript
.svn
.waf-1.5.11-e59024de37e231036fc8c065a6fb3048
dat
src
VIP VIP
0.180730s