ssgnc

  • Resource size: 145.97 kB
  • Upload date: 2021-06-30
  • Downloads: 0
  • Views: 0
  • Resource points: 1 point
  • Tags:

Resource Description

Search System for Giga-scale N-gram Corpus

SSGNC is a search system designed for N-gram corpora of around 100 GB. The first version was designed for the Google N-gram Corpus, so SSGNC originally stood for Search System for Google N-gram Corpus. The system now works with other N-gram corpora as well, so the G in SSGNC currently stands for Giga-scale.

The system uses a kind of inverted index to find specified N-grams, but the index structure natively supports only a simple search function that finds N-grams containing one of the given tokens. The system therefore provides filtering functions to find N-grams containing all the given tokens and to handle queries containing wildcards.

Search Features

The latest SSGNC can handle the following kinds of queries.

Unordered: Unordered boolean AND query. A query "A B" matches both "A B" and "B A". N-grams containing t
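The index-then-filter idea in the description can be illustrated with a small Python sketch. This is not SSGNC's actual code or API: the function names (`build_index`, `search_any`, `search_unordered`) and the toy in-memory corpus are assumptions made purely for illustration. The sketch assumes the index can only answer "which N-grams contain at least one of these tokens", and that a separate filter then keeps the N-grams containing every token, regardless of order.

```python
from collections import defaultdict


def build_index(ngrams):
    """Map each token to the set of ids of N-grams that contain it."""
    index = defaultdict(set)
    for ngram_id, ngram in enumerate(ngrams):
        for token in ngram.split():
            index[token].add(ngram_id)
    return index


def search_any(index, tokens):
    """Index-level lookup: ids of N-grams containing at least one token."""
    hits = set()
    for token in tokens:
        hits |= index.get(token, set())
    return hits


def search_unordered(ngrams, index, query):
    """Unordered AND query: filter candidates down to those with every token."""
    tokens = query.split()
    candidates = search_any(index, tokens)
    wanted = set(tokens)
    return sorted(i for i in candidates if wanted <= set(ngrams[i].split()))


# Hypothetical toy corpus, for illustration only.
ngrams = ["A B", "B A", "A C", "B C D"]
index = build_index(ngrams)
matches = [ngrams[i] for i in search_unordered(ngrams, index, "A B")]
```

Here the index lookup for "A B" returns all four N-grams as candidates, and the filter keeps only "A B" and "B A", which is the unordered behavior described above. A real system at the 100 GB scale would keep the index on disk and stream candidates rather than hold sets in memory.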

File List

ssgnc-0.4.6
configure.ac
README
cgi
build-tools
depcomp
ChangeLog
COPYING
NEWS
install-sh
INSTALL
AUTHORS
aclocal.m4
tests
include
Makefile.am
Makefile.in
lib
configure
missing
search-tools