首页| JavaScript| HTML/CSS| Matlab| PHP| Python| Java| C/C++/VC++| C#| ASP| 其他|
购买积分 购买会员 激活码充值

您现在的位置是:虫虫源码 > Java > 图:一种高性能的文本挖掘工具,土耳其文本预处理

图:一种高性能的文本挖掘工具,土耳其文本预处理

资 源 简 介

PRETO: A High-performance Text Mining Tool for Preprocessing Turkish Texts Text documents are usually unstructured and written in natural language. To apply conventional data mining techniques on text documents, a preprocessing operation is indispensable. Here, we introduce PRETO, a cross-platform, powerful and scalable preprocessing tool developed specifically for preprocessing Turkish texts, with a wide range of preprocessing options like stemming, stopword filtering, statistical term filtering, and n-gram generation. Source code in Java is available via Subversion at the Source page: http://code.google.com/p/preto/source/checkout PRETO is developed using NetBeans IDE. So we recommend you use it. You can download the executable version from the downloads page:

文 件 列 表

lib
AbsoluteLayout.jar
appframework-1.0.3.jar
swing-worker-1.1.jar
zemberek-cekirdek-2.1.1.jar
zemberek-tr-2.1.1.jar
PRETO.jar
VIP VIP
0.260791s