首页| JavaScript| HTML/CSS| Matlab| PHP| Python| Java| C/C++/VC++| C#| ASP| 其他|
购买积分 购买会员 激活码充值

您现在的位置是:虫虫源码 > 其他 > 生成测试和基准应用程序需要大数据集的合成网站实体日志数据。

生成测试和基准应用程序需要大数据集的合成网站实体日志数据。

资 源 简 介

MalGen Is a set of scripts which generate large, distributed data sets suitable for testing and benchmarking software designed to perform parallel processing on large data sets. The data sets can be thought of as site-entity log files. After an initial seeding, the scripts allow for the data generation to be initiated from a single central node to run the generation concurrently on multiple remote nodes of the cluster. The data generated follows certain statistical distributions which we believe presents a usable model for such logs. There are two intended uses for MalGen 1. is to generate a large, possibly distributed, data set for use with analytics. 1. is to generate data for use with benchmarking algorithms or applications. With the first use, records are generated probabilistically and extra records may be produced so that the entire data set follows the specified distribution. With the second use

文 件 列 表

malstone-googlecode-0_8_2
._.DS_Store
.DS_Store
hadoop
._malstone-TR-09-01.pdf
malstone-TR-09-01.pdf
sector
VIP VIP
0.238972s