HaLoop: a modified version of Hadoop that supports efficient iterative data processing on large commodity clusters

Resource Description

Why do we develop the HaLoop project?

The growing demand for large-scale data mining and data analysis applications has led both industry and academia to design new types of highly scalable data-intensive computing platforms. MapReduce and Dryad are two popular platforms in which the dataflow takes the form of a directed acyclic graph of operators. However, these new platforms do not have built-in support for iterative programs, which arise naturally in many applications including data mining, web ranking, graph processing, model fitting, and so on.

What is HaLoop?

Simply speaking, HaLoop = Ha, Loop :-) HaLoop is a modified version of the Hadoop MapReduce framework, designed to serve these applications. HaLoop not only extends MapReduce with programming support for iterative applications, but also dramatically improves their efficiency by making the task scheduler loop-aware and by adding various caching mechanisms. We evaluate HaLoop on real q
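To make the idea of an iterative program concrete, here is a minimal plain-Java sketch of a PageRank-style loop. It does not use the HaLoop or Hadoop APIs; the class name `IterativeJobSketch`, the methods `iterate` and `run`, the damping factor 0.85, and the convergence threshold are all assumptions made for illustration. The point it tries to show is that the link table is the same in every iteration (the kind of loop-invariant data a caching mechanism could keep close to the computation), while only the rank vector changes from pass to pass.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: a single-process stand-in for an iterative map/reduce job.
public class IterativeJobSketch {

    // One pass. Assumes every page appears as a key in `links`.
    static Map<String, Double> iterate(Map<String, List<String>> links,
                                       Map<String, Double> ranks,
                                       double damping) {
        Map<String, Double> sums = new HashMap<>();
        for (String page : links.keySet()) {
            sums.put(page, 0.0);
        }
        // "Map" step: each page sends an equal share of its rank to its targets.
        for (Map.Entry<String, List<String>> e : links.entrySet()) {
            List<String> out = e.getValue();
            if (out.isEmpty()) {
                continue; // dangling pages contribute nothing in this sketch
            }
            double share = ranks.get(e.getKey()) / out.size();
            for (String target : out) {
                sums.merge(target, share, Double::sum);
            }
        }
        // "Reduce" step: combine the received shares into the new rank.
        Map<String, Double> next = new HashMap<>();
        int n = links.size();
        for (Map.Entry<String, Double> e : sums.entrySet()) {
            next.put(e.getKey(), (1 - damping) / n + damping * e.getValue());
        }
        return next;
    }

    // Driver loop: `links` is loop-invariant and reused as-is in every pass;
    // only `ranks` is replaced. Stops on convergence or after maxIterations.
    static Map<String, Double> run(Map<String, List<String>> links,
                                   int maxIterations,
                                   double epsilon) {
        Map<String, Double> ranks = new HashMap<>();
        for (String page : links.keySet()) {
            ranks.put(page, 1.0 / links.size());
        }
        for (int i = 0; i < maxIterations; i++) {
            Map<String, Double> next = iterate(links, ranks, 0.85);
            double delta = 0.0;
            for (Map.Entry<String, Double> e : next.entrySet()) {
                delta += Math.abs(e.getValue() - ranks.getOrDefault(e.getKey(), 0.0));
            }
            ranks = next;
            if (delta < epsilon) {
                break;
            }
        }
        return ranks;
    }

    public static void main(String[] args) {
        Map<String, List<String>> links = new HashMap<>();
        links.put("a", Arrays.asList("b", "c"));
        links.put("b", Arrays.asList("c"));
        links.put("c", Arrays.asList("a"));
        System.out.println(run(links, 50, 1e-9));
    }
}
```

On a plain MapReduce platform, each pass of such a loop would typically be submitted as a separate job that re-reads the unchanged link table, which is the overhead that loop-aware scheduling and caching are meant to reduce.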

File List

haloop
.classpath
.project
.svn
bin
build
build.xml
conf
descendant.sh
ivy
ivy.xml
kmeans.sh
lib
naive.sh
naivecode.sh
naivedescendant.sh
naivepagerank.sh
naivepageranktest.sh
naivesn.sh
pagerank.sh
pageranktest.sh
pc.sh
rebuild.sh
recursive.sh
retest.sh
sample.sh
snetwork.sh
src
test.sh
timediff.sh
wcount.sh