利用持久化方法(JPA),解决了Hadoop的子任务无法共享数据的问题,提出了一个hadoop上的数据挖掘框架,可以完成树型结构。具体实现了DBtree。
下面是ris格式的引文,存盘后,可为endnote等文献管理软件导入。
TY - CONF
JO - Computer and Information Technology, International Conference on
TI - An Efficient Data Mining Framework on Hadoop using Java Persistence API
SN - 978-0-7695-4108-2
SP - 203
EP - 209
A1 - Yang Lai
A2 - Shi ZhongZhi
PY - 2010/06/29
KW - Data Mining
KW - Distributed applications
KW - JPA
KW - ORM
KW - Distributed file systems
KW - Cloud computing
VL - 0
JA - Computer and Information Technology, International Conference on
UR - http://doi.ieeecomputersociety.org/10.1109/CIT.2010.71
ER -
利用JPA做“公共黑板”,解决了数据挖掘中hadoop的子任务无法共享数据的问题,提出了树型结构的高效算法。具体实现了kdtree的hadoop版本。
代码可以在http://svn.javaforge.com/svn/hadoopjpa/HadoopDataMining check out. 需要先注册;如果不能成功,换小写地址。
下面是ris格式的引文,存盘后可为endnote等文献管理软件导入。
TY - CHAP
AU - Lai, Yang
AU - ZhongZhi, Shi
A2 - Shi, Zhongzhi
A2 - Vadera, Sunil
A2 - Aamodt, Agnar
A2 - Leake, David
T1 - An Efficient Data Indexing Approach on Hadoop Using Java Persistence API
T2 - Intelligent Information Processing V
T3 - IFIP Advances in Information and Communication Technology
PY - 2010
PB - Springer Boston
SN -
SP - 213
EP - 224
VL - 340
UR - http://dx.doi.org/10.1007/978-3-642-16327-2_27
DO - 10.1007/978-3-642-16327-2_27
AB - Data indexing is common in data mining when working with high-dimensional, large-scale data sets. Hadoop, a cloud computing project using the MapReduce framework in Java, has become of significant interest in distributed data mining. To resolve problems of globalization, random-write and duration in Hadoop, a data indexing approach on Hadoop using the Java Persistence API (JPA) is elaborated in the implementation of a KD-tree algorithm on Hadoop. An improved intersection algorithm for distributed data indexing on Hadoop is proposed, it performs O(M+logN), and is suitable for occasions of multiple intersections. We compare the data indexing algorithm on open dataset and synthetic dataset in a modest cloud environment. The results show the algorithms are feasible in large-scale data mining.
ER -
一个字体列表
在java环境中有效的windows字体被列出383个,分为正规和斜体两列。
一般程序员需要等宽字体来编程,而且需要区分0(zero)和O。在列表中的第一个为作者推荐的编程字体。
在数学文章的英文编辑中,我们可以要用到这些符号:for all 和 exist,即“对所有”和“存在一个”,以及"is proven"证明与"is true"逻辑蕴涵,在作者推荐的第二、三、四个字体中我们可以使用这些符号。
A list of fonts available for JAVA in windows XP
383 fonts are presented for java in windows, as two columns regular and italic.
A programmer needs monospace fonts for coding, which can distinguish 0(zero) and O. The first font in the list is the preferred by the author.
In English mathematical paper, we have to use this glyphs: for all, and exist, and "is proven", and "is true". The preferred second, the third, the 4th font can be used in your paper.
Total fonts under windows: 383;
First Example: A URL address;
Second Example: Upper and Lower Character, Digits;
Third Example: Mathematical Glyph (if exists);
Forth Example: Chinese Glyph (if exists);
Author: [email protected]
Date: 2010-10-22