Large Scale Machine Learning with Spark

Data processing, implementing related algorithms, tuning, scaling up, and finally deploying are some of the crucial steps in optimising any application. Spark can handle large-scale batch and streaming data, decide when to cache data in memory, and process it up to 100 times faster than Hadoop-based MapReduce. This means predictive analytics can be applied to both streaming and batch data to develop complete machine learning (ML) applications much more quickly, making Spark an ideal candidate for large data-intensive applications. This book focuses on design engineering and scalable solutions using ML with Spark. First, you will learn how to install Spark with all the new features from the latest Spark 2.0 release. Moving on, you'll explore important concepts such as advanced feature engineering with RDDs and Datasets. After studying how to develop and deploy applications, you will see how to use external libraries with Spark.

Large Scale Machine Learning with Spark

Discover everything you need to build robust machine learning applications with Spark 2.0

Md. Rezaul Karim

Md. Mahedi Kaysar

BIRMINGHAM - MUMBAI

transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the authors, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: October 2016

Production reference: 1201016

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham

B3 2PB, UK.

ISBN 978-1-78588-874-8

www.packtpub.com

Credits

Authors: Md. Rezaul Karim, Md. Mahedi Kaysar
Reviewer: Muthukumar Subramanian
Commissioning Editor: Akram Hussain
Acquisition Editor: Lester Frias
Content Development Editor: Amrita Noronha
Technical Editor: Akash Patel
Copy Editor: Safis Editing
Project Coordinator: Shweta H Birwatkar
Proofreader: Safis Editing
Indexer: Aishwarya Gangawane
Graphics: Disha Haria
Production Coordinator: Arvindkumar Gupta

About the Authors

Md. Rezaul Karim has more than 8 years of experience in research and development, with a solid knowledge of algorithms and data structures, focusing on C, C++, Java, R, and Python, and on big data technologies such as Spark, Kafka, DC/OS, Docker, Mesos, Hadoop, and MapReduce.

He was first enchanted by machine learning back in 2010, while studying an Advanced Artificial Intelligence post-graduate course, where he applied Hadoop-based MapReduce and machine learning together for market basket analysis on large-scale business-oriented transactional databases. Consequently, his research interests include machine learning, data mining, the Semantic Web, big data, and bioinformatics. He has published more than 30 research papers in renowned peer-reviewed international journals and conferences focusing on data mining, machine learning, and bioinformatics, with good citations.

He is a Software Engineer and Researcher currently working as a PhD Researcher at the Insight Centre for Data Analytics, Ireland (the largest data analytics center in Ireland and the largest Semantic Web research institute in the world). He is also a PhD candidate at the National University of Ireland, Galway. He holds an ME (Master of Engineering) degree in Computer Engineering from Kyung Hee University, Korea, majoring in data mining and knowledge discovery, and a BS (Bachelor of Science) degree in Computer Science from the University of Dhaka, Bangladesh.
