[ 1 ]
Learning Data Mining
with Python
Harness the power of Python to analyze data and
create insightful predictive models
Robert Layton
BIRMINGHAM - MUMBAI
Learning Data Mining with Python
Copyright © 2015 Packt Publishing
All rights reserved. No part of this book may be reproduced, stored in a retrieval
system, or transmitted in any form or by any means, without the prior written
permission of the publisher, except in the case of brief quotations embedded in
critical articles or reviews.
Every effort has been made in the preparation of this book to ensure the accuracy
of the information presented. However, the information contained in this book is
sold without warranty, either express or implied. Neither the author, nor Packt
Publishing, and its dealers and distributors will be held liable for any damages
caused or alleged to be caused directly or indirectly by this book.
Packt Publishing has endeavored to provide trademark information about all of the
companies and products mentioned in this book by the appropriate use of capitals.
However, Packt Publishing cannot guarantee the accuracy of this information.
First published: July 2015
Production reference: 1230715
Published by Packt Publishing Ltd.
Livery Place
35 Livery Street
Birmingham B3 2PB, UK.
ISBN 978-1-78439-605-3
www.packtpub.com
Credits
Author
Robert Layton
Reviewers
Asad Ahamad
P Ashwin
Christophe Van Gysel
Edward C. Delaporte V
Commissioning Editor
Taron Pereira
Acquisition Editor
James Jones
Content Development Editor
Siddhesh Salvi
Technical Editor
Naveenkumar Jain
Copy Editors
Roshni Banerjee
Trishya Hajare
Project Coordinator
Nidhi Joshi
Proofreader
Sas Editing
Indexer
Priya Sane
Graphics
Sheetal Aute
Production Coordinator
Nitesh Thakur
Cover Work
Nitesh Thakur
About the Author
Robert Layton has a PhD in computer science and has been an avid Python
programmer for many years. He has worked closely with some of the largest
companies in the world on data mining applications for real-world data and has
also been published extensively in international journals and conferences. He has
extensive experience in cybercrime and text-based data analytics, with a focus
on behavioral modeling, authorship analysis, and automated open source
intelligence. He has contributed code to a number of open source libraries,
including the scikit-learn library used in this book, and was a Google Summer
of Code mentor in 2014. Robert runs a data mining consultancy company called
dataPipeline, providing data mining and analytics solutions to businesses in a
variety of industries.