Categorical Data Analysis by Example

Introduces the key concepts in the analysis of categoricaldata with illustrative examples and accompanying R code This book is aimed at all those who wish to discover how to analyze categorical data without getting immersed in complicated mathematics and without needing to wade through a large amount of prose. It is aimed at researchers with their own data ready to be analyzed and at students who would like an approachable alternative view of the subject. Each new topic in categorical data analysis is illustrated with an example that readers can apply to their own sets of data. In many cases, R code is given and excerpts from the resulting output are presented. In the context of loglinear models for crosstabulations, two specialties of the house have been included: the use of cobweb diagrams to get visual information concerning significant interactions, and a procedure for detecting outlier category combinations. The R code used for these is available and may be freely adapted. In addition, this book: • Uses an example to illustrate each new topic in categorical data • Provides a clear explanation of an important subject • Is understandable to most readers with minimal statistical and mathematical backgrounds • Contains examples that are accompanied by R code and resulting output • Includes starred sections that provide more background details for interested readers Categorical Data Analysis by Example is a reference for students in statistics and researchers in other disciplines, especially the social sciences, who use categorical data. This book is also a reference for practitioners in market research, medicine, and other fields. GRAHAM J. G. UPTON is formerly Professor of Applied Statistics, Department of Mathematical Sciences, University of Essex. Dr. Upton is author of The Analysis of Crosstabulated Data (1978) and joint author of Spatial Data Analysis by Example (2 volumes, 1995), both published by Wiley. He is the lead author of The Oxford Diction
CATEGORICAL DATA ANALYSIS BY EXAMPLE www.allitebooks.com www.allitebooks.com CATEGORICAL DATA ANALYSIS BY EXAMPLE GRAHAM J G. UPTON WILEY www.allitebooks.com Copyright o 2017 by John Wiley sons, Inc. All rights reserved Published by John Wiley sons, InC, Hoboken, New Jersey Published simultaneously in Canada No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate percopy fee to the Copyright Clearance Center, InC, 222 Rosewood Drive, Danvers, MA 01923, (978)7508400, fax (978)7504470,oronthewebatwww.copyright.com.RequeststothePublisherforpermissionshould be addressed to the Permissions Department, John Wiley sons, Inc, Ill River Street, Hoboken, N 07030,(201)7486011,fax(201)7486008,oronlineathttp://www.wiley.com/go/permission Limit of liability/Disclaimer of Warranty: While the publisher and author have used their best efforts in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose. No warranty may be created or extended by sales representatives or written sales materials. The advice and strategies contained herein may not be suitable for your situation. You should consult with a professional where appropriate. Neither the publisher nor author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages For general information on our other products and services or for technical support, please contact our Customer Care Department within the United States at(800)7622974, outside the United States at (317)5723993 or fax(317)5724002 Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic formats. For more information about Wiley products, visit our web site at www.wiley.com Library of Congress CataloginginPublication Data Names: Upton, Graham J G, author Title: Categorical data analysis by example/ Graham J.G. Upton Description Hoboken, New Jersey: John Wiley sons, 2016. Includes index Identifiers: LCCN 2016031847(print)I LCCn 2016045176(ebook) isbn9781119307860(cloth) IsBN9781119307914( pdf) ISBN9781119307938(epub) Subjects: LCSH: Multivariate analysis. I Loglinear models Classification: LCC QA278 U68 2016(print)I LCC QA278(ebook)I DDC 519.5/35dc23 Lcrecordavailableathttps://ccn.loc.gov/2016031847 Printed in the united states of america 10987654321 www.allitebooks.com CONTENTS PREFACE ACKNOWLEDGMENTS 1 NTRODUCTION 1.1 What are Categorical data? 1.2 A Typical Data set 2 1.3 Visualization and CrossTabulation 3 1.4 Samples, Populations, and random Variation 4 1.5 Proportion, Probability, and Conditional Probability 5 1.6 Probability distributions 6 1.6.1 The Binomial distribution 6 1. 6.2 The Multinomial distribution 7 1. 6. 3 The Poisson distribution 7 1. 6. 4 The normal distribution 7 1.6.5 The ChiSquared(x)Distribution 8 1. 7 *The Likelihood 9 2 ESTIMATION AND INFERENCE FOR CATEGORICAL DATA 2.1 Goodness of fit 11 www.allitebooks.com I CONTENTS 2.1.1 Pearson's x GoodnessofFit Statistic 11 2.1.2 *The Link between X and the poisson and x2Distributions 12 2.1.3 The LikelihoodRatio GoodnessofFit Statistic 2.1.4* Why the G and X Statistics Usually Have Similar Values 14 2.2 Hypothesis Tests for a Binomial Proportion Large Sample) 14 2. 2. 1 The Normal score Test 15 2.2.2 *Link to pearson's X GoodnessofFit Test 15 2.2.3 G2 for a Binomial Proportion 15 2.3 Hypothesis Tests for A Binomial Proportion(Small Sample)16 2.3.1 OneTailed Hypothesis Test 16 2.3.2 TwoTailed Hypothesis Tests 18 2.4 Interval Estimates for A Binomial Proportion 18 2.4.1 Laplaces method 19 2.4.2 Wilsons Method 19 2.4.3 The AgrestiCoull Method 20 2.4.4 Small Samples and Exact Calculations 2( R ferenc 2 3 THE 2 X 2 CONTINGENCY TABLE 25 3.1 Introduction 25 3.2 Fisher's Exact Test(For Independence) 27 3.2.1 Derivation of the exact Test formula 28 3.3 Testing Independence with Large Cell Frequencies 29 3.3.1 USing Pearson,s GoodnessofFit Test 30 3.3.2 The Yates Correction 30 3.4 The 2x2 Table in a medical Context 32 3.5 Measuring Lack of Independence( Comparing Proportions )34 3.5.1 Difference of Proportions 35 3.5.2 Relative risk 36 3.5.3 OddsRatio 37 References 40 www.allitebooks.com CONTENTS VIl 4 THEXJ CONTINGENCY TABLE 4.1 Notation 41 4.2 Independence in the l J Contingency Table 42 4.2.1 Estimation and degrees of freedom 42 4.2.2 OddsRatios and Independence 43 4.2.3 Goodness of fit and lack of fit of the Independence model 43 4.3 Partitioning 46 4.3.1 Additivity of G 46 4.3.2 Rules for partitioning 4 4.4 Graphical Displays 49 4.4.1 Mosaic Plots 49 4.4.2 Cobweb diagrams 50 4.5 Testing Independence with Ordinal Variables 52 References 54 5 THE EXPONENTIAL FAMILY 55 5.1 Introduction 55 5.2 The Exponential Family 56 5.2.1 The Exponential Dispersion Family 57 5.3 Components of a general Linear Model 57 5.4 Estimation 58 References 59 6 A MODEL TAXONOMY 61 6.1 Underlying Questions 61 6.1.1 Which Variables are of Interest? 61 6.1.2 What Categories should be used? 61 6.1.3 What is the Type of Each Variable?62 6.1. 4 What is the Nature of each variable? 62 6.2 Identifying the Type of Model 63 7 THE2 XJ CONTINGENCY TABLE 65 7.1 A Problem with X(and G)65 7. 2 USing the logit 66 www.allitebooks.com v CONTENTS 7.2.1 Estimation of the Logit 67 7. 2. 2 The null model 68 7.3 Individual data and grouped data 69 7.4 Precision Confidence Intervals and Prediction Intervals 73 7.4.1 Prediction intervals 74 7.5 Logistic Regression with a Categorical Explanatory Variable 76 7.5.1 Parameter Estimates with Categorical variables (>2)78 7.5.2 The dummy Variable Representation of a Categorical Variable 79 References 80 8 LOGISTIC REGRESSION WITH SEVERAL EXPLANATORY VARIABLES 81 8.1 Degrees of freedom when there are no interactions 81 8.2 Getting a Feel for the data 83 8.3 Models with twoVariable interactions 85 8.3.1 Link to the Testing of Independence between Two Variables 87 9 MODEL SELECTION AND DIAGNOSTICS 89 9.1 Introduction 89 9.1.1 Ockham's razor 90 9.2 Notation for Interactions and for models 91 9.3 Stepwise Methods for Model Selection Using G 92 9.3.1 Forward Selection 94 9.3.2 Backward Elimination 96 9.3.3 Complete stepwise 98 9.4 AIC and Related Measures 98 9.5 The Problem Caused by rare Combinations of Events 100 9.5.1 Tackling the Problem 101 9.6 Simplicity versus Accuracy 103 9.7 DFBETAS 105 References 107 www.allitebooks.com
 5.98MB
Categorical Data Analysis
20170118Praise for the Second Edition: 'A musthave book for anyone expecting to do research and/or applications in categorical data analysis'. ('Statistics in Medicine'). 'It is a total delight reading this book'. ('Pharmaceutical Research'). 'If you do any analysis of categorical data, this is an essential desktop reference'. ('Technometrics'). The use of statistical methods for analyzing categorical data has increased dramatically, particularly in the biomedical, social sciences, and financial industries. Responding to new developments, this book offers a comprehensive treatment of the most important methods for categorical data analysis. 'Categorical Data Analysis, Third Edition' summarizes the latest methods for univariate and correlated multivariate categorical responses. Readers will find a unified generalized linear models approach that connects logistic regression and Poisson and negative binomial loglinear models for discrete data with normal regression for continuous data. This edition also features: an emphasis on logistic and probit regression methods for binary, ordinal, and nominal responses for independent observations and for clustered data with marginal models and random effects models; two new chapters on alternative methods for binary response data, including smoothing and regularization methods, classification methods such as linear discriminant analysis and classification trees, and cluster analysis; new sections introducing the Bayesian approach for methods in that chapter; more than 100 analyses of data sets and over 600 exercises; notes at the end of each chapter that provide references to recent research and topics not covered in the text, linked to a bibliography of more than 1,200 sources; and, a supplementary website showing how to use R and SAS; for all examples in the text, with information also about SPSS and Stata and with exercise solutions. 'Categorical Data Analysis, Third Edition' is an invaluable tool for statisticians and methodologist
 29.68MB
CategoricalDataAnalysis.pdf
20180410The explosion in the development of methods for analyzing categorical data that began in the 1960s has continued apace in recent years. This book provides an overview of these methods, as well as older, now standard, methods. It gives special emphasis to generalized linear modeling techniques, which extend linear model methods for continuous variables, and their extensions for multivariate responses.
 8.68MB
定性数据分析英文第三版高清版 Categorical Data Analysis 3rd Edition by ALAN AGRESTI
20180405Categorical Data Analysis, Third Edition summarizes the latest methods for univariate and correlated multivariate categorical responses. Readers will find a unified generalized linear models approach that connects logistic regression and Poisson and negative binomial loglinear models for discrete data with normal regression for continuous data. ALAN AGRESTI is Distinguished Professor Emeritus in the Department of Statistics at the University of Florida. He has presented short courses on categorical data methods in thirty countries.
 3.66MB
Categorical Data Analysis.
20140818Categorical Data Analysis.是国外一数学学者的数据分析，主要是讲述logist模型和线性模型的来源，应用，检验，参数估计，这些模型可以应用到医生医学疾病的研究上面
 1.80MB
英文原版Categorical Data Analysis by Example 1st Edition
20190923Includes starred sections that provide more background details for interested readersCategorical Data Analysis by Example is a reference for students in statistics and researchers in other disciplines...
 26.25MB
Categorical Data Analysis Using The SAS System
20091106Chapter 1. Introduction 1 1.1 Overview . . . . . . . ....1.2 Scale of Measurement ....1.3 Sampling Frameworks ....1.4 OverviewofAnalysis Strategies ....1.5 WorkingwithTables in ...11.4 Analysis ofPainStudy . . . . ...
 11.33MB
Biostatistics by Example Using SAS Studio
20180831Biostatistics by Example Using SAS Studio PDF Purpose SAS University Edition and its user interface, SAS Studio, have become very ...• Categorical data analysis • Power and sample size calculations
 30.17MB
Learning pandas  Second Edition  2017 pdf 2分
201804241. pandas and Data Analysis Introducing pandas Data manipulation, analysis, science, and pandas Data manipulation Data analysis Data science Where does pandas fit? The process of data analysis The ...
 2.94MB
定性数据分析Categorical Data Analysis 2nd Edition  Agresti
20161005分享产生价值！ A valuable new edition of a standard reference "A 'musthave' book for anyone expecting to do research and/or applications in categorical data analysis." –Statistics in Medicine on Categorical Data Analysis, First Edition The use of statistical methods for categorical data has increased dramatically, particularly for applications in the biomedical and social sciences. Responding to new developments in the field as well as to the needs of a new generation of professionals and students, this new edition of the classic Categorical Data Analysis offers a comprehensive introduction to the most important methods for categorical data analysis. Designed for statisticians and biostatisticians as well as scientists and graduate students practicing statistics, Categorical Data Analysis, Second Edition summarizes the latest methods for univariate and correlated multivariate categorical responses. Readers will find a unified generalized linear models approach that connects logistic regression and Poisson and negative binomial regression for discrete data with normal regression for continuous data. Adding to the value in the new edition is coverage of: Three new chapters on methods for repeated measurement and other forms of clustered categorical data, including marginal models and associated generalized estimating equations (GEE) methods, and mixed models with random effects Stronger emphasis on logistic regression modeling of binary and multicategory data An appendix showing the use of SAS for conducting nearly all analyses in the book Prescriptions for how ordinal variables should be treated differently than nominal variables Discussion of exact smallsample procedures More than 100 analyses of real data sets to illustrate application of the methods, and more than 600 exercises An Instructor's Manual presenting detailed solutions to all the problems in the book is available from the Wiley editorial department.
 4.55MB
Categorical Data Analysis Using SAS(3rd) 无水印原版pdf
20180508Categorical Data Analysis Using SAS(3rd) 英文无水印原版pdf 第3版 pdf所有页面使用FoxitReader、PDFXChangeViewer、SumatraPDF和Firefox测试都可以打开 本资源转载自网络，如有侵权，请联系上传者或csdn删除 查看此书详细信息请在美国亚马逊官网搜索此书
 4.84MB
定性数据分析categorical data analysis 第二版，Alan Agresti
20091102This book provides an overview of methods for analyzing categorical data. It gives special emphasis to generalized linear modeling techniques and their extensions for multivariate responses.
 9.83MB
Time_Series_Analysis_and_its_Applications_with_R_Examples_4th_ed
20180509ARIMA models, spectral analysis and statespace models, the text includes modern developments including categorical time series analysis, multivariate spectral methods, long memory series, nonlinear ...
 4.49MB
Python Machine Learning By ExamplePackt Publishing(2017).epub
20180311A resurging interest in machine learning is due to the same factors that have made data mining and Bayesian analysis more popular than ever. This book is your entry point to machine learning. Chapter...
 2.77MB
The EM Algorithm and Extensions (2nd Edition)
20090518McLachlan is the author or coauthor of Analyzing Microarray Gene Expression Data, Finite Mixture Models, and Discriminant Analysis and Statistical Pattern Recognition, all published by Wiley....
 1.78MB
ECONOMETRIC_MODELS_with_MATLAB.pdf
20190517For models with categorical responses, see “Parametric Classification” on page 142 or “Supervised Learning (Machine Learning) Workflow and Algorithms” on page 152. The regression process ...
 1.41MB
源码Deep Learning with Theano
20180806connect the real world data to the input of a neural net, in particular for categorical and discrete data. This chapter presents an example on how to build an embedding space through training with ...

下载
行业分类机械工程一种用于中药薄片的切片烘干装置.zip
行业分类机械工程一种用于中药薄片的切片烘干装置.zip

下载
行业分类电子电器光电转换元件及图像传感器.zip
行业分类电子电器光电转换元件及图像传感器.zip

下载
行业分类电子电器供电电路和空调器.zip
行业分类电子电器供电电路和空调器.zip

下载
行业分类作业装置基于微型飞行器的目标检测方法.zip
行业分类作业装置基于微型飞行器的目标检测方法.zip

下载
行业分类作业装置基于物联网和大数据的汽车充电桩控制系统.zip
行业分类作业装置基于物联网和大数据的汽车充电桩控制系统.zip

下载
行业分类作业装置基于随动非接触支撑的薄壁件镜像铣削装备及方法.zip
行业分类作业装置基于随动非接触支撑的薄壁件镜像铣削装备及方法.zip

下载
KEIL ARM7/ARM9支持安装包
KEIL ARM7/ARM9支持安装包

下载
行业分类物理装置一种设计人员角色管控方法.zip
行业分类物理装置一种设计人员角色管控方法.zip

下载
行业分类作业装置基于选区激光熔化成型的镍包覆陶瓷复合粉末制备方法.zip
行业分类作业装置基于选区激光熔化成型的镍包覆陶瓷复合粉末制备方法.zip

下载
ridge_regression所用数据PRISON.csv
ridge_regression所用数据PRISON.csv