COGS-109-Modeling-and-Data-Analysis: Final project using multiple linear regression and clustering

COGS-109-Modeling-and-Data-Analysis-main.zip (5 files)

COGS-109-Modeling-and-Data-Analysis-main

COGS_109_Final Report.ipynb 650KB

COGS 109 Final Report PDF.pdf 1.23MB

ObesityDataSet.csv 257KB

README.md 1KB

Obesity Analysis Poster.jpg 1.29MB

# COGS-109-Modeling-and-Data-Analysis

This project uses Linear Regression and K-means Clustering to analyze the Eating Habits dataset, which contains variables that determine obesity.

Research Focus: Using exploratory linear regression and clustering, we examine several attributes from the dataset to identify which are the best indicators for predicting an individual's weight.

Dataset Information: The dataset consists of data collected from individuals in Mexico, Peru, and Colombia. The data is useful for estimating obesity levels based on eating habits and physical condition. There are 2111 instances and 17 attributes. Each instance is labeled with one of the following classes: Insufficient Weight, Normal Weight, Overweight Level I, Overweight Level II, Obesity Type I, Obesity Type II, and Obesity Type III.

NOTE:

- The main report can be found under: "COGS 109 Final Report PDF.pdf"
- The Jupyter Notebook containing our code can be found under: "COGS_109_Final Report.ipynb"
- The presentation poster can be found under: "Obesity Analysis Poster.jpg"
- The dataset we used can be found under: "ObesityDataSet.csv"

***Dataset credits to UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Estimation+of+obesity+levels+based+on+eating+habits+and+physical+condition+
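
As a rough illustration of the two techniques named above, the following is a minimal sketch, not the code from the project notebook. It assumes only NumPy; the function names `fit_linear_regression` and `kmeans` are hypothetical helpers introduced here, and the notebook may well use a library such as scikit-learn instead.

```python
import numpy as np


def fit_linear_regression(X, y):
    """Ordinary least squares with an intercept.

    Returns (coef, r_squared), where coef[0] is the intercept and
    coef[1:] are the per-feature slopes. Hypothetical helper.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    A = np.column_stack([np.ones(len(X)), X])  # prepend intercept column
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    residuals = y - A @ coef
    ss_res = float(residuals @ residuals)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    return coef, 1.0 - ss_res / ss_tot


def _assign(X, centroids):
    """Label each row of X with the index of its nearest centroid."""
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1)


def kmeans(X, k, n_iter=100, seed=0):
    """Plain Lloyd's algorithm. Returns (labels, centroids). Hypothetical helper."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct data points.
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        labels = _assign(X, centroids)
        # Recompute each centroid; keep the old one if its cluster is empty.
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return _assign(X, centroids), centroids
```

To apply this to the project data, one would load "ObesityDataSet.csv" (for example with pandas), pass a few numeric columns as `X`, and pass the weight column as `y`. The exact column names are not stated in this README, so check the CSV header first. Standardizing features before clustering is usually advisable, since K-means is sensitive to scale.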