# UCI Automobile Dataset: Exploratory Data Analysis
In this project, I perform exploratory data analysis on the UCI Automobile dataset, available at https://archive.ics.uci.edu/ml/machine-learning-databases/autos/imports-85.data

The dataset consists of data from the 1985 Ward's Automotive Yearbook. Its sources are:
1) 1985 Model Import Car and Truck Specifications, 1985 Ward's Automotive Yearbook.
2) Personal Auto Manuals, Insurance Services Office, 160 Water Street, New York, NY 10038
3) Insurance Collision Report, Insurance Institute for Highway Safety, Watergate 600, Washington, DC 20037
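
Number of Instances: 205
Number of Attributes: 26 in total (15 continuous, 1 integer, 10 nominal)
Attribute Information:
symboling: integer, from -3 to 3
normalized-losses: continuous
make: nominal
fuel-type: nominal
aspiration: nominal
num-of-doors: nominal
body-style: nominal
drive-wheels: nominal
engine-location: nominal
wheel-base: continuous
length: continuous
width: continuous
height: continuous
curb-weight: continuous
engine-type: nominal
num-of-cylinders: nominal
engine-size: continuous
fuel-system: nominal
bore: continuous
stroke: continuous
compression-ratio: continuous
horsepower: continuous
peak-rpm: continuous
city-mpg: continuous
highway-mpg: continuous
price: continuous
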
This data set consists of three types of entities:
I - The specification of an auto in terms of various characteristics.
II - Its assigned insurance risk rating. This rating corresponds to the degree to which the auto is riskier than its price indicates. Each car is initially assigned a risk factor symbol associated with its price. Then, if the car is riskier (or less risky), this symbol is adjusted by moving it up (or down) the scale. Actuaries call this process "symboling".
III - Its normalized losses in use as compared to other cars. This is the relative average loss payment per insured vehicle year. The value is normalized for all autos within a particular size classification (two-door small, station wagons, sports/specialty, etc.) and represents the average loss per car per year.
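
The data file linked above has no header row and marks missing values with `?`, so both have to be handled when reading it. The following is a minimal sketch of one way to load it with pandas, not necessarily the notebook's exact code. The column names are taken from the UCI `imports-85.names` description, `load_autos` is a hypothetical helper, and the two sample rows are only illustrative records in the file's format so that the block runs without network access.

```python
import io

import pandas as pd

# Column names per the UCI imports-85.names description;
# the data file itself carries no header row.
COLUMN_NAMES = [
    "symboling", "normalized-losses", "make", "fuel-type", "aspiration",
    "num-of-doors", "body-style", "drive-wheels", "engine-location",
    "wheel-base", "length", "width", "height", "curb-weight", "engine-type",
    "num-of-cylinders", "engine-size", "fuel-system", "bore", "stroke",
    "compression-ratio", "horsepower", "peak-rpm", "city-mpg", "highway-mpg",
    "price",
]


def load_autos(source):
    """Read the data from a path, URL, or file-like object (hypothetical helper)."""
    # "?" is the file's missing-value marker, so map it to NaN on load.
    return pd.read_csv(source, header=None, names=COLUMN_NAMES, na_values="?")


# Illustrative rows in the file's format, standing in for the full file.
sample = (
    "3,?,alfa-romero,gas,std,two,convertible,rwd,front,88.60,168.80,64.10,"
    "48.80,2548,dohc,four,130,mpfi,3.47,2.68,9.00,111,5000,21,27,13495\n"
    "2,164,audi,gas,std,four,sedan,fwd,front,99.80,176.60,66.20,"
    "54.30,2337,ohc,four,109,mpfi,3.19,3.40,10.00,102,5500,24,30,13950\n"
)
df = load_autos(io.StringIO(sample))
print(df.shape)
print(df["normalized-losses"].isna().sum())
```

Passing the dataset URL or a local copy of `imports-85.data` to `load_autos` in place of the `StringIO` object should read the full file the same way.
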

The analysis is divided into two parts.

Data Wrangling:
- Pre-processing data in Python
- Dealing with missing values
- Data formatting
- Data normalization
- Binning
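
The data wrangling steps listed above can be sketched roughly as follows. This is an illustration under stated assumptions rather than the notebook's actual code: the rows are toy values (not taken from the dataset), the derived column names (`city-L/100km`, `length-scaled`, `horsepower-binned`) are hypothetical, and mean imputation, min-max scaling, and three equal-width bins are just one reasonable choice for each step.

```python
import numpy as np
import pandas as pd

# Toy rows, NOT real dataset values; "?" mimics the file's missing-value marker.
df = pd.DataFrame({
    "horsepower": ["111", "154", "?", "102", "115", "262"],
    "city-mpg": [21, 19, 24, 24, 18, 13],
    "length": [168.8, 171.2, 176.6, 176.6, 192.7, 199.6],
})

# Missing values: coerce "?" to NaN, then fill with the column mean.
df["horsepower"] = pd.to_numeric(df["horsepower"], errors="coerce")
df["horsepower"] = df["horsepower"].fillna(df["horsepower"].mean())

# Data formatting: convert miles per gallon to litres per 100 km.
df["city-L/100km"] = 235.215 / df["city-mpg"]

# Data normalization: min-max scale length into the range [0, 1].
length_min, length_max = df["length"].min(), df["length"].max()
df["length-scaled"] = (df["length"] - length_min) / (length_max - length_min)

# Binning: three equal-width horsepower bins with readable labels.
edges = np.linspace(df["horsepower"].min(), df["horsepower"].max(), 4)
df["horsepower-binned"] = pd.cut(
    df["horsepower"], bins=edges, labels=["Low", "Medium", "High"],
    include_lowest=True,
)

print(df)
```

Mean imputation suits continuous columns such as horsepower; for a categorical column, filling with the most frequent value would be the analogous choice.
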

Exploratory Data Analysis:
- Descriptive statistics
- Groupby
- Analysis of variance
- Correlation
- Correlation stats

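
The exploratory steps listed above can be sketched in the same spirit. Again, this is a hedged illustration and not the notebook's code: the rows are toy values (not taken from the dataset), and the choice of `drive-wheels`, `engine-size`, and `price` as the columns to compare is an assumption made for the example.

```python
import pandas as pd
from scipy import stats

# Toy rows, NOT real dataset values.
df = pd.DataFrame({
    "drive-wheels": ["fwd", "fwd", "fwd", "rwd", "rwd", "rwd",
                     "4wd", "4wd", "4wd"],
    "engine-size": [90, 98, 110, 130, 152, 181, 108, 110, 136],
    "price": [6000, 7200, 8500, 13500, 16500, 21000, 7600, 9200, 17400],
})

# Descriptive statistics for the numeric columns.
print(df.describe())

# Groupby: average price per drive-wheel category.
group_means = df.groupby("drive-wheels")["price"].mean()
print(group_means)

# Analysis of variance: do mean prices differ between the categories?
groups = [g["price"].to_numpy() for _, g in df.groupby("drive-wheels")]
f_stat, anova_p = stats.f_oneway(*groups)
print(f_stat, anova_p)

# Correlation: pairwise Pearson coefficients between numeric columns.
corr_matrix = df[["engine-size", "price"]].corr()
print(corr_matrix)

# Correlation stats: the coefficient together with its p-value.
r, corr_p = stats.pearsonr(df["engine-size"], df["price"])
print(r, corr_p)
```

A large F statistic with a small p-value suggests the group means differ. Similarly, a Pearson coefficient near 1 or -1 with a small p-value suggests a strong linear relationship.
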
Acknowledgment
Dataset: UCI Machine Learning Repository
Data link: https://archive.ics.uci.edu/ml/machine-learning-databases/autos/imports-85.data