# Adults DataSet UCI
# Problem Setting
A polling institute wants to estimate an individual’s income from their personal data (see einkommen.train). To this end, 30,000 individuals were interviewed about the features summarized below. For some individuals, not all features are available. Crucially, the income is known for only 5,000 of the interviewees.
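The split between interviewees with known and unknown income can be sketched as follows. This is a minimal illustration, not the project's actual loading code: the column names, the `?` missing-value marker, and the inline sample are all assumptions, since the layout of einkommen.train is not described here.

```python
import io

import pandas as pd

# Hypothetical sample rows; the real columns of einkommen.train may differ.
SAMPLE = """age,workclass,hours_per_week,income
39,State-gov,40,<=50K
50,?,13,?
38,Private,40,>50K
53,Private,?,?
"""


def load_and_split(source, target="income", missing_marker="?"):
    """Read a CSV, map the assumed missing marker to NaN, and split rows
    by whether the target (income) is known."""
    df = pd.read_csv(source, na_values=[missing_marker], skipinitialspace=True)
    labeled = df[df[target].notna()]
    unlabeled = df[df[target].isna()]
    return df, labeled, unlabeled


df, labeled, unlabeled = load_and_split(io.StringIO(SAMPLE))
print(len(labeled), "rows with known income;", len(unlabeled), "rows without")
print(df.isna().sum())  # missing values per column
```

Only the rows with known income can be used for supervised training and evaluation; the remaining rows are the ones whose income is to be estimated.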
# Steps:
* Data Integration
* Feature Representation
* EDA Pairplot
* Correlation of Numeric Attributes
* Missing Value Representation
* Data Cleaning, convert categorical variables to numerical
* Check missing values
* Feature Selection
* Model Selection and Evaluation
  * 'Logistic Regression'
  * 'Random Forest'
  * 'Neural Network'
  * 'GaussianNB'
  * 'DecisionTreeClassifier'
  * 'SVM'
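
The cleaning, encoding, and model comparison steps listed above could look roughly like the sketch below. It assumes scikit-learn and pandas, and uses synthetic data with hypothetical column names (`age`, `hours`, `workclass`) in place of einkommen.train. The imputation strategies, hyperparameters, and 5-fold cross-validation are illustrative choices, not necessarily those used in the notebook.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the labeled part of the data (assumption).
rng = np.random.default_rng(0)
n = 300
age = rng.integers(18, 70, n).astype(float)
hours = rng.integers(10, 60, n).astype(float)
workclass = rng.choice(["Private", "Self-emp", "Gov"], n).astype(object)
y = (age + hours + rng.normal(0, 10, n) > 80).astype(int)  # 1 = high income
age[rng.random(n) < 0.05] = np.nan        # inject missing numeric values
workclass[rng.random(n) < 0.05] = np.nan  # inject missing categories
X = pd.DataFrame({"age": age, "hours": hours, "workclass": workclass})


def make_preprocessor():
    """Impute missing values, scale numeric columns, one-hot encode categoricals."""
    numeric = make_pipeline(SimpleImputer(strategy="median"), StandardScaler())
    categorical = make_pipeline(
        SimpleImputer(strategy="most_frequent"),
        OneHotEncoder(handle_unknown="ignore"),
    )
    return ColumnTransformer(
        [("num", numeric, ["age", "hours"]), ("cat", categorical, ["workclass"])],
        sparse_threshold=0,  # always return a dense array (GaussianNB needs one)
    )


models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "Neural Network": MLPClassifier(hidden_layer_sizes=(16,), max_iter=1000, random_state=0),
    "GaussianNB": GaussianNB(),
    "DecisionTreeClassifier": DecisionTreeClassifier(random_state=0),
    "SVM": SVC(),
}

# Mean 5-fold cross-validated accuracy per model.
scores = {}
for name, model in models.items():
    pipeline = make_pipeline(make_preprocessor(), model)
    scores[name] = cross_val_score(pipeline, X, y, cv=5, scoring="accuracy").mean()

for name, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:24s} {score:.3f}")
```

Putting the preprocessing inside the pipeline means imputation and scaling are fitted on each training fold only, so the cross-validation scores are not inflated by information from the held-out fold.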