mlp.zip_MLP_loss_人工智能_人工智能导论作业

共7个文件

py：3个

pyc：2个

pickle：2个

版权申诉

loss

人工智能

5星 · 超过95%的资源 195 浏览量 2022-09-24 08:38:04 上传评论 1 收藏 1.52MB ZIP 举报

**标题解析：** "mlp.zip_MLP_loss_人工智能_人工智能导论作业_水杯分类" 这个标题提到了几个关键元素。“mlp.zip”表明这是一个使用多层感知机（MLP，Multi-Layer Perceptron）模型的项目文件，被压缩在zip格式的文件中。接着，“MLP_loss”指涉的是模型训练过程中损失函数（Loss Function）的变化，通常用于衡量模型预测结果与实际结果的差距。再者，“人工智能”和“人工智能导论作业”暗示这是一项关于人工智能课程的学习任务，可能涉及机器学习的基础理论和实践应用。“水杯分类”说明具体的任务是图像分类，目标是识别和区分水杯。 **描述解析：** 描述提到这个作业是用mlp方法实现的水杯图片分类，并生成了loss的下降趋势图。这意味着学生或研究者使用了多层感知机模型对水杯图片进行训练，通过反向传播优化模型参数，同时记录并展示了训练过程中的损失函数值随训练迭代的减少情况，这通常是为了验证模型在训练过程中的学习效果和防止过拟合。 **标签解析：** 1. **mlp** - 多层感知机，一种前馈神经网络，常用于处理复杂的数据，如图像、语音等。 2. **loss** - 损失函数，衡量模型预测结果与真实结果的差异，是模型训练中的重要指标。 3. **人工智能** - 涵盖了模拟人类智能的各种技术，包括机器学习、深度学习等。 4. **人工智能导论作业** - 表明这是一个入门级别的课程项目，可能涵盖了基础的人工智能概念和算法。 5. **水杯分类** - 具体的图像识别任务，可能涉及到卷积神经网络（CNN）等图像处理技术。 **压缩包子文件的文件名称列表：** 仅有一个文件名 "mlp" 提供，这可能是模型代码的主文件或者包含了模型训练的整个流程。通常，这样的文件可能包含Python代码，使用深度学习库如TensorFlow或PyTorch实现多层感知机模型，进行数据预处理、模型定义、训练、评估和损失函数绘制等相关操作。这个压缩包文件包含了一个基于多层感知机的水杯图像分类项目。用户可能需要了解和应用以下知识点： 1. **多层感知机（MLP）** 的原理和实现，包括隐藏层、激活函数（如ReLU）、权重初始化等。 2. **损失函数（Loss Function）**，如交叉熵损失，以及如何计算和使用它来优化模型。 3. **反向传播（Backpropagation）** 算法，用于更新模型权重以减小损失。 4. **梯度下降（Gradient Descent）** 或其他优化器（如Adam）在训练过程中的作用。 5. **图像数据预处理**，包括归一化、数据增强等，以提高模型性能。 6. **模型训练与验证**，理解训练集、验证集和测试集的区别，以及如何监控模型性能。 7. **损失函数曲线绘制**，使用Matplotlib或TensorBoard等工具展示损失随时间的变化。 8. **深度学习框架**，如TensorFlow、PyTorch的使用，包括模型构建、训练、保存和加载等操作。 9. **图像分类** 的基本概念和步骤，包括特征提取、分类器训练等。通过这个项目，学习者可以深入理解深度学习模型在实际任务中的应用，以及如何通过调整模型参数和训练策略来改善模型性能。

资源推荐

资源详情

资源评论

收起资源包目录

mlp.zip （7个子文件）

mlp

mlp.pyc 3KB

mlp.py 2KB

__init__.pyc 177B

run_model.py 6KB

dataset

train_forstu.pickle 5.6MB

valid_forstu.pickle 1.77MB

__init__.py 0B

import random import numpy as np import matplotlib.pyplot as plt import pickle,pprint from classifier.knn import KNearestNeighbor from classifier.svm import LinearSVM from classifier.softmax import Softmax from classifier.mlp import MultiLayerPerceptron print "hello nut" ##load dataset pkl_file = open('dataset/train_forstu.pickle', 'rb') train = pickle.load(pkl_file) #pprint.pprint(train) pkl_file.close() train_x=np.array(train[0]) train_y=np.array(train[1]) print train_x.shape print train_y.shape ############################################################################ ######### type in the path of test dataset in the next line ########### ############################################################################ pkl_file = open('dataset/valid_forstu.pickle', 'rb') ############################################################################ ######### type in the path of test dataset in the above line ########### ############################################################################ valid = pickle.load(pkl_file) pkl_file.close() valid_x=valid[0] valid_y=valid[1] print valid_x.shape, valid_x.min() print valid_y.shape ,valid_y.min() ############################################################################ ######### type in the model in the next line ########### ############################################################################ model='mlp' #can be 'knn','svm','softmax','mlp' ############################################################################ ######### type in the model in the above line ########### ############################################################################ if model=='knn': classifier = KNearestNeighbor() classifier.train(train_x, train_y) pred_y=classifier.predict(valid_x,k=10) num_correct = np.sum(pred_y == valid_y) accuracy = float(num_correct) / valid_y.shape[0] print 'Got %d / %d correct using knn => accuracy: %f' % (num_correct, valid_y.shape[0], accuracy) result_file = open('predict/knn_predict.txt', 'w') for i in xrange(valid_y.shape[0]): result_file.write(str(pred_y[i])+'\n') result_file.close( ) elif model=='svm': mean_x = np.mean(train_x, axis=0) print "mean_x",mean_x.shape train_x-=mean_x valid_x-=mean_x train_x= np.hstack([train_x, np.ones((train_x.shape[0], 1))]) valid_x= np.hstack([valid_x, np.ones((valid_x.shape[0], 1))]) print train_x.shape,valid_x.shape classifier = LinearSVM() history1=classifier.train(train_x, train_y, learning_rate=1e-7, reg=5e4, num_iters=1000,batch_size=200, verbose=True) history2=classifier.train(train_x, train_y, learning_rate=1e-9, reg=5e4, num_iters=1000,batch_size=200, verbose=True) y_train_pred = classifier.predict(train_x) print 'training accuracy: %f' % (np.mean(train_y == y_train_pred), ) y_val_pred = classifier.predict(valid_x) print 'validation accuracy: %f using svm' % (np.mean(valid_y == y_val_pred), ) result_file = open('predict/svm_predict.txt', 'w') for i in xrange(valid_y.shape[0]): result_file.write(str(y_val_pred[i])+'\n') result_file.close( ) plt.plot(history1) plt.xlabel('Iteration number') plt.ylabel('Loss value') plt.show() elif model=='softmax': mean_x = np.mean(train_x, axis=0) print "mean_x",mean_x.shape train_x-=mean_x valid_x-=mean_x train_x= np.hstack([train_x, np.ones((train_x.shape[0], 1))]) valid_x= np.hstack([valid_x, np.ones((valid_x.shape[0], 1))]) print train_x.shape,valid_x.shape classifier = Softmax() history1=classifier.train(train_x, train_y, learning_rate=1e-7, reg=5e4, num_iters=2000,batch_size=200, verbose=True) #history2=classifier.train(train_x, train_y, learning_rate=1e-9, reg=5e4, num_iters=1000,batch_size=200, verbose=True) y_train_pred = classifier.predict(train_x) print 'training accuracy: %f' % (np.mean(train_y == y_train_pred), ) y_val_pred = classifier.predict(valid_x) print 'validation accuracy: %f using softmax' % (np.mean(valid_y == y_val_pred), ) result_file = open('predict/softmax_predict.txt', 'w') for i in xrange(valid_y.shape[0]): result_file.write(str(y_val_pred[i])+'\n') result_file.close( ) plt.plot(history1) plt.xlabel('Iteration number') plt.ylabel('Loss value') plt.show() elif model=='mlp': #mean_x = np.mean(train_x, axis=0) #train_x-=mean_x #valid_x-=mean_x #train_x/=255 #valid_x/=255 train_y1=np.zeros((train_y.shape[0],6),dtype='float32') valid_y1=np.zeros((valid_y.shape[0],6),dtype='float32') for i in xrange(train_y.shape[0]): train_y1[i,int(train_y[i])]=1 for i in xrange(valid_y.shape[0]): valid_y1[i,int(valid_y[i])]=1 #print valid_y print "x ",train_x.shape,valid_x.shape print "y ",train_y.shape,valid_y.shape classifier=MultiLayerPerceptron() pred_y,loss_history_tr,loss_history_va=classifier.run_mlp(train_x,train_y1,valid_x,valid_y1,num_iters=1000,verbose=True) print pred_y.shape print 'validation accuracy: %f using mlp' % (np.mean(valid_y == pred_y), ) result_file = open('predict/mlp_predict.txt', 'w') for i in xrange(valid_y.shape[0]): result_file.write(str(pred_y[i])+'\n') result_file.close( ) plt.plot(loss_history_tr,"g-",label="train accuracy") plt.plot(loss_history_va,"r-.",label="valid accuracy") plt.xlabel('Iteration number') plt.ylabel('accuracy') plt.legend() plt.show() else: print "please check the model type, they should be from 'knn','svm','softmax','mlp'"

评论收藏

内容反馈

版权申诉