基于Python神经网络的通用股票预测模型资源-CSDN文库

共33个文件

py：13个

png：13个

md：3个

版权申诉

python

神经网络

91 浏览量 2024-07-11 19:44:09 上传评论收藏 5.01MB ZIP 举报

根据给定的信息，我们可以提炼出以下相关知识点： ### 一、学术蓝论文答辩PPT模板概述 - **适用对象**：此模板适用于那些希望在不同技术领域进行学习的小白或是进阶学习者。无论是用于本科或是研究生阶段的毕业设计、课程设计、大作业还是初期项目立项，这款模板都能够满足需求。 - **应用场景**： - **毕业设计**：在完成毕业设计的过程中，学生需要向导师展示自己的研究成果，此时一个专业的PPT模板就显得尤为重要。 - **课程设计**：许多课程会要求学生提交课程设计报告，并进行口头汇报。一个好的PPT模板可以帮助学生更好地组织思路，清晰地表达设计理念和技术实现过程。 - **大作业**：在一些综合性较强的课程中，教师可能会布置一些需要团队合作的大作业，此时利用模板可以统一报告风格，增强团队协作效果。 - **工程实训**：在参加各类工程实训时，学员需要将所学知识应用于实际项目中，通过PPT展示项目成果是常见的汇报方式之一。 - **初期项目立项**：对于创业者而言，在寻求投资或合作伙伴时，使用一个精心准备的PPT来介绍项目背景、市场分析、商业模式等内容至关重要。 ### 二、PPT模板结构解析 #### 1. 选题意义及目的这一部分通常放在PPT的开头，用来阐述研究课题的重要性和研究目的。例如，在模板中提到的“选题意义及目的”部分，尽管给出的内容并未直接与学术内容相关联，但可以推测这部分应该包括对选题背景、研究目标以及其对学术界或实际应用领域的贡献等方面进行详细介绍。撰写时应具体说明研究的必要性，比如解决了哪些现有问题，或者填补了哪些研究空白等。 #### 2. 主要研究内容这部分内容是整个PPT的核心，主要介绍研究过程中涉及的关键技术和方法。从模板提供的信息来看，虽然具体的研究内容没有给出，但是可以推测这部分应当详细列出研究的主要组成部分，包括但不限于采用的技术手段、实验设计、数据处理方法等。这一部分的目的是让听众了解研究是如何进行的，以及采用了哪些关键步骤来支持研究结论。 #### 3. 项目原理分析这是对研究背后理论基础的阐述，通常会介绍研究的基本原理、模型假设、算法选择等。对于复杂的技术项目而言，这一环节非常重要，因为它帮助听众理解研究的逻辑框架和发展脉络。 #### 4. 项目方案设计这一部分则更侧重于实施细节，比如如何根据研究目的和理论框架来设计实验方案、数据收集策略等。这部分内容往往更加具体化，旨在展示研究者是如何将理论转化为实践的。 #### 5. 研究结果及应用研究完成后，需要对结果进行分析，并探讨这些结果的实际应用价值。这部分不仅要呈现研究数据，还要解释这些数据的意义，以及它们如何支持最初的研究假设或目标。 #### 6. 总结建议 PPT的结尾部分通常是对整个研究工作的总结，以及对未来研究方向的展望。在这里可以提出一些改进建议或未来可能的研究路径，为听众留下深刻印象。学术蓝论文答辩PPT模板提供了一个结构清晰、内容丰富的框架，适合各种学术和技术领域的汇报使用。通过合理运用这一模板，可以使汇报更加专业、有条理，有助于提高汇报质量和效果。

资源推荐

资源详情

资源评论

收起资源包目录

stock_prediction-master.zip （33个子文件）

stock_prediction-master

target.py 20KB

utils.py 5KB

SECURITY.md 619B

init.py 3KB

fig_lstm

000001SZ_Pre.png 2.43MB

train_loss.png 1.71MB

bert_train_test.py 4KB

getdata.py 10KB

stock_data_spider.py 7KB

LICENSE 34KB

predict.py 39KB

readme

candleline.png 258KB

LSTM2.png 47KB

prediction.png 96KB

MAPE.png 4KB

loss.png 13KB

LSTM.png 38KB

LSTM3.png 96KB

MSE.png 2KB

RNN.png 82KB

bert_train.py 7KB

CODE_OF_CONDUCT.md 5KB

common.py 38KB

requirements.txt 255B

.gitignore 2KB

install_files

yfinance-0.1.59.tar.gz 25KB

data_preprocess.py 1KB

test.py 504B

README.md 33KB

bert_data_preprocess.py 2KB

get_bert_data.py 412B

fig_transformer

000001SZ_Pre.png 2.42MB

train_loss.png 1.95MB

___基于神经网络的通用股票预测模型 A general stock prediction model based on neural networks___ ## New >* 20230522 >* 1. 经过长时间的训练，分析和学习，我深深感觉到单纯使用lstm和transformer进行价格的预测是相当的困难。我下面的更新方向将向三个方向进行：一是开发一种新的模型以更加适配金融预测的特点；二是继续完成NLP方向的情感分析，做到分析大众和专业机构的恐慌程度；三是彻底重写一个新的预测程序，从预测价格转变为预测走势，降低预测的难度，提高预测的准确度。有能力或者有想法的朋友，欢迎给我提意见。 >* After a long time of training, analysis and learning, I deeply feel that it is quite difficult to use lstm and transformer alone to predict prices. My update direction below will go in three directions: one is to develop a new model to better adapt to the characteristics of financial forecasting; the second is to continue to complete the sentiment analysis in the NLP direction, so as to analyze the panic degree of the public and professional institutions; the third is to completely rewrite a new prediction program, from predicting prices to predicting trends, reducing the difficulty of prediction and improving the accuracy of prediction. Friends with ability or ideas are welcome to give me advice. >* 20230508 >* 1. 增加支持时间区间训练及预测，predict_days参数为正数时，使用区间模型，为负数时，使用单点模型 >* Add support for time interval training and prediction. When the predict_days parameter is a positive number, the interval model is used. When it is a negative number, the single point model is used. >* 20230506 >* 1. 按照原始论文，重新构建了transformer模型，使得训练速度提高20倍 >* According to the original paper, the transformer model is reconstructed, which makes the training speed 20 times faster >* 20230502 >* 1. 增加使用yfinance接口下载数据，如果是中国大陆用户，需要使用代理，或者使用其他接口 >* Add the use of yfinance interface to download data. If you are a user in mainland China, you need to use a proxy or other interface. >* 20230501 >* 1. 支持可变维度的输入，可变长度的输入，最大输入维度需要在init.py中设置 >* Support variable dimension input, variable length input, and the maximum input dimension needs to be set in init.py >* 2. 修改删除nan数据整行的模式为将nan数据替换为-0.0 >* Modify the mode of deleting the whole row of nan data to replace the nan data with -0.0 >* 3. 增加transformer模型的mask机制，以支持可变长度的输入 >* Add the mask mechanism of the transformer model to support variable length input >* 4. 修改很多数据处理及数据接口相关代码，目前默认是akshare接口，如果需要使用tushare，请自行修改代码, 且考虑删除对于tushare的支持，后期如需要使用tushare，请自行回退至上一个版本 >* Modify a lot of data processing and data interface related code, the default is akshare interface, if you need to use tushare, please modify the code by yourself, and consider deleting the support for tushare. If you need to use tushare later, please roll back to the previous version by yourself. >* 5. 修正了在akshare接口下，预测数据的bug >* Corrected the bug of predicting data under the akshare interface >* 20230428 >* 1. 增加新的数据接口，解决原接口速度慢，很多数据还需要付费的问题 >* Add a new data interface to solve the problem of slow speed of the original interface and many data still need to be paid for. >* 2. 增加了对于复权数据的支持，注意在tushare数据源中，复权数据是需要更高的权限的（付费） >* Added support for adjusted data. Note that in the tushare data source, adjusted data requires higher permissions (paid). >* 3. 复权数据有其自身的优点和缺点，请自行选择不复权，前复权，后复权 >* Adjusted data has its own advantages and disadvantages. Please choose unadjusted, pre-adjusted, and post-adjusted. >* 20230416 >* 1. 已尝试修复之前发现的bug，目前没有复发的问题，如果有问题，请及时反馈 >* The bug found previously has been tried to be fixed, and there is no problem that has been reoccurred. If there is a problem, please feedback as soon as possible. >* 2. 增加针对文字进行情感分析的模型，支持中文，目前仅有数据处理及训练的代码，尚未合并到预测功能中，有兴趣的朋友可以自行尝试，欢迎提出建议 >* Add a model for sentiment analysis of text, supporting Chinese, with only data processing and training code at present, not yet merged into the prediction function. Friends who are interested can try it on their own, welcome to make suggestions. >* 3. NLP的模型，由于版权和其他问题，我没有提供数据，或数据下载的方式，请自行寻找数据源，或者使用自己的数据源，需要的格式为csv文件，包含label和text两列，label为0或1，text为文本内容，如果有兴趣，可以自行尝试，欢迎提出建议。 >* Due to copyright and other issues, I did not provide data or data download methods for the NLP model. Please find your own data source, or use your own data source. The required format is a csv file with two columns, label and text. The label is 0 or 1, and the text is the text content. If you are interested, you can try it on your own, welcome to make suggestions. >* 20230413 >* 目前发现的bug: Bugs found so far: >* 1. predict模式会倒是load data失败 (尝试修复) >* predict mode will fail to load data (try to fix) >* 2. 长时间训练，有概率导致multiprocess.queue异常 (原因未知，请有能力的朋友帮我一起debug) >* long training time may cause multiprocess.queue exception (cause unknown, please help me debug if you are capable) >* 20230412 >* 1. 修复重大bug：计算输入维度时，少计算了初始的8个维度，更新了这个版本后，之前训练的模型将不能使用，需要重新训练；如果还需要使用之前的模型，请手工修改init.py中的INPUT_DIMENSION为20（最小为4，且不能小于输出维度OUTPUT_DIMENSION），并检查common.py中的add_target函数中的相关内容. >* Fix major bug: when calculating the input dimension, 8 dimensions were less calculated. After updating this version, the previously trained models will no longer be available and need to be retrained. If you still need to use the previous model, please manually modify INPUT_DIMENSION in init.py to 20 (minimum 4, and cannot be less than OUTPUT_DIMENSION), and check the related content in the add_target function in common.py. >* 20230402 >* 1. 修改dataset读取方式，使用data queue以及buffer，减少IO次数，提高训练速度 >* Modify the dataset reading method to use data queue and buffer to reduce the number of IO operations and improve training speed. >* 2. 将全局变量移动到init.py中，方便修改 >* Move global variables to init.py for easy modification. >* 20230328 >* 1. 修改预处理数据文件格式，增加ts_code和date两个字段，方便后续使用 >* Modify the format of the preprocessed data file, add two fields ts_code and date, for future use. >* 2. 修改lstm和transformer模型，以支持混合长度输入 >* Modify the lstm and transformer models to support mixed length input. >* 3. 在transformer模型，增加了 decoder层，期望增加预测精度 >* Added a decoder layer in the transformer model, hoping to increase the prediction accuracy. >* 20230327 >* 1. 修改了部分运行逻辑，配合load pkl预处理文件，极大的提高了训练速度 >* Modified some running logic and used preprocessed pkl files to greatly improve training speed. >

评论收藏

内容反馈

版权申诉