# SummerBusinessAnalystProject
This is some of the work I did over my 2020 summer internship.
## Files
### GiannaManiaci_2020SummerIntern_Presentation[6503].pptx
* Final presentation for the summer. The presenter notes explain my entire process.
### gianna_work_train.py
* The data pipeline I created to clean and organize the data. It transforms the text data into numerical vectors, the form a computer can actually work with. The user can choose among four vector representations: BoW (bag of words), TF-IDF, LDA, or Doc2Vec.
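
To give a rough idea of what "choosing a representation" means, here is a minimal standard-library sketch of the two count-based options, BoW and TF-IDF. It is not the code in `gianna_work_train.py`: the function names (`tokenize`, `bow_vectors`, `tfidf_vectors`, `vectorize`) and the sample sentences are hypothetical, and the TF-IDF formula is the plain textbook variant, which may differ from what the pipeline uses. LDA and Doc2Vec are left out because they usually rely on a third-party library.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical minimal cleaning: lowercase and split on whitespace.
    return text.lower().split()


def bow_vectors(docs):
    # Bag of words: one column per vocabulary term, each cell is a raw count.
    vocab = sorted({tok for doc in docs for tok in tokenize(doc)})
    vectors = []
    for doc in docs:
        counts = Counter(tokenize(doc))
        vectors.append([counts[term] for term in vocab])
    return vocab, vectors


def tfidf_vectors(docs):
    # TF-IDF: term frequency scaled by log(N / document frequency).
    # Assumption: unsmoothed textbook formula; libraries often smooth it.
    vocab, counts = bow_vectors(docs)
    n_docs = len(docs)
    doc_freq = [sum(1 for row in counts if row[j] > 0) for j in range(len(vocab))]
    idf = [math.log(n_docs / df) for df in doc_freq]
    vectors = []
    for row in counts:
        total = sum(row) or 1  # avoid dividing by zero for an empty document
        vectors.append([(c / total) * w for c, w in zip(row, idf)])
    return vocab, vectors


def vectorize(docs, method="bow"):
    # Dispatch on the method name chosen by the user.
    methods = {"bow": bow_vectors, "tfidf": tfidf_vectors}
    if method not in methods:
        raise ValueError(f"unsupported method in this sketch: {method!r}")
    return methods[method](docs)


if __name__ == "__main__":
    sample = ["the cat sat", "the dog sat down"]
    print(vectorize(sample, "bow"))
    print(vectorize(sample, "tfidf"))
```

In this sketch a word that appears in every document (such as "the" in the sample) gets a TF-IDF weight of zero, which is the main practical difference from the raw BoW counts.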