Skip to content

vivektangudu123/why-so-harsh

 
 

Repository files navigation

TEAM TWO
Vivek Tangudu 	    IMT2020110
Harshadeep Donapati IMT2020085

folder structure:
.
├── img
│   ├── comment_frequency.png
│   └── wordcloud.png
├── input_csv
│   ├── sample.csv
│   ├── test.csv
│   └── train.csv
├── ML_TeamTwo_Report.pdf
├── ML_TeamTwo_script.ipynb
├── ML_TeamTwo_script.pdf
├── ML_TeamTwo_Slides.pdf
├── output_csv
│   ├── final_kaggle_submission.csv
│   ├── submission_cc.csv
│   ├── submission_lr_lemma.csv
│   ├── submission_lr_stemm.csv
│   ├── submission_rid_lemma.csv
│   ├── submission_rid_stemm.csv
│   ├── submission_sgdc_lemma.csv
│   └── submission_sgdc_stemm.csv
├── pickle
│   ├── model_cc.pkl
│   ├── model_lr.pkl
│   └── model_sgdc.pkl
├── README.txt
└── util
    ├── frequency_bigramdictionary_en_243_342.txt
    └── frequency_dictionary_en_82_765.txt

5 directories, 23 files

final_kaggle_submission.csv:      final highest score kaggle submission csv file
model_lr.pkl:                     Logistic Regression pickle file
ML_TeamTwo_script.pdf:            PDF version of jupyter notebook


Preproccesed data pickle files are huge, so not including them here.
They can be viewed online at https://www.kaggle.com/datasets/harshalps/whysoharsh

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%