0 parents
commit 334722b
Showing 1 changed file with 54 additions and 0 deletions.
@@ -0,0 +1,54 @@
# fcc_nn_research

Cleaned-up versions of notebooks used in researching public comments for FCC Proceeding 17-108 (Net Neutrality Repeal). I am posting the notebook for Exploratory Data Analysis first, and will include others as they are cleaned up. Please share with the rest of us what interesting insights you glean from the data! Tweet at me [@jeffykao](https://twitter.com/jeffykao).

## Recent Updates (11-25-2017)

None. First commit!

## Background Information

I did this project as a part of the coursework for [Metis](https://www.thisismetis.com/) and was shocked to see my analysis blow up online. I'm humbled by the attention, but I'm sure experienced data scientists out there could glean even more insights from the work. :-)

### Media
- [Blog post on the results](https://hackernoon.com/more-than-a-million-pro-repeal-net-neutrality-comments-were-likely-faked-e9f0e3ed36a6)
- [Washington Post article](https://www.washingtonpost.com/news/the-switch/wp/2017/11/24/fcc-net-neutrality-process-corrupted-by-fake-comments-and-vanishing-consumer-complaints-officials-say/)
- [Fortune article](http://fortune.com/2017/11/25/1-million-fake-fcc-comments-net-neutrality-were-probably-fake/)

## Getting Started

This is just a rough sketch of the instructions to get the project up and running on your local machine. Once you have Anaconda installed, the libraries should be easy to install and the notebooks should be fairly straightforward to run. Installation instructions for each library should be easy to find online.

### Prerequisites

#### Data

The first set of data (text and duplicate counts only) is [posted on Kaggle](https://www.kaggle.com/jeffkao/proc_17_108_unique_comments_text_dupe_count). I'm working hard to get the non-text data up as well and will tweet progress updates from [@jeffykao](https://twitter.com/jeffykao).
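
If you want a quick feel for a text-plus-duplicate-count file before opening the notebooks, something like the sketch below may help. It is only an illustration: the column names `text_data` and `dupe_count` are assumptions (check the header of the CSV you download), the sample rows are made up, and it uses only the standard library so it runs without any of the packages listed in this README.

```python
import csv
import io

# Assumed column names; adjust them to match the header of the real CSV.
TEXT_COL = "text_data"
COUNT_COL = "dupe_count"

def summarize(csv_file, top_n=3):
    """Return (number of unique comments, sum of counts, top_n rows by count)."""
    rows = [(row[TEXT_COL], int(row[COUNT_COL])) for row in csv.DictReader(csv_file)]
    count_sum = sum(count for _, count in rows)
    top = sorted(rows, key=lambda r: r[1], reverse=True)[:top_n]
    return len(rows), count_sum, top

# Tiny made-up sample standing in for the real file.
sample = io.StringIO(
    "text_data,dupe_count\n"
    "Comment A,5\n"
    "Comment B,1\n"
    "Comment C,12\n"
)
unique, count_sum, top = summarize(sample, top_n=2)
print(unique, count_sum, top)
```

For the real data you would pass an open file handle instead of the in-memory sample, for example `open(path, newline="", encoding="utf-8")`, where `path` is wherever you saved the download.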

#### Python/Anaconda version

- Python 3.6.1 (64-bit)
- conda 4.3.29

#### Libraries used

- [NumPy](http://www.numpy.org)
- [scikit-learn](http://scikit-learn.org/stable/)
- [matplotlib](http://matplotlib.org)
- [pandas](http://pandas.pydata.org)
- [HDBSCAN](https://github.com/scikit-learn-contrib/hdbscan)
- [spaCy](https://spacy.io/usage/)
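
Before launching the notebooks, it can be handy to confirm that the libraries listed above are visible to your Python environment. The snippet below is a minimal sketch, not part of the project itself; the helper name `missing_modules` is made up for illustration, and the import names (for example `sklearn` for scikit-learn) are the usual ones for these packages.

```python
from importlib.util import find_spec

# Usual import names for the libraries listed above
# (scikit-learn is imported as "sklearn").
REQUIRED = ["numpy", "sklearn", "matplotlib", "pandas", "hdbscan", "spacy"]

def missing_modules(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if find_spec(name) is None]

missing = missing_modules(REQUIRED)
if missing:
    print("Not installed:", ", ".join(missing))
else:
    print("All listed libraries are importable.")
```

Anything reported as not installed can then be added with conda or pip, following that library's own installation instructions.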

## License

This project is licensed under the MIT License - see the [LICENSE.md](LICENSE.md) file for details.

## Acknowledgments

* [@drob](https://twitter.com/drob) for putting the blog post on blast and giving me some great advice in the aftermath
* [@leland_mcinnes](https://twitter.com/leland_mcinnes) for authoring HDBSCAN
* [@bekcunning](https://twitter.com/bekcunning) for sending me the link that made me finally _write that g***** blog post!_
* [@prb_data](https://twitter.com/prb_data) & Joe Eddy, my instructors at [Metis](https://www.thisismetis.com/)
* [@AndrewDBS](https://twitter.com/AndrewDBS), who convinced me to get a Twitter account
* My amazing & creative wife/editor, who read through & greatly improved my drafts
* Sweat pants.