A simple BeautifulSoup web-scraping script that retrieves the publically available classical music data (currently, .pdf and .mid files) from The Mutopia Project (http://www.mutopiaproject.org). The compositions are separated in to subdirectories based on their composers.
- Run sheetmusic_scraper.py
- Upon request by the script, provide the path at which you'd like the dataset to be saved.
- Run midimusic_scraper.py
- Upon request by the script, provide the path at which you'd like the dataset to be saved.