Skip to content

Latest commit

 

History

History
25 lines (15 loc) · 1.42 KB

README.md

File metadata and controls

25 lines (15 loc) · 1.42 KB

Quality of Life Meets Metropolitan Ambition

Utilizing Natural Language Processing, we apply topic modelling on data scraped from instagram about copenhagen.

Results

Instaloader scrape

The folder 'instaloader' contains the scraping script to scrape whatever hashtag desired. To run the script, open the directory in your terminal and type: python instaloader "#INSERTHASTHTAG" --count INSERTCOUNT

Example: python instaloader "#norrebro" --count 10000

The script will then initialize the scrape and show a count of how many posts was scraped. The data is stored in a new directory with the name of the hashtag you entered. So if you entered #norrebro, there will now be a folder named #norrebro in your instaloader directory containing a .txt file for each post scraped.

Data manager

The data manager script basically loads in all files in a given directory. You initialize the script by doing the following:

  1. enter the main directory mini-1 through your terminal
  2. type: python data_manager.py INSERTDIRECTORYNAME

Example: python data_manager.py norrebro

The directory name given in your sys.argv[1] is the one that you would like to open and manage.