Run like so: python3 scrape.py json_filepath.json num_concurrent_threads num_questions_to_scrape, ie. python3 scrape.py stack-exchange.json 200 50000 I highly recommend running this on some faster ...