My teammate and I use a Raspberry Pi on many of our projects. It is connected to the internet and kept in a remote location. We use it as a git server and ssh into it every now and then to see what the other is working on.
I recently wanted to scrape a lot of images and I couldn’t keep my personal computer on 24/7, so I ran the scraper on the Pi instead.
- Install chromedriver and selenium on the rpi.
sudo pip install selenium
sudo dpkg -i chromium-chromedriver_65.0.3325.181-0ubuntu0.14.04.1_armhf.deb
sudo apt-get install chromium-browser --yes
The chrome driver gets installed in /usr/lib/chromium-browser/chromedriver.
There may be additional dependencies. Some of them can also be installed with brew, which you can get here: http://linuxbrew.sh
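Before moving on, it can help to confirm that the chromedriver binary really is where the install step put it. The snippet below is only a minimal sketch using the Python standard library: `find_chromedriver` and `DEFAULT_CHROMEDRIVER` are names made up for this example, and the default path is simply the one mentioned above, so adjust it if your package installs somewhere else.

```python
import os
import shutil

# Path mentioned in this post; an assumption for any other setup.
DEFAULT_CHROMEDRIVER = "/usr/lib/chromium-browser/chromedriver"


def find_chromedriver(candidates=(DEFAULT_CHROMEDRIVER,)):
    """Return the first candidate that is an executable file.

    Falls back to whatever "chromedriver" resolves to on PATH,
    which is None when nothing is found there either.
    """
    for path in candidates:
        if os.path.isfile(path) and os.access(path, os.X_OK):
            return path
    return shutil.which("chromedriver")


if __name__ == "__main__":
    found = find_chromedriver()
    print(found or "chromedriver not found")
```

If this prints "chromedriver not found", the .deb probably did not install cleanly, and the dpkg output is the first place to look.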
After everything is installed, change into the directory that contains the image-downloading Python library.
sudo python3 google_images_download.py -k "keyword1, keyword2" --limit 400 -f jpg -s large -cd /usr/lib/chromium-browser/chromedriver
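If you would rather start the download from a Python script than type the command each time, something like the following could work. It is a rough sketch, not part of the library: `build_command` and `run_download` are hypothetical helpers that assemble the same flags used in the command above (-k, --limit, -f, -s, -cd) and pass them as an argument list, so the shell never has to parse the quoted keyword string. It leaves out sudo; prepend it if your setup needs it.

```python
import subprocess


def build_command(keywords, limit, chromedriver,
                  image_format="jpg", size="large"):
    """Build the argument list for google_images_download.py.

    Keywords are joined with ", " to match the command in this post.
    """
    return [
        "python3", "google_images_download.py",
        "-k", ", ".join(keywords),
        "--limit", str(limit),
        "-f", image_format,
        "-s", size,
        "-cd", chromedriver,
    ]


def run_download(keywords, limit, chromedriver):
    """Run the download from the current directory.

    Assumes google_images_download.py is in the working directory;
    raises CalledProcessError if the script exits with an error.
    """
    subprocess.run(build_command(keywords, limit, chromedriver), check=True)
```

Calling `run_download(["keyword1", "keyword2"], 400, "/usr/lib/chromium-browser/chromedriver")` from the library's directory should then be equivalent to the command above without sudo.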