Define “scraping”. Judging from the tools you posted, you’re looking to give them an introduction to scraping that doesn’t require writing code. I don’t know what the tools in that realm are like, for better or for worse.
For people who are writing code, I would recommend using Python + requests library if what they’re pulling are straight files/HTML (for example, the tax bills scraper). To parse HTML, I would recommend BeautifulSoup4 in Python.
However, I’ve found headless systems are very slow, and it can be better to use Chrome dev tools to isolate out the AJAX requests that get the necessary data, and make/read those directly. Oftentimes those will return well-formatted JSON to boot!
Pdftotext is a great library if you’re trying to extract text from PDFs that are text-based; tesseract will work otherwise but is slow & memory intensive (as OCR is).