Changelog

Version 1.0.2 - 24.05.2024

  • Enhanced scraping robustness with centralized handling of user-agent rotation, cookie management, and retry logic.
  • Refactored scraping utilities into a dedicated module (`scraper_utils.py`) for better maintainability and reuse.
  • Updated `webpage_scraper.py` and `articles_scraper.py` to leverage the new centralized utilities.

Version 1.0.1 - 30.08.2024

  • Added support for scraping additional content types.
  • Improved error handling and logging for better debugging.
  • Optimized performance for large-scale scraping tasks.

Version 1.0.0 - 20.08.2024

  • Initial release of Scrape That.
  • Implemented core scraping functionality for extracting links and content from web pages.
  • Added support for handling sitemaps and parsing HTML content.