Downloads feeds (e.g. RSS) and all pages they link to, like a crawler/scraper, and stores the collected RSS articles in ZIP files.
Only the RSS <article> parts are stored.
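
The flow described above (read the feed, follow each link, write the results into a ZIP file) can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the actual code in feeddownloader/downloader.py: the names `extract_links`, `archive_feed`, `fake_fetch` and `SAMPLE_FEED` are hypothetical, the archive member naming scheme is an assumption, and the sketch stores each fetched page whole rather than isolating the <article> part.

```python
import io
import zipfile
import xml.etree.ElementTree as ET
from typing import Callable


def extract_links(feed_xml: str) -> list[str]:
    """Return the <link> URL of every <item> in an RSS 2.0 feed."""
    root = ET.fromstring(feed_xml)
    links = []
    for item in root.iter("item"):
        link = item.findtext("link")
        if link:
            links.append(link.strip())
    return links


def archive_feed(feed_xml: str, fetch: Callable[[str], bytes], zip_target) -> list[str]:
    """Fetch every linked page and write it into a ZIP archive.

    `fetch` is injected so the sketch runs without network access.
    Returns the archive member names in the order they were written.
    """
    stored = []
    with zipfile.ZipFile(zip_target, "w", zipfile.ZIP_DEFLATED) as archive:
        for index, url in enumerate(extract_links(feed_xml)):
            name = f"{index:04d}.html"  # hypothetical naming scheme
            archive.writestr(name, fetch(url))
            stored.append(name)
    return stored


# Tiny in-memory feed standing in for a real RSS URL (assumption).
SAMPLE_FEED = """<rss version="2.0"><channel>
<item><title>A</title><link>https://example.com/a</link></item>
<item><title>B</title><link>https://example.com/b</link></item>
</channel></rss>"""


def fake_fetch(url: str) -> bytes:
    """Stand-in for an HTTP download; returns a dummy page body."""
    return f"<article>{url}</article>".encode("utf-8")


buffer = io.BytesIO()
names = archive_feed(SAMPLE_FEED, fake_fetch, buffer)
print(names)  # ['0000.html', '0001.html']
```

To download real pages, `fake_fetch` could be replaced by a function built on `urllib.request.urlopen`, and `buffer` by a path inside the output folder.
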
- Python 3.10+
- Poetry (see https://python-poetry.org/docs/#installation)
- set up (once):
poetry install --only=main --sync
- run:
poetry run python feeddownloader/downloader.py <RSS-URL> <OUTPUT-FOLDER>
Example:
poetry run python feeddownloader/downloader.py "https://www.tagesschau.de/index~rss2.xml" output/