Add exclusion improvement TODO to README

diff --git a/ b/
@@ -45,3 +45,4 @@ Run: "poetry run python -m pytest"
 - TODO: exclude raw-text blocks from indexed content
 - TODO: strip control characters from logged output like URLs
 - TODO: fix bug in calulation of backlinks (iirc the bug is visible on
+- TODO: refactor manual exclusion logic to be regex-based instead of prefix-based. we could get more nuanced with exclusion logic this way