I see multiple mistakes and wrong assumptions here:
First, coming back to your last post:
As you can see in your own screenshot, that captcha prompt is coming from "ally.sh" and not from reddit or from any site that you were trying to block.
That again is not on your "blacklist" so no wonder you're getting captcha prompts of that
Now to your most recent post:
Quote:
Originally Posted by verheiratet1952
plugin setup - reddit - crawler text mode - NEVER
|
Have you even read the description of that setting or ever tested it?
This will not(!) disable the reddit crawler.
This will only disable crawling text-content from reddit posts.
If you do not want JD to crawl links from provider XY, there is two ways to do this:
1. Settings -> Advanced Settings -> "GeneralSettings.crawlercrawlerpluginblacklist" and "GeneralSettings.crawlerhostpluginblacklist".
2. The filter settings you already found.
Now about the screenshot of your filters:
You got "reddit.com" in "Downloadurl...contains not" and at the same time "Sourceurl...contains" that alone makes no sense.
Without further explanation, I'd do it like this:
Screenshot:

If you want to filter YT too, you need to add
|youtube\.com to that filter field.
I also want to point out that there is a test button at the bottom of that window which can be used to test this rule on links. That saves time compared to testing it in your linkgrabber.
Also, if you want to block "all dialogs" such as those captcha dialogs, there is a "Enable silent mode" checkbox in the main toolbar which, if enabled, will block some(not all!) popups.
Quote:
Originally Posted by verheiratet1952
(my source text files have 50k lines and more...)
|
Maybe you should think about a whitelist instead of a blacklist?