JDownloader Community - Appwork GmbH

28.07.2023, 20:47
Nimboid
Vacuum Cleaner
Join Date: Jan 2023
Location: UK
Posts: 18
Handling DEEPDECRYPT relative URLs

I am trying to acquire links to attachments in a phpBB-hosted forum. This requires being logged in, so I have provided my user/pass to JD via the 'Basic Authentication' settings. By default, when given a suitable forum page URL, JDownloader finds only the thumbnails and does not follow the href links:
<a class="file-preview " href="/phpBB2/index.php?attachments/capture-jpg.1988988/" target="_blank">
	<img src="/phpBB2/data/attachments/1915/1915251-5369dc6cb5e0da0fb7fd46255176b083.jpg" alt="Capture.JPG" width="264" height="200" loading="lazy">
The logs don't reveal any errors. Am I right in thinking that spending time on a LinkCrawler Rule is the way to go?

To this end, I have made a DEEPDECRYPT rule. Its pattern does match the page URL: the LinkCrawlerRule...log contains the page source, and the rule's cookies get updated. However, nothing is added to the LinkGrabber pane, and the logs show no trace of why.
My deepPattern:
  "deepPattern"        : "(?i)<a class=\"file-preview \" (href=\"[^\"]+\") target=\"_blank\">",
Do I need to follow this with a second LinkCrawler Rule, of type REWRITE, to convert the relative href to an absolute URL? If so, what should its pattern be: should it expect to receive and match the whole of the 'upstream' deepPattern match, or just its capturing group? I have tried a follow-up rule without success: it doesn't generate its own LinkCrawlerRule...log, and no errors appear in the logs either.
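To make the question concrete, here is a minimal Python sketch, run outside JD, of what the capturing group in my deepPattern yields for the HTML above, and what a relative href would resolve to. It uses Python's `re` rather than Java's regex engine, though a pattern this simple should behave the same in both. The `narrow` pattern and the `forum.example.com` page URL are assumptions made up for illustration, and the sketch says nothing about how JDownloader itself treats the group or resolves relative URLs.

```python
import re
from urllib.parse import urljoin

# The anchor tag quoted earlier in this post.
html = ('<a class="file-preview " '
        'href="/phpBB2/index.php?attachments/capture-jpg.1988988/" '
        'target="_blank">')

# The deepPattern as posted (JSON escaping removed): the group wraps the
# whole attribute, including the leading href=" and the closing quote.
posted = r'(?i)<a class="file-preview " (href="[^"]+") target="_blank">'

# Hypothetical variant: the group wraps only the attribute value.
narrow = r'(?i)<a class="file-preview " href="([^"]+)" target="_blank">'

posted_groups = re.findall(posted, html)
narrow_groups = re.findall(narrow, html)

# Hypothetical page URL; the real forum address is not given here.
page_url = "https://forum.example.com/phpBB2/index.php?threads/some-thread.12345/"

# Resolve each captured relative path against the page it was found on.
absolute_urls = [urljoin(page_url, href) for href in narrow_groups]

print(posted_groups)   # what the posted group captures
print(narrow_groups)   # what the narrower group captures
print(absolute_urls)   # the relative path resolved against the page URL
```

With the posted pattern, the group is `href="/phpBB2/index.php?attachments/capture-jpg.1988988/"`, attribute name and quotes included, whereas the narrower group is just the path. If JD treats the captured group as the link candidate, that difference may matter, but I have not confirmed that it does.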

It's frustrating to trawl this forum and find that so much of the content of LinkCrawler Rule discussions is hidden behind "**External links are only visible to Support Staff**"!