Quote:
Originally Posted by Jiaz
@pspzockerscene: it IS the same user @I3ordo 
|
Ohh I see. Lack of concentration was clearly there :D
Quote:
Originally Posted by I3ordo
Grabs only the rar/archive files but leaves out the image of each article.
SO it s not like having JD ctrl+v'ing the clipboard.
|
Sure it's not. With DEEPDECRYPT LinkCrawler Rules you explicitely state "which part of the page you want to crawl content from".
I've modified your rule and enhanced the regular expression to include imgur.com URLs.
Please keep in mind that this will fail if the article image is hosted on another website and sure it is possible to make the rule more dynamic if needed.
Please also keep in mind that I recommend learning regular expressions so you can work on such rules yourself in the future.
Our support article links tools you can use to test/learn regular expressions:
https://support.jdownloader.org/Know...kcrawler-rules
Quote:
Originally Posted by I3ordo
Can the crawler rule be modified to include also the article images? I dont see any thing in pockerscene's rule that includes rar files and excludes everything else. it seems
|
You haven't understood how those rules work yet.
For rules of type DEEPDECRYPT, they look for certain stuff inside the websites' html code.
What to look for is up to you and regular expressions are used to find/filter those things.
Modified LinkCrawler rule(s):
Code:
[
{
"deepPattern": "<link>(https?://[^<]+)</link>",
"maxDecryptDepth": 1,
"name": "crawl all single item URLs from 'down3dmodels.com/feed/'",
"packageNamePattern": null,
"pattern": "**External links are only visible to Support Staff**,
"rewriteReplaceWith": null,
"rule": "DEEPDECRYPT",
"enabled": true,
"logging": false,
"updateCookies": true
},
{
"cookies": [],
"deepPattern": "((https?://i\\.imgur\\.com/[A-Za-z0-9]+\\.[A-Za-z]+)|>(https?://(?!down3dmodels\\.com/)[^<]+)</)",
"maxDecryptDepth": 1,
"name": "crawl all URLs inside all URLs from 'down3dmodels.com' except 'down3dmodels.com/feed/'",
"packageNamePattern": "<title>(.*?)</title>",
"pattern": "**External links are only visible to Support Staff**,
"rewriteReplaceWith": null,
"rule": "DEEPDECRYPT",
"enabled": true,
"logging": false,
"updateCookies": true
}
]
Rule as plaintext:
pastebin.com/raw/SHAH8dhn