#1
[LinkCrawler Rule] aliexpress.com image and videos
When I enable this rule:
Code:
"name" : null,
"id" : 1649932308721,
"enabled" : true,
"pattern" : "https?://www\\.aliexpress\\.com/item/\\d+\\.html",
"formPattern" : null,
"passwordPattern" : null,
"updateCookies" : true,
"logging" : true,
"rewriteReplaceWith" : null,
"deepPattern": "<img src=\"(https?://[^\"]+)",
"packageNamePattern" : "<title>(.+?)</title>",
"maxDecryptDepth" : 1
But when I delete the line "deepPattern": "<img src=\"(https?://[^\"]+)", JD returns a lot of resized images like this:
**External links are only visible to Support Staff**
Removing the _50x50.jpg_.webp suffix gives the non-resized image, but I don't know how to write a rewrite that strips the \d+x\d+.jpg part.
That page also has one mp4 file, but it is not captured by JD:
**External links are only visible to Support Staff**
What I have tried so far:
1 - getting deepPattern to work
2 - rewriting the image URL
Last edited by wanko; 14.04.2022 at 15:24.
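For readers following along, here is a minimal Python sketch of the suffix removal asked about above. It only illustrates the regex idea; it is not a JDownloader rule. The sample URL path is hypothetical (the real links are hidden in this thread), and the sketch assumes the resized suffix always has the form _WIDTHxHEIGHT.jpg, optionally followed by _.webp.

```python
import re

# Assumption: resized URLs end in "_<width>x<height>.jpg", optionally followed by "_.webp".
RESIZE_SUFFIX = re.compile(r"_\d+x\d+\.jpg(?:_\.webp)?$")


def strip_resize_suffix(url: str) -> str:
    """Return the URL without the trailing resize suffix, if there is one."""
    return RESIZE_SUFFIX.sub("", url)


# Hypothetical sample URL; only the host name appears in the thread.
sample = "https://ae01.alicdn.com/kf/EXAMPLE.jpg_50x50.jpg_.webp"
print(strip_resize_suffix(sample))
```

URLs that carry no such suffix pass through unchanged, so the function can be applied to every captured link.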
#2
Quote:
The images are available as plain JSON within the HTML. If you need help with it, please let us know.
__________________
JD-Dev & Server-Admin
Last edited by Jiaz; 14.04.2022 at 18:48.
#3
So I should not highlight an element and use "view selection source"?
Do you mean this data: {"actionModule":{" ? If so, I only see the product images in the source view, not the description images.
#4
@wanko: Yes, you cannot use the rendered DOM. JDownloader only *sees* the pure/raw downloaded HTML file: no DOM, no CSS, no JS.
The descriptions are also available in the source, e.g. Quote:
__________________
JD-Dev & Server-Admin
#5
Oh, I see. I just found the description, but it is inside desc.htm:
**External links are only visible to Support Staff**
Last edited by wanko; 14.04.2022 at 20:32.
#6
update2: OK, with this it seems to work:
"deepPattern" : "actionModule|(https?://[^_"]+)|^(?=.*zip).*",
1 - capture all links that contain _ (resized images) and zip (description)
2 - a Packagizer rule adds a prefix to the name to prevent mirror-link problems
But it looks like JD can't render that zip htm file.
Last edited by wanko; 15.04.2022 at 19:15.
#7
What do you mean? JDownloader only loads raw HTML files: no rendering, no JS, no CSS.
Your deepPattern doesn't make much sense. For example, actionModule just matches the literal text "actionModule". The second part only matches http(s) links up to the first " or _. The third part doesn't make sense either, as it is anchored to the start of the line with ^. I think you wanted to match zip URLs, but that's not how it works. What kind of links are you looking for?
__________________
JD-Dev & Server-Admin
Last edited by Jiaz; 16.04.2022 at 11:34.
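The points above can be checked with a small Python sketch. JDownloader uses Java regular expressions; the sketch assumes Python's re module treats alternation, character classes and lookaheads the same way for this pattern. Both sample lines are hypothetical and not taken from a real AliExpress page.

```python
import re

# The deepPattern from post #6, written as a Python raw string.
PATTERN = re.compile(r'actionModule|(https?://[^_"]+)|^(?=.*zip).*')


def explain(text):
    """Return (whole match, capture group 1) for every match in text."""
    return [(m.group(0), m.group(1)) for m in PATTERN.finditer(text)]


# Hypothetical line of raw page source containing a resized image link.
image_line = 'data: {"actionModule":{"img":"https://ae01.alicdn.com/kf/EXAMPLE.jpg_50x50.jpg"}}'
# Hypothetical line that contains the text "zip" somewhere.
zip_line = 'see "descriptionUrl":"https://example.invalid/desc.zip"'

for line in (image_line, zip_line):
    for whole, group1 in explain(line):
        print(repr(whole), "-> group 1:", repr(group1))
```

In the first line, the literal actionModule matches without capturing anything, and the link is captured only up to the first _ . In the second line, the third alternative swallows the whole line and captures nothing, so no zip URL is extracted.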
#8
Yeah, that | acts as OR, not as an AND condition, so actionModule is useless in this case, and zip too.
About [^_"]+: I had not noticed the ", thank you.
Quote:
I am looking for all jpg files on ae01.alicdn.com, but without the \d+x\d+ suffix (resized):
**External links are only visible to Support Staff** -> **External links are only visible to Support Staff**
That zip htm link contains the images in the description; I want the images from that section too:
"descriptionUrl":"**External links are only visible to Support Staff**
#9
Can you provide one or two more example links from AliExpress?
__________________
JD-Dev & Server-Admin
#10
**External links are only visible to Support Staff**
**External links are only visible to Support Staff**
**External links are only visible to Support Staff**
**External links are only visible to Support Staff**
#11
Two chained rules could do what you want, e.g.:
Code:
[
  {
    "enabled": true,
    "logging": false,
    "maxDecryptDepth": 1,
    "name": "example rule for aeproductsourcesite.alicdn.com/product/description",
    "pattern": "https?://aeproductsourcesite\\.alicdn\\.com/product/description/pc/.+",
    "rule": "DEEPDECRYPT",
    "packageNamePattern": null,
    "passwordPattern": null,
    "deepPattern": "class=\"detail-desc-decorate-image\" src=\"(https?://[^\"]+.jpg)\""
  },
  {
    "enabled": true,
    "logging": false,
    "maxDecryptDepth": 1,
    "name": "example rule for aliexpress.com single items",
    "pattern": "**External links are only visible to Support Staff**",
    "rule": "DEEPDECRYPT",
    "packageNamePattern": null,
    "passwordPattern": null,
    "deepPattern": "(https?://aeproductsourcesite\\.alicdn\\.com/[^\"]+)"
  }
]
pastebin.com/raw/h2GmChSp
Please invest some time to learn this yourself, as we've already created a lot of LinkCrawler rules for you...
-psp-
EDIT
To get videos too, you will need to slightly modify the first rule...
__________________
JD Supporter, Plugin Dev. & Community Manager
First Steps & Tutorials || JDownloader 2 Setup Download
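To illustrate how the two rules above chain together, here is a rough Python simulation. It is only a sketch of the idea (links found by the item rule are handed to the description rule), not how JDownloader is implemented. The item URL, the description URL path, the image URLs and the FAKE_WEB dictionary are all hypothetical stand-ins for real downloads; only the regular expressions come from the rules in this post, un-escaped from JSON.

```python
import re

# deepPattern of the second rule: find description links in the item page source.
ITEM_DEEP = re.compile(r'(https?://aeproductsourcesite\.alicdn\.com/[^"]+)')
# pattern of the first rule: decides whether a found link is handled by that rule.
DESC_PATTERN = re.compile(r'https?://aeproductsourcesite\.alicdn\.com/product/description/pc/.+')
# deepPattern of the first rule: find image links in the description source.
DESC_DEEP = re.compile(r'class="detail-desc-decorate-image" src="(https?://[^"]+.jpg)"')

# Hypothetical URLs and raw sources; nothing is downloaded in this sketch.
ITEM_URL = "https://www.aliexpress.com/item/0000000000.html"
DESC_URL = "https://aeproductsourcesite.alicdn.com/product/description/pc/v2/EXAMPLE.zip"
FAKE_WEB = {
    ITEM_URL: '{"descriptionUrl":"' + DESC_URL + '"}',
    DESC_URL: (
        '<img class="detail-desc-decorate-image" src="https://ae01.alicdn.com/kf/A.jpg"/>'
        '<img class="detail-desc-decorate-image" src="https://ae01.alicdn.com/kf/B.jpg"/>'
    ),
}


def crawl_item(item_url):
    """Apply the item rule, then feed each matching link to the description rule."""
    images = []
    for found in ITEM_DEEP.findall(FAKE_WEB[item_url]):
        if DESC_PATTERN.fullmatch(found):
            images.extend(DESC_DEEP.findall(FAKE_WEB[found]))
    return images


print(crawl_item(ITEM_URL))
```

Each rule only ever sees raw source text, which is why the description images become reachable once the descriptionUrl link is picked up by the second rule.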
#12
Oh, thank you for that rule.
I had to modify it with "|", and it looks like getting the video will not work because it requires script rendering, so viewing the source won't help. So it is impossible to get the video:
**External links are only visible to Support Staff**
Last edited by wanko; 08.05.2022 at 04:27.
#13
I'm unable to find any video on that website.
If it's not possible using LinkCrawler Rules you might want to try this method. -psp-
__________________
JD Supporter, Plugin Dev. & Community Manager
First Steps & Tutorials || JDownloader 2 Setup Download
#14
@wanko: The video is played via a JavaScript player, so JDownloader doesn't see it,
but the method hinted at by pspzockerscene should work fine.
__________________
JD-Dev & Server-Admin