Thread: [Solved] crawler rule help
  #15  
28.06.2021, 19:58
wanko

Quote:
Originally Posted by pspzockerscene
They seem to have totally different pages/alignments:
Wider attempt which doesn't work for all URLs either:
Spoiler:
Code:
[
  {
    "maxDecryptDepth": 0,
    "name": "nudecosplaygirls.com rule",
    "pattern": "https?://(?:www\\.)?nudecosplaygirls\\.com/[\\w\\-]+/",
    "rule": "DEEPDECRYPT",
    "deepPattern": "(?:class=\"alignnone size-(?:full|medium) wp-image-[0-9]+\" |wp-block-image size-(?:large|medium).*?)src=\"(**External links are only visible to Support Staff**]+)\"",
    "packageNamePattern": "<title>(.*?)( - nudecosplaygirls)?</title>"
  }
]


Widest attempt which just grabs all wordpress .jpg image URLs:
Code:
[
  {
    "maxDecryptDepth": 0,
    "name": "nudecosplaygirls.com rule",
    "pattern": "https?://(?:www\\.)?nudecosplaygirls\\.com/[\\w\\-]+/",
    "rule": "DEEPDECRYPT",
    "deepPattern": "(**External links are only visible to Support Staff**,
    "packageNamePattern": "<title>(.*?)( - nudecosplaygirls)?</title>"
  }
]
As plaintext:
pastebin.com/fELqVEHc

-psp-
There is only 1 picture, but it grabs the related post thumbnails too:
**External links are only visible to Support Staff****External links are only visible to Support Staff**
https://imgur.com/a/gipcXQY


**External links are only visible to Support Staff****External links are only visible to Support Staff**
has only 1 picture, but

it also grabs **External links are only visible to Support Staff****External links are only visible to Support Staff**
**External links are only visible to Support Staff****External links are only visible to Support Staff**


But this one doesn't grab any related thumbnails:
**External links are only visible to Support Staff****External links are only visible to Support Staff**

I have filtered \d\dx\d\d and rta/scaled, so it won't grab any resized images or rta.
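
To illustrate what those two filters do, here is a minimal Python sketch (not JDownloader code). It assumes the filters behave like "contains" regexes applied to each found URL, and that rta/scaled means the two substrings rta and scaled. The example.com URLs and the is_filtered helper are made up for the illustration.

```python
import re

# Assumption: both filter expressions are applied as "contains" regexes
# to every URL the rule finds.
FILTERS = [
    re.compile(r"\d\dx\d\d"),    # size suffixes such as -150x150 or -300x200
    re.compile(r"rta|scaled"),   # assumed meaning of "rta/scaled"
]


def is_filtered(url: str) -> bool:
    """Return True if any filter expression matches somewhere in the URL."""
    return any(p.search(url) for p in FILTERS)


# Hypothetical WordPress-style URLs, for illustration only.
urls = [
    "https://example.com/wp-content/uploads/2021/06/photo.jpg",
    "https://example.com/wp-content/uploads/2021/06/photo-150x150.jpg",
    "https://example.com/wp-content/uploads/2021/06/photo-scaled.jpg",
    "https://example.com/wp-content/uploads/2021/06/rta.png",
]

# Only URLs that no filter matches are kept.
kept = [u for u in urls if not is_filtered(u)]
for u in kept:
    print(u)
```

Note that a bare rta also matches inside longer words (for example "portal"), so a tighter expression may be needed if wanted images get dropped.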

Last edited by wanko; 28.06.2021 at 20:13.