JDownloader Community - Appwork GmbH
 

Reply
 
Thread Tools Display Modes
  #1  
Old 11.12.2016, 19:03
nathan1 nathan1 is offline
Storm
 
Join Date: Apr 2012
Posts: 214
Default Join and download splitted pdf into one from recursive http link single level depth

I would to crawl from this site
**External links are only visible to Support Staff**www.saberhumano.emnuvens.com.br

Is possible also to join splitted pdf into one following from this link scheme embedded in a single level depth ?

**External links are only visible to Support Staff****External links are only visible to Support Staff**
**External links are only visible to Support Staff****External links are only visible to Support Staff**
**External links are only visible to Support Staff****External links are only visible to Support Staff**

Pdf is splitted into many pdf articles so I shoul copy link by link but this is awful

individual links are like so

**External links are only visible to Support Staff****External links are only visible to Support Staff**
**External links are only visible to Support Staff****External links are only visible to Support Staff**
**External links are only visible to Support Staff****External links are only visible to Support Staff**
Reply With Quote
  #2  
Old 12.12.2016, 15:44
Jiaz's Avatar
Jiaz Jiaz is offline
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 62,508
Default

You can add support for those urls yourself by creating Linkcrawler Rules (use board search).
You need 2 rules. One to auto process the /showToc url. That rules will find all /view/xy/xy urls within that url and then another rule to process the single pdf links. In the end you will end up with autocrawling of all pdf parts of one showToc link. Auto merge/concat of those pdfs is not supported. You will have to use external/other tools for that. In case you need help with creating rules for those links, please let us know
__________________
JD-Dev & Server-Admin
Reply With Quote
  #3  
Old 13.12.2016, 19:03
nathan1 nathan1 is offline
Storm
 
Join Date: Apr 2012
Posts: 214
Default

thanks JIAZ,
Can you show me with some pictures how to set /view/xy/xy urls within that url ?
I don't understand well, I make one rule so



Quote:
you will end up with autocrawling
Maybe I need to help to understand better this tip
Reply With Quote
  #4  
Old 13.12.2016, 19:06
Jiaz's Avatar
Jiaz Jiaz is offline
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 62,508
Default

I will answer here by tomorrow
__________________
JD-Dev & Server-Admin
Reply With Quote
  #5  
Old 20.12.2016, 16:01
Jiaz's Avatar
Jiaz Jiaz is offline
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 62,508
Default

Quote:
[ {
"maxDecryptDepth" : 2,
"pattern" : "https?://saberhumano\\.emnuvens\\.com\\.br/sh/(issue/view/\\d+/showToc|article/view/\\d+/\\d+)",
"rule" : "DEEPDECRYPT",
"deepPattern" : "(https?://saberhumano\\.emnuvens\\.com\\.br/sh/article/(?:view|download)/\\d+/\\d+)"
}, {
"pattern" : "https?://saberhumano\\.emnuvens\\.com\\.br/sh/article/download/\\d+/\\d+",
"rule" : "DIRECTHTTP"
} ]
Put this into Settings-Advanced Settings-LinkCrawler.linkcrawlerrules
__________________
JD-Dev & Server-Admin
Reply With Quote
  #6  
Old 18.02.2017, 19:42
nathan1 nathan1 is offline
Storm
 
Join Date: Apr 2012
Posts: 214
Default

thanks, it works good
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

All times are GMT +2. The time now is 12:53.
Provided By AppWork GmbH | Privacy | Imprint
Parts of the Design are used from Kirsch designed by Andrew & Austin
Powered by vBulletin® Version 3.8.10 Beta 1
Copyright ©2000 - 2019, Jelsoft Enterprises Ltd.