|
[Solved] Join and download splitted pdf into one from recursive http link single level depth |
|
Thread Tools | Display Modes |
#1
|
|||
|
|||
Join and download splitted pdf into one from recursive http link single level depth
I would to crawl from this site
**External links are only visible to Support Staff**www.saberhumano.emnuvens.com.br Is possible also to join splitted pdf into one following from this link scheme embedded in a single level depth ? **External links are only visible to Support Staff****External links are only visible to Support Staff** **External links are only visible to Support Staff****External links are only visible to Support Staff** **External links are only visible to Support Staff****External links are only visible to Support Staff** Pdf is splitted into many pdf articles so I shoul copy link by link but this is awful individual links are like so **External links are only visible to Support Staff****External links are only visible to Support Staff** **External links are only visible to Support Staff****External links are only visible to Support Staff** **External links are only visible to Support Staff****External links are only visible to Support Staff** |
#2
|
||||
|
||||
You can add support for those urls yourself by creating Linkcrawler Rules (use board search).
You need 2 rules. One to auto process the /showToc url. That rules will find all /view/xy/xy urls within that url and then another rule to process the single pdf links. In the end you will end up with autocrawling of all pdf parts of one showToc link. Auto merge/concat of those pdfs is not supported. You will have to use external/other tools for that. In case you need help with creating rules for those links, please let us know
__________________
JD-Dev & Server-Admin |
#3
|
|||
|
|||
thanks JIAZ,
Can you show me with some pictures how to set /view/xy/xy urls within that url ? I don't understand well, I make one rule so Quote:
|
#4
|
||||
|
||||
I will answer here by tomorrow
__________________
JD-Dev & Server-Admin |
#5
|
||||
|
||||
Quote:
__________________
JD-Dev & Server-Admin |
#6
|
|||
|
|||
thanks, it works good
|
|
|