JDownloader Community - Appwork GmbH
 

  #1  
07.01.2025, 16:38
tr909
I will play nice!
 
Join Date: Jan 2025
Posts: 2
How to crawl a forum for rapidgator links?

Hello, I've been trying to research this and get it working on my own over the holidays, but I failed badly, so I would like to ask for some help here.

I'd like to crawl the forum

**External links are only visible to Support Staff****External links are only visible to Support Staff**

in specific subsections for rapidgator links.

For example:

**External links are only visible to Support Staff****External links are only visible to Support Staff**
until
**External links are only visible to Support Staff****External links are only visible to Support Staff**

I can easily create a text file with links to each thread-list page of the forum, since only the page number increases and nothing else changes. However, I would need to crawl 2 levels deep to actually go into the threads and collect the links. I have read some threads about crawling deeper than the default setting, but I failed to implement or adapt them.
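
A minimal sketch of generating such a text file, assuming the thread-list pages follow a hypothetical `.../page-N` scheme. The real forum URL is hidden above, so `www.domain.com`, the section name, and both function names are placeholders:

```python
from pathlib import Path


def listing_urls(base_url: str, first_page: int, last_page: int) -> list[str]:
    """Build one URL per thread-list page, first_page..last_page inclusive."""
    return [f"{base_url}/page-{n}" for n in range(first_page, last_page + 1)]


def write_url_file(path: str, urls: list[str]) -> None:
    """Write one URL per line so the file can be used as a plain link list."""
    Path(path).write_text("\n".join(urls) + "\n", encoding="utf-8")


# Placeholder base URL (assumption): substitute the real forum section.
BASE = "https://www.domain.com/forums/example-section"
urls = listing_urls(BASE, 1, 3)
```

Calling `write_url_file("pages.txt", urls)` would then store the three example URLs, one per line.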

I also have problems setting up a filter that only catches rapidgator links. What is the best way to set the filter up for this? I've tried countless different ways of filtering, but none worked: either the whole page was added or nothing at all.

Any help is greatly appreciated
  #2  
08.01.2025, 08:59
pspzockerscene
Community Manager
 
Join Date: Mar 2009
Location: Deutschland
Posts: 75,259

Here are the rough options you have:

1. LinkCrawler rules:
https://support.jdownloader.org/know...kcrawler-rules

2. Collect links with browser addons:
https://support.jdownloader.org/know...orted-websites

In addition to this, you could also set up filter rules in JDownloader to ignore everything that is not a rapidgator.net link.
See Settings -> Linkgrabber filter
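
As a rough illustration of the condition such a filter has to express, here is a small Python sketch. It is not JDownloader's filter engine: the regular expression and the name `keep_rapidgator` are assumptions made for this example, and it only treats URLs on rapidgator.net (with or without `www.`) as rapidgator links:

```python
import re

# Assumption: rapidgator links are http(s) URLs on rapidgator.net or www.rapidgator.net.
RAPIDGATOR = re.compile(r"https?://(?:www\.)?rapidgator\.net/\S+", re.IGNORECASE)


def keep_rapidgator(links):
    """Return only the links whose whole URL matches the rapidgator pattern."""
    return [link for link in links if RAPIDGATOR.fullmatch(link)]


sample = [
    "https://rapidgator.net/file/abc123/example.rar.html",
    "https://www.domain.com/threads/examplethread",
    "https://notrapidgator.net/file/abc123",
]
kept = keep_rapidgator(sample)
```

Anchoring the host right after `://` is what keeps look-alike domains such as the third sample entry out of the result.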
__________________
JD Supporter, Plugin Dev. & Community Manager

  #3  
08.01.2025, 10:19
Jiaz
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 82,935

@tr909: A LinkCrawler rule can look for download links AND also follow the *next* page link, and thus automatically crawl through the pages. Please give it a try yourself first, but of course you can ask for help when you're stuck.
__________________
JD-Dev & Server-Admin
  #4  
09.01.2025, 14:14
tr909
I will play nice!
 
Join Date: Jan 2025
Posts: 2

I spent the last 2 hours working on this and learning about regular expressions, and I'm getting closer to making the LinkCrawler rule work, but I ran into something I don't know how to handle.


The threads are set up like this: **External links are only visible to Support Staff**www.domain.com/threads/page1
but the single threads containing the rapidgator links are set up like this:

**External links are only visible to Support Staff**www.domain.com/threads/examplethread

How should I set up my pattern and deepPattern? I already have my filter up and running, only allowing rapidgator links to be collected.

So should my deepPattern target the /threads/ URL or rapidgator?
  #5  
09.01.2025, 19:40
Jiaz
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 82,935

You need 2 rules:
1.) A rule (deep decrypt) that matches on .../forums/... and /forums.../page-number.
Its deep pattern should match on the *next* pages and the threads.
2.) A rule (deep decrypt) that matches on .../threads/.+
Its deep pattern should match on the messages section (to avoid finding other stuff).
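
The two rules above could be sketched roughly as follows. This is only an illustration under several assumptions: the host `www.domain.com`, the `/forums/<section>/page-N` layout, and the `href="..."` markup are hypothetical because the real forum is hidden; the field names `enabled`, `name`, and `rule` with the value `DEEPDECRYPT` are written as recalled from the LinkCrawler rules article linked in post #2 and should be checked there; and since the forum's HTML is unknown, rule 2 matches rapidgator.net links directly instead of narrowing to the messages section. The Python part only builds the JSON and lets the patterns be tried on sample text:

```python
import json
import re

# Hypothetical host (assumption): the real forum URL is hidden in the thread.
HOST = r"https?://www\.domain\.com"

rules = [
    {
        # Rule 1: thread-list pages of a section, with or without a page number.
        "enabled": True,
        "name": "forum listing pages (sketch)",
        "rule": "DEEPDECRYPT",
        "pattern": HOST + r"/forums/[^/?#]+(?:/page-\d+)?/?",
        # Pick up only further listing pages and thread links (assumed href markup).
        "deepPattern": r'href="(/(?:forums/[^"/]+/page-\d+|threads/[^"]+))"',
    },
    {
        # Rule 2: individual thread pages.
        "enabled": True,
        "name": "thread pages (sketch)",
        "rule": "DEEPDECRYPT",
        "pattern": HOST + r"/threads/.+",
        # Substitute for "messages section": match rapidgator.net links directly.
        "deepPattern": r'(https?://(?:www\.)?rapidgator\.net/[^\s"<>]+)',
    },
]

# JSON text of the rule list, for comparison with the documented rule format.
rules_json = json.dumps(rules, indent=2)
```

Trying both `deepPattern` values with `re.findall` on a saved copy of a listing page and a thread page is a quick way to see what each rule would pick up before testing it in JDownloader. Whether relative matches such as `/threads/...` are resolved against the page URL is not verified here.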

You can always ask for more help, but it's good to read that you have already come so far.
All times are GMT +2. The time now is 18:32.