#1
|
|||
|
|||
I am asking for better improvements in link recognition
I am asking for better improvements in link recognition.
See: https://i.postimg.cc/sXjXCzRR/Screen...t-11-58-AM.jpg |
#2
|
||||
|
||||
How about posting the actual example link or do you expect me to type the link from that screenshot?
In case the link contains supported/known file extension then it should be picked up by generic http plugin. Else it should be supported by deep decryption in case it leads to downloadable content.
__________________
JD-Dev & Server-Admin |
#3
|
|||
|
|||
I mean now, because I found a way to extract links from a binary file. But I would like JD to recognize links without percentages.
|
#4
|
||||
|
||||
percentages, i assume from his previous queries he is referring to urlencoding.
__________________
raztoki @ jDownloader reporter/developer http://svn.jdownloader.org/users/170 Don't fight the system, use it to your advantage. :] |
#5
|
||||
|
||||
@djmakinera: as I already explained, urlencoding is normal for URLs and are already supported in given situations
__________________
JD-Dev & Server-Admin |
#6
|
|||
|
|||
This means that the wrong link will be analyzed, e.g. with a space. This is not normal.
|
#7
|
||||
|
||||
How about some example links? spaces in URLs must be urlencoded!
__________________
JD-Dev & Server-Admin |
#8
|
|||
|
|||
It's just that the editor correctly recognizes only URL. in JD not always. In some cases, it just does not work. And you can not even start the normal parsing of the link!
Example: https%3A%2F%2Fsoundcloud.com%2Fdj-itronix%201 |
#9
|
||||
|
||||
will be fixed with next core update but the example link is invalid
__________________
JD-Dev & Server-Admin |
#10
|
|||
|
|||
It is correct, just simply incorrectly recognized urls
%3A%2F%2Fsoundcloud.com%2Fdj-itronix%201 soundcloud.com/dj-itronixwhitespace1 Blue - correct urls Red - plaint text Orange - white space |
#11
|
||||
|
||||
how should JDownloader know that the 'whitespace 1' doesn't belong to the URL?
__________________
JD-Dev & Server-Admin |
#12
|
|||
|
|||
It still depends on the site, but the soundcloud links do not contain spaces, so you can end the link where the white space begins
(?i)\b((?:[a-z][\w-]+:(?:\/{1,3}|[a-z0-9%])|www\d{0,3}[.]|[a-z0-9.\-]+[.][a-z]{2,4}\/)(?:[^\s()<>]+|\(([^\s()<>]+|(\([^\s()<>]+\)))*\))+(?:\(([^\s()<>]+|(\([^\s()<>]+\)))*\)|[^\s`!()\[\]{};:'".,<>?«»“”‘’])) |
#13
|
|||
|
|||
I wrote a regular expression that partially solves this problem... with a space will work, as long as there is no other word
See screenshot: https://postimg.cc/Kk6C7MBc |
#14
|
||||
|
||||
urls contain urlencoding for space %20 thus is valid according to your source.
__________________
raztoki @ jDownloader reporter/developer http://svn.jdownloader.org/users/170 Don't fight the system, use it to your advantage. :] Last edited by raztoki; 14.06.2019 at 08:35. Reason: missing l in urls |
#15
|
||||
|
||||
The processing of the urlencoding happens LONG LONG before any plugin kicks in.During that processing JDownloader cannot know that the space may be part of the URL as it's fully valid url or seperator
__________________
JD-Dev & Server-Admin |
#16
|
||||
|
||||
copy the source code, it be either urlencoded on the website, or within quotation marks '' or "" thus tells you the beginning or ending of url (jd parser supports this). if you copy within quotation marks (yourself), its good practice then to urlencode as it could contain spaces, else you can have issues.
__________________
raztoki @ jDownloader reporter/developer http://svn.jdownloader.org/users/170 Don't fight the system, use it to your advantage. :] |
#17
|
|||
|
|||
The solution is to open the file as ASCII (binary), which separates the invalid characters, detects only the URL in the editor, treats all other characters as non-url, but still JD2 still adds strange unicode.
Example: Last edited by Jiaz; 14.06.2019 at 18:13. |
#18
|
||||
|
||||
JDownloader doesn't add them. They are there. JDownloader doesn't magically add stuff to your clipboard content
__________________
JD-Dev & Server-Admin |
Thread Tools | |
Display Modes | |
|
|