JDownloader Community - Appwork GmbH
 

  #1  
Old 13.05.2020, 10:52
zreenmkr zreenmkr is offline
JD Addict
 
Join Date: Feb 2020
Posts: 169
JD can't add URLs with Unicode letters

It took me a while to realize why JD can't import some URLs. I ran an external link checker and the links are active, but for some odd reason JD won't import them. Because I use *.crawljob files exclusively to import, I didn't see what was happening until I tried to add a link via the clipboard, and there it was.

The links have been stripped down to the culprit characters from the original URLs.

Input urls
hxxp://example.com/с
hxxp://example.com/о

JD's 'Analyse and Add Links' input box and the *.crawljob file read them as:
hxxp://example.com/%D1%81
hxxp://example.com/%D0%BE

I guess my question is: why?
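
For reference, the encoded forms above match what you get by percent-encoding the UTF-8 bytes of the two path characters. A minimal Python sketch, assuming the characters are Cyrillic U+0441 and U+043E (this only illustrates the encoding with the standard library; it is not JDownloader's code):

```python
from urllib.parse import quote

# Assumption: the two path characters are Cyrillic U+0441 and U+043E.
# They are written as escapes so they cannot be mistaken for Latin c and o.
samples = ["\u0441", "\u043e"]

# quote() percent-encodes the UTF-8 bytes of each non-ASCII character.
encoded = {ch: quote(ch) for ch in samples}

for ch, enc in encoded.items():
    utf8_hex = ch.encode("utf-8").hex().upper()
    print(f"U+{ord(ch):04X}  UTF-8 bytes {utf8_hex}  ->  {enc}")
```

Each Cyrillic letter takes two bytes in UTF-8, which is why one visible character turns into two %XX groups.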
  #2  
Old 13.05.2020, 10:55
Jiaz Jiaz is offline
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 67,320

Please provide a working example link. I just tested and it works fine here.
__________________
JD-Dev & Server-Admin
  #3  
Old 20.05.2020, 01:34
zreenmkr zreenmkr is offline
JD Addict
 
Join Date: Feb 2020
Posts: 169

Here is one from my notes with the Unicode letter, see attached.
Attached Files
File Type: txt Unicode Url.txt (23 Bytes, 2 views)
  #4  
Old 20.05.2020, 15:36
Jiaz Jiaz is offline
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 67,320

I'm sorry, but your example is not a working link.
It is
Quote:
exampe.com/с
Please provide a real example link that you have a problem with.
  #5  
Old 22.05.2020, 03:15
zreenmkr zreenmkr is offline
JD Addict
 
Join Date: Feb 2020
Posts: 169

Those are the only examples I had in my notes with the Unicode letters. I didn't keep track of the real link. I will let you know once I run into another one.

Try copying everything in the text file and pasting it via 'Add New Links'; you will see it encode those letters as %D1%81%D0%BE.
  #6  
Old 22.05.2020, 11:08
Jiaz Jiaz is offline
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 67,320

Your example didn't contain any unicode, I just posted the content of the attachment file.
And
Quote:
%D1%81%D0%BE
equals
Quote:
co
It's normal URLEncoding.
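
A quick way to check what that sequence decodes to is to inspect the code points of the result. This is a minimal Python sketch using only the standard library, not anything taken from JDownloader itself:

```python
from urllib.parse import unquote

# unquote() decodes percent-escapes as UTF-8 by default.
decoded = unquote("%D1%81%D0%BE")

# List the code points so look-alike letters can be told apart.
code_points = [f"U+{ord(ch):04X}" for ch in decoded]

print(decoded)
print(code_points)
# Compare against the two-letter string typed with Latin keys.
print(decoded == "co")
```

The decoded string looks like "co" on screen, but the code points are U+0441 and U+043E rather than U+0063 and U+006F, so the comparison with the Latin string is False.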
  #7  
Old 28.05.2020, 02:44
zreenmkr zreenmkr is offline
JD Addict
 
Join Date: Feb 2020
Posts: 169

Quote:
Originally Posted by Jiaz
Your example didn't contain any unicode, I just posted the content of the attachment file.
And %D1%81%D0%BE equals со. It's normal URLEncoding.
Here is the screenshot, from my end, of the URL copied from the txt file attached previously:

hxxps://imgur.com/aCQV7Sd
Attached Images
File Type: png _jd_unicode_url.png (7.2 KB, 3 views)
  #8  
Old 28.05.2020, 10:16
Jiaz Jiaz is offline
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 67,320

I'm sorry but I still don't see any problem.

I've already explained that there is no unicode involved at all.
Quote:
%D1%81
is URLEncoded
Quote:
c
, see
urlencoder.org
  #9  
Old 04.06.2020, 07:40
zreenmkr zreenmkr is offline
JD Addict
 
Join Date: Feb 2020
Posts: 169

Quote:
Originally Posted by Jiaz
I've already explained that there is no unicode involved at all. %D1%81 is URLEncoded с
I see what you are saying. I wish I hadn't deleted the URL.

Quote:
urlencoder.org
Copying 'с' from the txt file attached above into that site does encode it as %D1%81,
but if I type 'c' on the keyboard it comes out as c.

What I still struggle to understand is how, why, and from where the 'с' in the txt file originally came. The URL had other random letters and numbers too, so why weren't those letters URL-encoded as well?

Quote:
I'm sorry but I still don't see any problem.
Because it encoded с into %D1%81, as shown in the screenshot, JD's link analyzer failed to crawl it as a valid URL, so no new links were added. Once I manually deleted %D1%81 and typed 'c' from the keyboard, it was then able to add the link to the LinkGrabber. (Again, I wish I'd saved the URL so you could test it.)
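
On the question of why the other letters and numbers were left alone: percent-encoding normally leaves ASCII letters and digits as they are and only escapes bytes outside that set. A minimal Python sketch, assuming a made-up path (the original URL was not kept, so `path` below is purely hypothetical):

```python
from urllib.parse import quote

# Hypothetical path: ASCII letters and digits with a single
# Cyrillic U+0441 in the middle, written as an escape for clarity.
path = "/abc123\u0441xyz"

# quote() keeps ASCII letters, digits and "/" as they are and
# percent-encodes the UTF-8 bytes of everything else.
encoded = quote(path)

print(encoded)
```

Only the one non-ASCII character turns into %D1%81; everything around it passes through unchanged. If the server expects a Latin c at that position, the encoded URL points at a different path.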

Last edited by zreenmkr; 04.06.2020 at 07:49.
  #10  
Old 04.06.2020, 13:50
Jiaz Jiaz is offline
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 67,320

Quote:
Originally Posted by zreenmkr
I see what you are saying. wish i hadn't deleted the url
No problem. Just report back in case you can reproduce the issue or get your hands on a new link.
  #11  
Old 04.06.2020, 13:55
Jiaz Jiaz is offline
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 67,320

Quote:
Originally Posted by zreenmkr
copy 'с' from the txt file attached above into the link it does encode into %D1%81
but if i type 'c' from the keyboard it encodes as c

с from the file is "CYRILLIC SMALL LETTER ES"
fileformat.info/info/unicode/char/0441/index.htm
c from keyboard is "LATIN SMALL LETTER C"
fileformat.info/info/unicode/char/0063/index.htm
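
The same lookup can be done locally. A minimal Python sketch using the standard `unicodedata` module; `describe` and `non_ascii_chars` are hypothetical helper names made up for this example:

```python
import unicodedata


def describe(ch: str) -> str:
    """Return the code point and official Unicode name of one character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"


def non_ascii_chars(url: str) -> list:
    """List (index, description) for every non-ASCII character in a URL."""
    return [(i, describe(ch)) for i, ch in enumerate(url) if ord(ch) > 127]


print(describe("\u0441"))  # the letter from the file
print(describe("c"))       # the letter typed on the keyboard
print(non_ascii_chars("http://example.com/\u0441"))
```

Running `non_ascii_chars` over a link before adding it is one way to spot a look-alike letter that was pasted in from another script.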

Last edited by Jiaz; 04.06.2020 at 13:59.
All times are GMT +2. The time now is 15:43.