JDownloader Community - Appwork GmbH
 

#1
18.09.2019, 11:58
djmakinera
Banned
 
Join Date: May 2010
Location: Poland
Posts: 8,387

How to recover special characters in file names?

How can I recover special characters in file names? I don't know why the application doesn't show the correct characters even though it supports Unicode (the file is saved as UTF-8 with signature, i.e. with a BOM).

Look at the first character in the editor; several other characters are affected as well.

http://www.imagebam.com/image/6564621317793859
#2
18.09.2019, 13:15
Jiaz
JD Manager
 
Join Date: Mar 2009
Location: Germany
Posts: 79,523

How about example links? How should your screenshot help?
__________________
JD-Dev & Server-Admin
#3
18.09.2019, 14:23
djmakinera
Banned

**External links are only visible to Support Staff****External links are only visible to Support Staff**

I just can't enter some special characters when renaming in JD2.
I can only copy the names to the text editor. I happened to check the editor settings: UseDirectWrite must be disabled.

Code:


#4
18.09.2019, 14:36
Jiaz
JD Manager

I'm sorry, but I don't understand. There are no example links, just broken UTF-8.
#5
18.09.2019, 14:38
Jiaz
JD Manager

0x11 is a control code, not a visible character.
utf8-chartable.de/unicode-utf8-table.pl?utf8=0x
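
The point about 0x11 can be checked quickly. This is only a minimal sketch in Python (not anything JDownloader itself does): the standard `unicodedata` module reports the general category of a code point, and `Cc` means "control character", which has no glyph of its own.

```python
import unicodedata

ch = chr(0x11)  # the code point mentioned above (U+0011)

# General category "Cc" = control character; there is no visible glyph.
print(unicodedata.category(ch))  # Cc
# Python also does not consider it printable.
print(ch.isprintable())          # False
# In UTF-8 it is stored as the single byte 0x11.
print(ch.encode("utf-8"))        # b'\x11'
```

How such a character shows up on screen (empty box, placeholder, nothing at all) is decided by the editor and the font, not by the encoding.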
#6
18.09.2019, 14:38
Jiaz
JD Manager

Whether such characters are rendered or ignored depends on the tool.
#7
23.09.2019, 00:19
djmakinera
Banned

Either way, the problem remains: with DirectWrite turned on some characters are displayed wrongly, and with it turned off other characters are.
I have no idea what to do to display all UTF-8 characters fully

New Example:
Code:
∘
U+2218
UTF-8: 0xE2 0x88 0x98
RING OPERATOR
Or maybe it depends on the encoding used when saving?
UTF-16 BE with/without sign.
UTF-16 LE with/without sign.
#8
23.09.2019, 18:22
Jiaz
JD Manager

I'm sorry, but I still don't understand the problem, or what turning DirectWrite on or off in another application has to do with JDownloader.
#9
24.09.2019, 11:52
djmakinera
Banned

I will try to explain better.
I want to save the names to a text file, but some Unicode characters are not displayed correctly in the editor.
If I turn DirectWrite on, some Unicode characters are shown and others are not.
If I turn it off, it is the other way round.
So some characters can only be displayed at the expense of others, which is a problem.
From what I know, this problem occurs in Windows 7 (not present in Windows 10). But I don't want to upgrade to Windows 10 (at least not now)
JD2 -> Action -> Copy information -> Name or Packagename
Copy package or file names and save the file as ... (what encoding)?
#10
24.09.2019, 13:12
Jiaz
JD Manager

Quote:
Originally Posted by djmakinera
JD2 -> Action -> Copy information -> Name or Packagename
Copy package or file names and save the file as ... (what encoding)?
You don't have control over this, and it doesn't matter: the information is put on the clipboard in multiple encoding representations, and it is the job of the other application (the one reading from the clipboard) to choose the best supported one.
#11
24.09.2019, 13:15
Jiaz
JD Manager

Quote:
Originally Posted by djmakinera
I have no idea what to do to display all UTF-8 characters fully

UTF-16 BE with/without sign.
UTF-16 LE with/without sign.
UTF-8, UTF-16 BE and UTF-16 LE are encodings! They are just different ways of storing Unicode code points.
None of these encodings cares which symbol/character is stored.
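
To illustrate with the RING OPERATOR example from post #7: a short Python sketch (the codec names are Python's own, chosen here only for illustration) that stores the same code point in several encodings. The bytes differ, yet every variant decodes back to the identical character, so switching the encoding cannot make a missing glyph appear.

```python
ring = "\u2218"  # RING OPERATOR, U+2218

# "utf-8-sig" is UTF-8 with signature (BOM); the explicit -be/-le codecs
# write UTF-16 without a signature.
encodings = ["utf-8", "utf-8-sig", "utf-16-be", "utf-16-le"]
encoded = {enc: ring.encode(enc) for enc in encodings}

for enc, data in encoded.items():
    print(f"{enc:10} {data.hex(' ')}")
    # Every representation decodes back to the same code point.
    assert data.decode(enc) == ring

# utf-8      e2 88 98
# utf-8-sig  ef bb bf e2 88 98
# utf-16-be  22 18
# utf-16-le  18 22
```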
Last edited by Jiaz; 24.09.2019 at 13:19.
#12
24.09.2019, 13:18
Jiaz
JD Manager

Quote:
Originally Posted by djmakinera
From what I know, this problem occurs in Windows 7 (not present in Windows 10).
The Unicode specification is *alive*: you have to check whether the editor/OS/application already supports the required Unicode version in order to render the specific code points correctly. See en.wikipedia.org/wiki/Unicode
An application or rendering library that only supports Unicode 6 won't be able to render any newer code points.
#13
24.09.2019, 14:48
djmakinera
Banned

All Unicode shows correctly in ordinary Notepad, not necessarily in other editors.

I'm wondering, maybe the problem is the font?
How does (standard) font support most Unicode?
#14
24.09.2019, 14:53
Jiaz
JD Manager

Quote:
Originally Posted by djmakinera
All Unicode shows correctly in ordinary Notepad, not necessarily in other editors.
I strongly doubt that, but that's another topic. Or did you really test all 137,929 characters?
#15
24.09.2019, 14:55
Jiaz
JD Manager

Quote:
Originally Posted by djmakinera
I'm wondering, maybe the problem is the font?
How does (standard) font support most Unicode?
Of course the font must support the Unicode code point. There are special fonts for this, see
en.wikipedia.org/wiki/Unicode_font
#16
24.09.2019, 17:53
djmakinera
Banned

The other editor: user font -> Consolas
Notepad: user font -> Arial
#17
24.09.2019, 18:04
djmakinera
Banned

I've now set the font to Arial Unicode MS and the Unicode display is OK.
It's just that I don't like the font. :\
#18
24.09.2019, 18:41
Jiaz
JD Manager

There are other Unicode fonts available; use the search engine of your choice.
#19
26.09.2019, 16:56
djmakinera
Banned

I tested different Unicode fonts; the problem is that not all characters display well in each of them.
Unicode has millions of different characters (I won't go into it; someone needs them), but unfortunately this problem cannot be solved with a Unicode font. I know other people have asked about Unicode fonts (I am not the only one asking this on the internet), but what more can I do?
Apparently only browsers support all of Unicode, but what about editors (even those advertising full Unicode support)? No idea. Ech...
#20
27.09.2019, 10:15
Jiaz
JD Manager

Unicode does not have millions of characters but 137,929 (latest version, Unicode 12.1).
Quote:
Unicode allows for 17 planes, each of 65,536 possible characters (or 'code points'). This gives a total of 1,114,112 possible characters. At present, only about 10% of this space has been allocated.
stackoverflow.com/questions/5924105/how-many-characters-can-be-mapped-with-unicode
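
The numbers above are easy to check. A tiny Python sketch of the arithmetic, using only the figures quoted in this post (17 planes, 65,536 code points per plane, 137,929 characters in Unicode 12.1):

```python
PLANES = 17
CODE_POINTS_PER_PLANE = 65_536   # 2**16
ASSIGNED_12_1 = 137_929          # characters in Unicode 12.1, as stated above

total = PLANES * CODE_POINTS_PER_PLANE
print(total)                             # 1114112
print(f"{ASSIGNED_12_1 / total:.1%}")    # 12.4%
```

So the code space is a little over a million code points, of which roughly an eighth is assigned as of Unicode 12.1.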

Also, browsers don't support all of Unicode. It depends entirely on the font used.
#21
04.10.2019, 04:00
djmakinera
Banned

There are still some problems with the display of Polish letters.
I don't know what's going on... I save the file as UTF-8.


See screen:
https://postimg.cc/JDGGXhJc

Invalid characters for encoding UTF-8
#22
04.10.2019, 20:46
djmakinera
Banned

With encoding windows-1250 the character is visible.

With encoding UTF-8 the character is NOT visible.

2 screenshots:
https://postimg.cc/gallery/1lymlspj6/
#23
05.10.2019, 04:40
raztoki
English Supporter
 
Join Date: Apr 2010
Location: Australia
Posts: 17,614

This is not related to JD2 or JDownloader as far as I can see. Nevertheless, hopefully you can see/understand that there is no one-size-fits-all solution, as you're experiencing right now.
__________________
raztoki @ jDownloader reporter/developer
http://svn.jdownloader.org/users/170

Don't fight the system, use it to your advantage. :]
#24
05.10.2019, 06:43
djmakinera
Banned

So the entire list of file names can't be saved to a text file?
Even though I have a long list of encodings to choose from?

3 screenshots:
https://postimg.cc/crCGkHHj
https://postimg.cc/Hjpg53CT
https://postimg.cc/9zJjBTWZ
#25
07.10.2019, 18:06
Jiaz
JD Manager

@djmakinera: please understand that we don't have time to help with non JDownloader related issues.
there are many other places where you can ask for help
#26
08.10.2019, 19:22
djmakinera
Banned

Quote:
Originally Posted by Jiaz
@djmakinera: please understand that we don't have time to help with non JDownloader related issues.
there are many other places where you can ask for help
It is related to JD2 in a sense: it chooses the wrong encoding when unpacking. The file names in the archive are not stored as UTF-8 but in a legacy encoding such as KOI-8 or CP-XXX.
As a result, the Cyrillic names are damaged.
Code:
12 Őážáş ŐážĘąó - ŬÓʻᚠ(Alex Ch Remix 2k15).mid

ZIP file:
**External links are only visible to Support Staff****External links are only visible to Support Staff**


08.10.19 18.53.44 <--> 08.10.19 19.22.04 jdlog://8770130900751/
#27
08.10.2019, 20:06
djmakinera
Banned

JD2 bug: wrong encoding detected when unpacking the archive.


WinRAR: CTRL+E = select encoding OEM 866

see screenshot:
https://i.postimg.cc/G34Nr4tZ/Screen...t-08-04-PM.jpg
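
The damaged name in post #26 looks like OEM 866 (CP866) bytes that were decoded with a different OEM code page. The Python sketch below reproduces that kind of damage and reverses it. It assumes CP852 as the wrong code page; that is a guess based on the garbled characters, since the thread only confirms OEM 866 as the right one. The helper names `garble` and `repair` are hypothetical, and this is not how JDownloader's extraction library works internally.

```python
# Assumption: names are stored as CP866 bytes and were decoded as CP852.
# Only "OEM 866" is confirmed by the thread; CP852 is a guess.
STORED_AS = "cp866"
MISREAD_AS = "cp852"

def garble(name: str) -> str:
    """Reproduce the damage: CP866 bytes read with the wrong code page."""
    return name.encode(STORED_AS).decode(MISREAD_AS)

def repair(garbled: str) -> str:
    """Reverse the damage: get the raw bytes back, decode them as CP866."""
    return garbled.encode(MISREAD_AS).decode(STORED_AS)

original = "Казан"          # sample Cyrillic text for illustration
damaged = garble(original)
print(damaged)              # Latin letters with accents instead of Cyrillic
print(repair(damaged))      # the sample text again
```

The reversal only works while every byte survives the wrong decoding unchanged; once characters have been replaced by `?` or dropped, the original name cannot be recovered this way.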
#28
10.10.2019, 17:27
Jiaz
JD Manager

The extraction library currently doesn't support the Unicode extension.
Thanks for the example link.

#29
07.11.2019, 14:53
djmakinera
Banned

Is it possible to convert damaged (wrongly decoded) text back to Cyrillic?

Îäíà áîëüøàÿ ñöåíà.txt
Åé âäðóã ïðèøëà ìûñëü.txt
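
For names like the two above, the pattern of accented Latin letters is typical of Windows-1251 (Cyrillic) bytes that were decoded as Windows-1252 (Western European). A minimal Python sketch under exactly that assumption; the function name `recover_cyrillic` is made up for this example:

```python
def recover_cyrillic(garbled: str) -> str:
    # Assumption: the text was Windows-1251 and got decoded as Windows-1252.
    # Encoding with cp1252 restores the raw bytes; cp1251 then reads them
    # as Cyrillic.
    return garbled.encode("cp1252").decode("cp1251")

for name in ("Îäíà áîëüøàÿ ñöåíà.txt", "Åé âäðóã ïðèøëà ìûñëü.txt"):
    print(recover_cyrillic(name))

# Одна большая сцена.txt
# Ей вдруг пришла мысль.txt
```

If a name was damaged by a different pair of code pages, the same two-step idea applies, but the codec names have to be swapped accordingly.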
All times are GMT +2.