Opened 11 years ago

Closed 8 years ago

Last modified 8 years ago

#9395 closed enhancement (fixed)

[STXT Translator] Wrap and use ICU's Character Set Detection feature.

Reported by: siarzhuk Owned by: nobody
Priority: normal Milestone: R1
Component: Add-Ons/Translators Version: R1/alpha4.1
Keywords: Cc:
Blocked By: Blocking: #3065, #6252, #6447, #7954
Platform: All

Description

There are set of tickets describing problems with text documents loading:

#3065 [StyledEdit] Bad argument type passed to function #6252 No UTF16 support in StyledEdit or Pe #6447 StyledEdit doesn't check for valid utf-8 #7954 [StyledEdit] shows only small part of file

Documents in some encodings cannot be loaded at all. Current encoding guessing in STXT translator was borrowed from BSD file tool and is not perfect nowadays. From the other side latest version of ICU provides the Character Set Detection support that can be wrapped by our Local Kit and used in STXT translator for more complete support of encodings in Haiku.

PS: This is another post-GCI task that was unfortunately not claimed during GCI 2012.

Change History (8)

comment:1 by siarzhuk, 11 years ago

Blocking: 3065 added

comment:2 by siarzhuk, 11 years ago

Blocking: 6252 added

comment:3 by siarzhuk, 11 years ago

Blocking: 6447 added

comment:4 by siarzhuk, 11 years ago

Blocking: 7954 added

comment:5 by siarzhuk, 9 years ago

Owner: changed from siarzhuk to nobody
Status: newassigned

Those were taken some years ago as potential GCI tasks. Unfortunately no place is available for them in my schedule at this days.

comment:6 by pulkomandy, 8 years ago

Resolution: fixed
Status: assignedclosed

Fixed in hrev50552.

comment:7 by diver, 8 years ago

Just tried to open text files with cp1251 and KOI8 (after rmattr "be:encoding" file) and StyledEdit can't open them - Unsupported format.

comment:8 by pulkomandy, 8 years ago

Sample files welcome!

Version 0, edited 8 years ago by pulkomandy (next)
Note: See TracTickets for help on using tickets.