Encoding problem in split files
Last Post 26 Jan 2012 04:27 AM by Hector. 5 Replies.
June (New Member, 1 post)
14 Jul 2009 10:43 PM  

Hi, I found your great tool and I am glad it works. However, I noticed that the split files do not keep the encoding of the original file. For example, when I split a UTF-8 file, the resulting files are ANSI, which corrupts the characters. Do you think you will be able to fix it? Thanks!

Hector (Grand Widget Maker, Basic Member, 178 posts)
15 Jul 2009 02:21 AM  

This has already been requested here:

http://www.systemwidgets.com/Suppor...fault.aspx

I started looking around for a solution. The main issue is that .NET does not have built-in functions to detect file encodings. I did find a couple of promising leads, so I'll have to experiment.
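
For readers wondering what this kind of encoding guessing can look like, here is a minimal sketch in Python. It is not the TextFileSplitter code, which is not shown in this thread. The function name `guess_encoding` and the `cp1252` fallback (one common meaning of "ANSI" on Western Windows systems) are assumptions made for illustration. The sketch checks for a byte-order mark first, then tests whether the bytes are valid UTF-8, and otherwise falls back to the assumed ANSI code page.

```python
import codecs

# Byte-order marks, checked longest first so that a UTF-32 LE mark
# (FF FE 00 00) is not mistaken for a UTF-16 LE mark (FF FE).
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def guess_encoding(data: bytes, fallback: str = "cp1252") -> str:
    """Guess a Python codec name for raw file bytes (heuristic only)."""
    # 1. A byte-order mark is the only reliable signal.
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    # 2. No mark: text that decodes cleanly as UTF-8 is very likely UTF-8.
    #    Pure ASCII also lands here, which is harmless.
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        # 3. Otherwise assume the ANSI code page (an assumption, not a fact
        #    about the file).
        return fallback
```

A guess like this can still be wrong, for example for files in a different single-byte code page, so it is only a starting point.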

Hector (Grand Widget Maker, Basic Member, 178 posts)
16 Jan 2010 09:01 PM  
This is now included in version 2.0.0 and newer.

gcs (New Member, 1 post)
13 Oct 2011 04:29 AM  
I have version 2.2.1, and the problem exists when the original file is ANSI, because the output files are UTF-8. The split files are not usable for me until converted back to ANSI, which defeats the "batch" file flow I intended to use TextFileSplitter for. Would you please confirm that this incorrect conversion still exists? You must have all of the code necessary to ensure this does not happen with the output. I look forward to your solution.

Hector (Grand Widget Maker, Basic Member, 178 posts)
13 Oct 2011 07:32 AM  
Hi gcs,

These sorts of issues are impossible to troubleshoot without a sample text file. I saw your email. Could you send a sample text file, along with the split strategy you were using, to support AT systemwidgets DOT com?

Let me be upfront about this. .NET does not have built-in detection of file encodings. I have to do a lot of guessing in the code that looks for the encoding. I've tried to cover the most common situations, but I know that I missed some cases. TextFileSplitter will use the same encoding in the output as it detected in the source file. That's why I need to see a sample file, so that I can take a look at it with a hex editor and a few other tools that I have in my toolkit.
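
To illustrate the "same encoding in the output as in the source" behavior, here is a rough Python sketch of a line-based split. It is an illustration only, not how TextFileSplitter is implemented. The names `split_preserving_encoding` and `_write_part`, the `_partN` file naming, and the assumption that the encoding is already known and passed in are all hypothetical.

```python
from pathlib import Path


def _write_part(src: Path, index: int, lines: list, encoding: str) -> Path:
    """Write one chunk next to the source file, using the source encoding."""
    out = src.with_name(f"{src.stem}_part{index}{src.suffix}")
    # newline="" writes line endings exactly as they were read.
    with open(out, "w", encoding=encoding, newline="") as writer:
        writer.writelines(lines)
    return out


def split_preserving_encoding(src, lines_per_part: int, encoding: str) -> list:
    """Split a text file into parts of N lines, keeping encoding and line endings."""
    src = Path(src)
    parts, chunk = [], []
    # newline="" stops Python from translating \r\n to \n while reading.
    with open(src, "r", encoding=encoding, newline="") as reader:
        for line in reader:
            chunk.append(line)
            if len(chunk) == lines_per_part:
                parts.append(_write_part(src, len(parts) + 1, chunk, encoding))
                chunk = []
    if chunk:  # leftover lines that did not fill a whole part
        parts.append(_write_part(src, len(parts) + 1, chunk, encoding))
    return parts
```

The key point is that the same `encoding` value is used for both reading and writing, so an ANSI source yields ANSI parts and a UTF-8 source yields UTF-8 parts. If the encoding passed in is a wrong guess, the output will be wrong in the same way, which is why a sample file matters.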

I don't have a lot of time these days, but I'll see if I can take a gander at this issue this weekend.

Hector (Grand Widget Maker, Basic Member, 178 posts)
26 Jan 2012 04:27 AM  
Alright, this shortcoming has been addressed in version 2.5.0. It is now up in the downloads section.