Questions tagged [character-encoding]
A character encoding system consists of a code that pairs each character from a given repertoire with something else — such as a bit pattern, sequence of natural numbers, octets, or electrical pulses — in order to facilitate the transmission of data (generally numbers or text) through telecommunication networks or for data storage.
309 questions
-3
votes
1
answer
88
views
Why do subtitles appear constantly in bold in Windows 11? Only embedded subtitles display correctly [closed]
I just reinstalled Windows (Windows 11 25H2, 10.0.26200 Build 26200). Now the subtitles look like this and appear in bold or bigger shadow/outline. Before I noticed this problem, I changed several ...
23
votes
2
answers
3k
views
Running a BAT file and accidentally finding obscure Chinese poem
Accidentally found obscure Chinese poem (BAT file)
Something really weird just happened. I ran this in a batch file:
wmic timezone get caption>>tmp_ist.bak
time/date>>tmp_ist.bak
This ...
0
votes
0
answers
15
views
How to grep on accented vowels? [duplicate]
I have a logfile, written in French, containing the following information:
Nouvelles données [Status : 32
I was looking for all possible values of the mentioned status, but I didn't find anything:
...
0
votes
1
answer
746
views
UTF-8 Decoders fail to decode the encoded strings
I have some encoded values values which I believe is UTF-8. Now I dont really know if it is UTF-8 or not because other online tool and steps to decode UTF-8 is not working, BUT an open source tool ...
0
votes
1
answer
1k
views
notepad is displaying txt file contents as weird symbols
I am writing integer values into file from kotlin (kotlin Int type) using something like this
var1BufferedWriter?.write(String.format("%d\n", intvar ) )
var2BufferedWriter?.write(String....
2
votes
1
answer
386
views
VIM uses wrong encoding - but only in status messages
I ran into a strange issue with my ArchLinux setup. Vim uses correct encoding for reading/displaying files but these status messages (which displays the current mode or reports back when the buffer is ...
5
votes
1
answer
2k
views
How to grep search for text in an ISO-8859-1 encoded file?
I'm attempting to use grep to search for text patterns from an ISO-8859-1 encoded file:
https://github.com/jfoclpf/words-pt
When I execute a search, all of the matches are returned, but the accented ...
4
votes
1
answer
5k
views
How to identify a file encoding?
I'm trying to figure out the encoding of a text file. I did try a lot of the common ones (with Notepad++), but I've failed so far.
A few hints: The file was originally an Eudora mbx file, with mostly ...
3
votes
0
answers
480
views
ffmpeg printing unknown glyph between characters on utf-8 subtitle
FFmpeg is printing unknown glyphs between some two characters but the weird thing is it's not replacing a character that doesn't exist in the font, it's just printing a new unknown glygh and I can't ...
0
votes
0
answers
3k
views
How to change Excel character set?
I have an automatically-generated CSV file which contains accented characters. These appear fine when the file is opened with Notepad++. However, accented characters appear mangled in Excel (e.g. é ...
0
votes
1
answer
3k
views
Wrong character encoding in ssh session – but not for all connectios
I have an odd issue when connecting to my (Ubuntu) server via SSH.
If I connect from my Gentoo box, all is fine. All Umlauts etc. work, I can type "ÄÖÜ" and so on.
If I do the same from my ...
0
votes
1
answer
1k
views
How to read Linux text files in Windows system?
For example, I run the top command and store it to a file in Linux, after that I open that file in Windows it contains some gibberish. Here is the file viewed in Notepad++:
The option to convert to ...
1
vote
1
answer
571
views
does batch program support any ascii characters or is there's a way to create encoder and decoder program with another tools?
i wrote batch encoder and decoder from adrianvdh and customize some of the text string input, but the decoder one aren't working, because i put special symbols inside there.
here's the string of the ...
0
votes
1
answer
343
views
AWK: "invalid regexp: Invalid collation character" -- how do I make it valid?
I have an awk script that must process millions of records, but I need to remove any containing a multibyte character.
In one environment where I work, the following simplified shell sequence ...
1
vote
0
answers
649
views
How to use ISO8859-9 encoding in terminal?
I maked a file containing "ırmak" with a text editor via encoding ISO8859-9.
Then, I tried to print the content with "cat" command in the terminal. But I could not.
I use the ...