Questions tagged [unicode]
Unicode is intended to be a universal character set for describing all the characters required for written text incorporating all writing systems, technical symbols and punctuation.
78 questions
8
votes
2
answers
105
views
Unicode transfer format conversions as range adapters
This is my attempt to implement view adapters that convert between UCS-4, UTF-16 and UTF-8 (the UTF-16 ⟷ UTF-8 conversions must go via UCS-4). Compared to the deprecated standard ...
6
votes
3
answers
985
views
A String View Library in C
I was recently working through the PintOS projects and became curious if there was a better way to do some string processing in C. Specifically, instead of strtok_r,...
6
votes
1
answer
755
views
Transcoding UTF-8 to UTF-16-LE in VBA
VBA is a language that's lacking a lot of basic functionality. (Pun intended)
Most libraries, if they exist in the first place, are OS-specific, and even some of the inbuilt functions don't work on ...
0
votes
1
answer
180
views
supporting unicode in rust lexer
I am working on a lexer in rust.
It has been through many different iterations, going from iterating over a Vec<char> to ...
6
votes
1
answer
2k
views
Client server communications through unix signals in C
For school, I have to create a server-client communication through Unix signals and only with SIGUSR1 and SIGUSR2.
Client ...
1
vote
2
answers
184
views
Parse string using character value for Unicode characters
I added the getCValue method to reduce the cyclomatic complexity, but the cyclomatic complexity still persists. How can I reduce it? Can I change this code using ...
5
votes
1
answer
483
views
Converting IDN domains to Punycode in Perl
Description
This script takes any domain input from STDIN and converts unicode domains into punycode.
Features
Any domains that throw an error get ignored.
When fed any ASCII domains, they just pass ...
4
votes
2
answers
802
views
UTF-8 to UTF-16 using C++20 concepts and ranges
A follow-up for this previous question.
I took into account previous reviews, and tried to make a simple API. I had never done anything non-trivial with C++20 concepts and ranges until now, so I am ...
6
votes
4
answers
3k
views
UTF-8 to UTF-16 (char8_t string to char16_t string)
Below is an implementation of a UTF-8 string to UTF-16 string. Kind of like MultiByteToWideChar on Win32, but it's cross-platform and ...
3
votes
1
answer
161
views
Appending a codepoint to an UTF8 std::string using icu4c
My code is
...
10
votes
2
answers
1k
views
Text based rendering/animation engine for the terminal
This project was highly inspired by the popular drawille project, that lets one draw to the terminal using the braille unicode characters.
The advantage of drawing with braille characters compared to ...
8
votes
1
answer
4k
views
JavaScript string to Unicode (Hex)
This set of questions are related to a project I've published for converting characters, or strings, to Hex based Unicode; eg...
...
1
vote
1
answer
2k
views
The conversion from UTF-16 to UTF-8
I have created a function that converts from UTF-16 to UTF-8.
This function converts from UTF-16 to codepoint firstly, then from codepoint to UTF-8.
...
22
votes
6
answers
4k
views
Transcode UCS-4BE to UTF-8
Below is my entire program. You can read what it does thanks to the comments and specifications in particular.
My question is: can it be improved? Would it be possible, for example, to avoid writing a ...
8
votes
1
answer
430
views
Printing Colored, CLId Unicode Chess Board - follow-up
Follow up to: Printing Command Line Unicode Chess Board
A couple people from the previous question mentioned looking into coloring the command line with ANSI escape sequences. In Python, ...