Skip to main content

Questions tagged [unicode]

Unicode is intended to be a universal character set for describing all the characters required for written text incorporating all writing systems, technical symbols and punctuation.

Filter by
Sorted by
Tagged with
8 votes
2 answers
105 views

Unicode transfer format conversions as range adapters

This is my attempt to implement view adapters that convert between UCS-4, UTF-16 and UTF-8 (the UTF-16 ⟷ UTF-8 conversions must go via UCS-4). Compared to the deprecated standard ...
Toby Speight's user avatar
  • 86.2k
6 votes
3 answers
985 views

A String View Library in C

I was recently working through the PintOS projects and became curious if there was a better way to do some string processing in C. Specifically, instead of strtok_r,...
Alex Lopez's user avatar
6 votes
1 answer
755 views

Transcoding UTF-8 to UTF-16-LE in VBA

VBA is a language that's lacking a lot of basic functionality. (Pun intended) Most libraries, if they exist in the first place, are OS-specific, and even some of the inbuilt functions don't work on ...
GWD's user avatar
  • 195
0 votes
1 answer
180 views

supporting unicode in rust lexer

I am working on a lexer in rust. It has been through many different iterations, going from iterating over a Vec<char> to ...
xxxxxxxxxx's user avatar
6 votes
1 answer
2k views

Client server communications through unix signals in C

For school, I have to create a server-client communication through Unix signals and only with SIGUSR1 and SIGUSR2. Client ...
dieri's user avatar
  • 61
1 vote
2 answers
184 views

Parse string using character value for Unicode characters

I added the getCValue method to reduce the cyclomatic complexity, but the cyclomatic complexity still persists. How can I reduce it? Can I change this code using ...
stromboli's user avatar
5 votes
1 answer
483 views

Converting IDN domains to Punycode in Perl

Description This script takes any domain input from STDIN and converts unicode domains into punycode. Features Any domains that throw an error get ignored. When fed any ASCII domains, they just pass ...
T145's user avatar
  • 3,149
4 votes
2 answers
802 views

UTF-8 to UTF-16 using C++20 concepts and ranges

A follow-up for this previous question. I took into account previous reviews, and tried to make a simple API. I had never done anything non-trivial with C++20 concepts and ranges until now, so I am ...
Aykhan Hagverdili's user avatar
6 votes
4 answers
3k views

UTF-8 to UTF-16 (char8_t string to char16_t string)

Below is an implementation of a UTF-8 string to UTF-16 string. Kind of like MultiByteToWideChar on Win32, but it's cross-platform and ...
Aykhan Hagverdili's user avatar
3 votes
1 answer
161 views

Appending a codepoint to an UTF8 std::string using icu4c

My code is ...
Alexey Romanov's user avatar
10 votes
2 answers
1k views

Text based rendering/animation engine for the terminal

This project was highly inspired by the popular drawille project, that lets one draw to the terminal using the braille unicode characters. The advantage of drawing with braille characters compared to ...
766F6964's user avatar
  • 901
8 votes
1 answer
4k views

JavaScript string to Unicode (Hex)

This set of questions are related to a project I've published for converting characters, or strings, to Hex based Unicode; eg... ...
S0AndS0's user avatar
  • 978
1 vote
1 answer
2k views

The conversion from UTF-16 to UTF-8

I have created a function that converts from UTF-16 to UTF-8. This function converts from UTF-16 to codepoint firstly, then from codepoint to UTF-8. ...
Lion King's user avatar
  • 147
22 votes
6 answers
4k views

Transcode UCS-4BE to UTF-8

Below is my entire program. You can read what it does thanks to the comments and specifications in particular. My question is: can it be improved? Would it be possible, for example, to avoid writing a ...
lettomobile's user avatar
8 votes
1 answer
430 views

Printing Colored, CLId Unicode Chess Board - follow-up

Follow up to: Printing Command Line Unicode Chess Board A couple people from the previous question mentioned looking into coloring the command line with ANSI escape sequences. In Python, ...
T145's user avatar
  • 3,149

15 30 50 per page
1
2 3 4 5 6