Unicode is a computing industry standard for the consistent encoding, representation and handling of text expressed in most of the world's writing systems.
Questions tagged [unicode]
493 questions
35
votes
5 answers
How can I check if a UTF-8 text file has a BOM?
How can I check if a UTF-8 text file has a BOM from command line?
file command shows me:
UTF-8 Unicode text
But, I don't know if it means there is no BOM in the file.
I'm using Ubuntu 12.04.

ironsand
- 5,205
9
votes
1 answer
interpret \UXXXXXXXX Unicode codes in text file
I have the following file :
$ cat test
Villes visit\U000000e9es
How can I interpret those \UXXXXXXXX codes, e.g. how can I get :
$ cat test | pipe into something
Villes visitées

ChennyStar
- 1,743
3
votes
0 answers
Unicode and Telegram Desktop app
Is there any way to enable unicode input in Telegram Desktop App for unix/linux systems?
Addendm:
The problem seems to be due to some qt-ibus configuration problem according to this unfortunately they don't say how to work around it.

Giorgio Mossa
- 131
2
votes
1 answer
Unicode support in talk?
I live in Poland, and we have some accented characters - ąćęłńóśźż. Due to various reasons I resurrected "talk" usage on my server, which works great, but I wasn't able to figure out how to make it work with accents.
On both ends of talk I have…

user70436
- 75
1
vote
2 answers
Looking up and Inputing arbitrary unicode characters in console/terminal
I'm looking for a simple, generic way to input arbitrary unicode characters in a text document on the terminal(e.g. in a terminal editor).
A basic method I can imagine is having a simple text(utf-8) file containing two columns, the character's name…

Charles Langlois
- 181
1
vote
2 answers
Unicode Strings in Text file
I have a text file containing the following unicode strings with regular text.
Cat a.txt
{"relationship":{"type:Memberkey","id""824-\u0001\u0019BFGHDICA2166-01-01","source"}
Here \u0001 and \u0019 are unicode strings and is causing our program to…

Neo
- 11
- 2
0
votes
1 answer
How iconv and od handle endianness?
echo hello | iconv -f ascii -t utf-16le | od -x
produces what seems to me like a big-endian result
0068 0065 006c 006c 006f 000a
whereas the same line without the 'le' produces, on a utf16le system (Osx)
echo hello | iconv -f ascii…

Dmitry Starostin
- 111
0
votes
1 answer
create unix alias for "ssh username@", and I would like to use it for any ipaddress.
create unix alias for "ssh username@", and I would like to use it for any ip address.
for example,
alias go="username@"
usecase:
$go 10.1.1.1 ; implement => ssh username@10.1.1.1
Currently if i do $go 10.1.1.1, it is taking extra white space…
-1
votes
2 answers
Is ∞ allowed in UTF-8 Encoded files?
Are lemniscates, ∞, allowed in UTF-8 Encoded files?
I am hoping that students with less than six months of computer programming experience can use a search engine to type something like "is infinity a valid UTF-8 character", and that they will find…