[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] unicode with strcat



> On Oct 11, 2023, at 4:06 PM, Rita <rmorgan466@xxxxxxxxx> wrote:
> 
> I would like to put UTF-8 encoding in my strcat command for better messages. Is that possible with HTcondor?


Iâm unsure if youâre talking about the strcat command-line tool or the strcat() ClassAd function, so Iâll address both.

The jobâs stdout and data files can contain any data format you want. 

The ClassAd language naively tolerates UTF-8 encoding in string values. It has no understanding of unicode code points or multi-byte characters. It will preserve the values, but treat each byte as an independent character. Thus, the strcat() and strcmp() functions will do byte-by-byte copying and comparison of string values with UTF-8 encoding. But the substr() function will count bytes and happily bisect a multi-byte character.

 - Jaime