855
submitted 2 months ago by Maven@lemmy.zip to c/programmerhumor@lemmy.ml
you are viewing a single comment's thread
view the rest of the comments
[-] Ptsf@lemmy.world 163 points 2 months ago

Isn't it all unicode at the end of the day, so it supports anything unicode supports? Or am I off base?

[-] dohpaz42@lemmy.world 72 points 2 months ago

Ssh! 🫢 You’ll ruin the joke!

[-] thanks_shakey_snake@lemmy.ca 25 points 2 months ago

Okay but how does starting a secure shell help?

[-] tetris11@lemmy.ml 10 points 2 months ago

Are you serious? I just explained that to you two seconds ago

[-] dohpaz42@lemmy.world 3 points 2 months ago

Well for one, it encrypts all communications so that people can’t snoop on what you’re doing.

[-] Ptsf@lemmy.world 6 points 2 months ago
[-] NeatNit@discuss.tchncs.de 25 points 2 months ago

Yes, but the language/compiler defines which characters are allowed in variable names.

[-] thevoidzero@lemmy.world 5 points 2 months ago

I thought the most mode sane and modern language use the unicode block identification to determine something can be used in valid identifier or not. Like all the 'numeric' unicode characters can't be at the beginning of identifier similar to how it can't have '3var'.

So once your programming language supports unicode, it automatically will support any unicode language that has those particular blocks.

[-] NeatNit@discuss.tchncs.de 4 points 2 months ago

Sanity is subjective here. There are reasons to disallow non-ASCII characters, for example to prevent identical-looking characters from causing sneaky bugs in the code, like this but unintentional: https://en.wikipedia.org/wiki/IDN_homograph_attack (and yes, don't you worry, this absolutely can happen unintentionally).

[-] toastal@lemmy.ml 8 points 2 months ago* (last edited 2 months ago)

OCaml’s old m17n compiler plugin solved this by requiring you pick one block per ‘word’ & you can only switch to another block if separated by an underscore. As such you can do print_แมว but you couldn’t do pℝint_c∀t. This is a totally reasonable solution.

[-] NeatNit@discuss.tchncs.de 2 points 2 months ago

That's pretty cool

[-] lunarul@lemmy.world 2 points 2 months ago

I can't imagine how something like homograph attacks can happen accidentally. If someone does this in code, they probably intended to troll other contributors.

[-] NeatNit@discuss.tchncs.de 3 points 2 months ago

Multilingual users have multiple keyboard layouts, usually switching with Alt+Shift or similar key combo. If you're multitasking you might not realize you're on the wrong keyboard layout. So say you're chatting with someone in Russian, then you alt+tab to your source code and you spot a typo - you wrote my_var_xopy instead of my_var_copy. You delete the x and type in c. You forget this happened and you never realized the keyboard layout was wrong.

That c that you typed is now actually с, Cyrillic Es.

What do you say, is that realistic enough?

[-] lunarul@lemmy.world 4 points 2 months ago* (last edited 2 months ago)

I use multilingual keyboard layouts, so I know that at least on Windows the selected layout is specific to each window. If I chat with someone in one language, then switch to my IDE, it will not keep the layout I used in the chat window.

But I also have accidently hit the combination to change layouts while doing something, so it can happen. I'm just surprised that Cyrillic с is on the same key as C, instead of S.

[-] NeatNit@discuss.tchncs.de 2 points 2 months ago

I believe there's a setting for whether it's global or per-window. Personally I prefer global, because I can't keep track of more than one state and I absolutely hate the experience of typing something and getting a different language than you expect.

[-] thevoidzero@lemmy.world 1 points 2 months ago

Sorry, I forgot about this. I meant to say any sane modern language that allows unicode should use the block specifications (for e.g. to determine the alphabets, numeric, symbols, alphanumeric unicodes, etc) for similar rules with ASCII. So that they don't have to individually support each language.

[-] NeatNit@discuss.tchncs.de 1 points 2 months ago

Oh, that I agree with. But then there's the mess of Unicode updates, and if you're using an old version of the compiler that was built with an old version of Unicode, it might not recognize every character you use...

[-] sukhmel@programming.dev 3 points 2 months ago

Yes, but it still is about language, not game engine.

Albeit technically, the statement is correct, since it is more specific.

[-] HK65@sopuli.xyz 6 points 2 months ago

Yeah, but this particular language is a feature of the game engine. It's its own thing called GDScript.

[-] sukhmel@programming.dev 2 points 2 months ago

Oh, I didn't know that, neat. Then there's no space for nit-picking

[-] HK65@sopuli.xyz 2 points 2 months ago

Godot is neat. There is C# support as well if you find that easier, but coming from Unreal, it's night and day. I know Unreal has so much more features, but for a hobbyist like me, Godot is much better. It's just this small executable, and you have everything you need to get creative.

[-] Faresh@lemmy.ml 9 points 2 months ago

I think they exclude some unicode characters from being use in identifiers. At least last I tried it wouldn't allow me to use an emoji as a variable name.

[-] histic@lemmy.dbzer0.com 2 points 2 months ago

Another guy just posted emojis in their code in the comments no idea if it actually works

[-] Faresh@lemmy.ml 8 points 2 months ago

That code was C++ or something like that. Not GDScript.

I tested this on Godot 4.2.1. You can write identifiers using a different writing system other than latin and you are allowed to have emojis in strings, but you aren't allowed to use emojis in identifiers.

[-] histic@lemmy.dbzer0.com 4 points 2 months ago

Ah I’m unfamiliar with most languages I just use python and random others for personal projects

[-] bleistift2@sopuli.xyz 1 points 2 months ago

Coding must be a nightmare if you’re choosing programming languages at random 😱

But you must also be learning quite a lot.

[-] histic@lemmy.dbzer0.com 1 points 2 months ago

I’m not choosing at random lol that would be crazy but I mostly use python and have been teaching myself go and some rust

[-] lunarul@lemmy.world 3 points 2 months ago

There's probably a rule that requires variables to start with a letter or underscore. Emoji are nor marked as letters. Something like _👍 will probably work.

this post was submitted on 11 Oct 2024
855 points (98.2% liked)

Programmer Humor

32706 readers
671 users here now

Post funny things about programming here! (Or just rant about your favourite programming language.)

Rules:

founded 5 years ago
MODERATORS