This is a tumbling log of things and thoughts that pass my way.
If you got here by accident, you may want to visit my website instead.
"
So what can we do? Well, we could disallow these explicit characters (U+202A-U+202E) which is pretty easy. This does mean that anybody who wants to use them to include Neutrals at the edges of their Arabic usernames will be out of luck - and that sucks more when it’s a comment they’re posting, where the period jumps to the ‘beginning’ of the text.
If we want to allow use of these characters, the solution is fairly simple (if hard to implement): we need to make sure that every opening marker has a paired closing marker (PDF) so that the state stack coming out of the string is at the same state as when we went in. We also need to be careful that we don’t allow any PDFs to be used without accompanying push markers, else we can’t use any ourselves outside of the block.
"
Understanding Bidirectional (BIDI) Text in Unicode. Fun stuff.
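For the curious, here is a rough sketch of the two options the quote describes: stripping the explicit characters outright, or keeping them but making sure every opener has a matching PDF and no PDF appears without an opener. The function names are my own invention, not from the article, and it only covers the U+202A-U+202E range mentioned above (it ignores the isolate characters U+2066-U+2069 and any nesting-depth limits).

```python
# Explicit bidi formatting characters in the U+202A-U+202E range.
LRE, RLE, PDF, LRO, RLO = "\u202a", "\u202b", "\u202c", "\u202d", "\u202e"
OPENERS = {LRE, RLE, LRO, RLO}   # each of these pushes a new state
EXPLICIT = OPENERS | {PDF}       # PDF pops the most recent push


def strip_bidi_controls(text: str) -> str:
    """Option 1 (hypothetical helper): disallow the characters entirely."""
    return "".join(ch for ch in text if ch not in EXPLICIT)


def balance_bidi_controls(text: str) -> str:
    """Option 2 (hypothetical helper): keep the characters but leave the
    state stack exactly as it was found.

    Stray PDFs (no open push to close) are dropped, and one PDF is
    appended for every opener still open at the end of the string.
    """
    out = []
    depth = 0
    for ch in text:
        if ch in OPENERS:
            depth += 1
        elif ch == PDF:
            if depth == 0:
                continue  # unmatched PDF: would pop state we don't own
            depth -= 1
        out.append(ch)
    out.append(PDF * depth)  # close whatever is still open
    return "".join(out)


if __name__ == "__main__":
    username = "\u202eevil"  # RLO with no closing PDF
    print(repr(strip_bidi_controls(username)))
    print(repr(balance_bidi_controls(username)))
```

Stripping is the blunt choice and breaks the legitimate Arabic-username case from the quote; balancing keeps those working while stopping a username or comment from leaking its direction override into the rest of the page.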