Skip Menu |

This queue is for tickets about the Unicode-Homoglyph-Replace CPAN distribution.

Report information
The Basics
Id: 129906
Status: open
Priority: 0/
Queue: Unicode-Homoglyph-Replace

People
Owner: Nobody in particular
Requestors: enno.nagel [...] t-online.de
Cc:
AdminCc:

Bug Information
Severity: (no value)
Broken in: (no value)
Fixed in: (no value)



Subject: Thanks Unicode::Homoglyph::Replace
Date: Tue, 25 Jun 2019 09:56:21 -0300
To: bug-Unicode-Homoglyph-Replace [...] rt.cpan.org
From: Enno Nagel <enno.nagel [...] t-online.de>
Dear David, Thank you for compiling the list of Unicode homoglyphs and the accompanying Perl script to normalize them at https://fastapi.metacpan.org/source/BIGPRESH/Unicode-Homoglyph-Replace-0.01/lib/Unicode/Homoglyph/Replace.pm Useful, in my case, for a Vim script that highlights these (and another one to normalize them, analogue to your Perl script). Let me remark that the character s (= \X{0073}) appears twice and that on https://www.irongeek.com/homoglyph-attack-generator.php in the meanwhile new homoglyphs have been found; it does not stop. Best wishes Enno PS: Sorry for having sent this e-mail to your private email address in the first place. -- PGP key: https://konfekt.bitbucket.io/keys/epn.asc
On 2019-06-25 14:03:50, enno.nagel@t-online.de wrote: Show quoted text
> Dear David, > > Thank you for compiling the list of Unicode homoglyphs and the > accompanying Perl script to normalize them at <snip>
No problems, glad to hear it's useful - it's always nice to hear work is appreciated, so thank you for taking the time to say so! Show quoted text
> Let me remark that the character s (= \X{0073}) appears twice
Ah yes, so it does - removed the duplicate, thanks. Show quoted text
> and that on https://www.irongeek.com/homoglyph-attack-generator.php > in the meanwhile new homoglyphs have been found; it does not stop.
Hmm - according to the changelog on that page, the last change was 11/28/2017, adding "ḍ" - but it's quite possible others have been added without updating the changelog of course. The source of characters I used initially appears to be char_codes.txt from https://github.com/codebox/homoglyph which hasn't been updated since. I'll try to find a moment to write a script to pull the list of homoglyphs from https://www.irongeek.com/homoglyph-attack-generator.php and "merge" them with the list already in the module. Cheers Dave P (BIGPRESH)