Message-ID: <55AFA9DE.3060103@gmail.com>
Date: Wed, 22 Jul 2015 16:34:06 +0200
From: Marek Wrzosek <marek.wrzosek@...il.com>
To: john-users@...ts.openwall.com
Subject: Re: Bleeding jumbo now defaults to UTF-8

On 01.06.2015 at 22:33, Marek Wrzosek wrote:
> On 01.06.2015 at 18:04, magnum wrote:
>> On 2015-06-01 16:47, magnum wrote:
>>> You can do a try-catch in Perl (actual command is 'eval' iirc).
>>> Pseudo-code:
>>>
>>> For each UTF-8 line of input {
>>> skip any pure ASCII
>>> try encoding to CP1234
>>> if it worked, print it
>>> }
>>>
>>> Unless you need this a lot you shouldn't create new files (they only add
>>> a burden of maintenance). Just write this as a simple filter where
>>> actual encoding would be a command-line option, and feed it to john
>>>
>>> Example:
>>> $ ./john -w:all.utf8.lst -rules:whatever hashfile
>>> $ codepage.pl <all.utf8.lst -t cp1234 | ./john -pipe -enc:cp1234
>>> -rules:whatever hashfile
>>> $ codepage.pl <all.utf8.lst -t cp1235 | ./john -pipe -enc:cp1235
>>> -rules:whatever hashfile
>>> ...
>>>
>>> Let me see if I can whip up an actual implementation of that filter in
>>> Perl. I'll be back.
>>
>> Attached is a quick hack implementing this.
>>
>> magnum
>>
> WOW, that was fast! Thanks, magnum!
>

And last but not least... What is the one proper way to use --inc=utf8
in the new bleeding-jumbo? I mean, which encoding option should we use:
--input-encoding=utf-8, --target-encoding=utf-8,
--internal-encoding=utf-8, or just --encoding=utf-8? None of them seems
to work with --inc=utf8. For --inc=latin1, --target-encoding=cp1252 is
mandatory for the pot file to be UTF-8 only and not mixed with other
encodings.

Best Regards
-- 
Marek Wrzosek
marek.wrzosek@...il.com
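
For readers who want to try the filter idea from the quoted pseudo-code, here is a minimal sketch in Python. It is not magnum's attached Perl script, which is not reproduced in this message. The function names and the script name codepage_filter.py are hypothetical, and the sketch assumes the target codepage is given as a Python codec name (for example cp1250), which may not match john's encoding names one to one.

```python
import sys


def is_pure_ascii(text):
    """Return True if every character of `text` is in the 7-bit ASCII range."""
    return all(ord(ch) < 128 for ch in text)


def filter_lines(lines, target):
    """Yield each non-ASCII line encoded to `target`; drop lines that do not fit.

    `lines` is an iterable of already-decoded (Unicode) strings.
    `target` is a Python codec name such as "cp1250" (assumption: this is
    Python's naming, not necessarily the name john expects for -enc).
    """
    for line in lines:
        word = line.rstrip("\r\n")
        if is_pure_ascii(word):
            continue  # "skip any pure ASCII"
        try:
            encoded = word.encode(target)  # "try encoding to CP1234"
        except UnicodeEncodeError:
            continue  # not representable in the target codepage
        yield encoded  # "if it worked, print it"


def main(target):
    """Read UTF-8 lines from stdin, write the surviving candidates to stdout."""
    def decoded():
        for raw in sys.stdin.buffer:
            try:
                yield raw.decode("utf-8")
            except UnicodeDecodeError:
                continue  # ignore input lines that are not valid UTF-8

    for encoded in filter_lines(decoded(), target):
        sys.stdout.buffer.write(encoded + b"\n")


# Run as a filter only when a target codepage is given on the command line.
if __name__ == "__main__" and len(sys.argv) == 2:
    main(sys.argv[1])
```

Under those assumptions it would slot into the pipeline from the quoted example roughly like this:

$ python3 codepage_filter.py cp1250 <all.utf8.lst | ./john -pipe -enc:cp1250 -rules:whatever hashfile

Output is written as raw bytes so that the candidates reach john in the target codepage rather than being re-encoded by the terminal's locale.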