Message-ID: <CANnLRdicZvcJy=D1DGxuMon9uvHFf5m3KWt8Csc3ULDCYBp8vQ@mail.gmail.com>
Date: Tue, 8 Oct 2013 10:11:06 -0600
From: Stephen John Smoogen <smooge@...il.com>
To: john-users <john-users@...ts.openwall.com>
Subject: How best to compute this via john

http://arstechnica.com/security/2013/10/how-the-bible-and-youtube-are-fueling-the-next-frontier-of-password-cracking/2/

I am guessing that some scripts would be needed to parse through a large
library of Project Gutenberg texts and Wikipedia articles and then output
something like:

Sentence as it is
Sentence lowercased
Sentence no punctuation/lowercased
First 2 words of sentence
First 3 words of sentence
....

I am guessing we are running out of time in the universe at this point.

I would try to speed it up by taking all the sentences (breaking out
anything in quotes as a separate sentence) and then sorting them by
frequency, most common first. That way "To be or not to be" would rank
higher than "The genus spectrum of the common toad spans a far distance."

Lord, this is going to be a load of work :).

--
Stephen J Smoogen.
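
For illustration only, here is a rough sketch of the kind of script the
message describes: split text into sentences (quoted passages counted as
their own sentences), rank them by how often they occur, and emit the
listed variants for each. Everything here is an assumption rather than
anything from John the Ripper itself: the function names are made up, the
sentence splitter is deliberately naive, frequency is counted on the
lowercased, punctuation-free form, and the "first N words" variants are
taken from that same form.

```python
import re
import string
from collections import Counter

# Hypothetical helpers for illustration; not part of John the Ripper.
SENTENCE_END = re.compile(r'(?<=[.!?])\s+')   # naive sentence boundary
QUOTED = re.compile(r'"([^"]+)"')             # text inside double quotes
PUNCT_TABLE = str.maketrans('', '', string.punctuation)


def normalize(sentence):
    """Lowercase, drop ASCII punctuation, collapse whitespace."""
    return ' '.join(sentence.lower().translate(PUNCT_TABLE).split())


def split_sentences(text):
    """Split text into sentences; quoted passages become separate sentences."""
    quoted = QUOTED.findall(text)
    rest = QUOTED.sub(' ', text)
    parts = SENTENCE_END.split(rest) + quoted
    return [' '.join(p.split()) for p in parts if p.strip()]


def variants(sentence):
    """Yield the candidate forms from the message, without duplicates."""
    lowered = sentence.lower()
    stripped = normalize(sentence)
    words = stripped.split()
    candidates = [sentence, lowered, stripped]
    # First 2 words, first 3 words, ... up to one short of the full sentence.
    candidates += [' '.join(words[:n]) for n in range(2, len(words))]
    seen = set()
    for c in candidates:
        if c and c not in seen:
            seen.add(c)
            yield c


def wordlist(text):
    """Return candidates, most frequent sentences first."""
    counts = Counter()
    originals = {}
    for s in split_sentences(text):
        key = normalize(s)
        if not key:
            continue
        counts[key] += 1
        originals.setdefault(key, []).append(s)
    seen, out = set(), []
    for key, _ in counts.most_common():
        for original in originals[key]:
            for c in variants(original):
                if c not in seen:
                    seen.add(c)
                    out.append(c)
    return out
```

The list returned by wordlist() could be written one candidate per line to
a file and passed to john with its --wordlist option. On a corpus the size
of Project Gutenberg plus Wikipedia the in-memory Counter would likely need
to be replaced by something disk-backed; this sketch ignores that.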