Message-ID: <20120319021750.GB22403@openwall.com>
Date: Mon, 19 Mar 2012 06:17:50 +0400
From: Solar Designer <solar@...nwall.com>
To: owl-dev@...ts.openwall.com
Subject: Re: blists status?

Hi Lauri,

Sorry for the delayed response.

On Sun, Feb 05, 2012 at 05:47:49PM +0200, Lauri Kokkonen wrote:
> Okay. I would like to be able to search messages by author, by subject
> and by message body.
>
> This is my initial idea for building a search index from scratch: For
> each word store pointers to all messages that contain that word and
> encode these pointers as ranges of messages in time or in thread. So
> store the first message and the number of following messages that also
> contain that word.
>
> For an example, searches like the following would be quite
> straightforward:
> word1 AND word2 AND NOT word3
>
> For searches like "word1 word2" (paragraph) we would need to do:
> word1 AND word2
> and then fetch the message bodies to check if the words are next to
> each other.
>
> But this would certainly need a lot more time, consideration, testing,
> statistics and so on to become something useful.

Right. So we'd need to decide on using a homebrew approach like this
vs. just going with Xapian or the like.

> So, I could start by doing something small first, like writing comments
> before few functions in mailbox.c, then proceed to write a flat threaded
> view and maybe remove duplicate code in html.c if needed.

Yes, please feel free to contribute patches for that.

Thanks,

Alexander
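
For illustration only, the range-encoded word index discussed in the quoted text could be sketched roughly as below. This is a minimal sketch in Python, not blists code: the function names (tokenize, build_index, expand, search_phrase), the in-memory list of message bodies, and the use of list positions as message numbers in time order are all assumptions made for the example.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(messages):
    """Map each word to a list of [first, count] runs of message numbers.

    `messages` is a list of body strings in time order (an assumption);
    a message's number is simply its position in that list.
    """
    index = {}
    for num, body in enumerate(messages):
        for word in set(tokenize(body)):
            runs = index.setdefault(word, [])
            if runs and runs[-1][0] + runs[-1][1] == num:
                runs[-1][1] += 1        # word also occurs in the next message: extend the run
            else:
                runs.append([num, 1])   # gap since the last occurrence: start a new run
    return index


def expand(index, word):
    """Decode the runs stored for `word` into a set of message numbers."""
    result = set()
    for first, count in index.get(word.lower(), []):
        result.update(range(first, first + count))
    return result


def search_phrase(index, messages, phrase):
    """Find messages containing the words of `phrase` next to each other.

    First narrow down with word1 AND word2 ..., then fetch each candidate
    body and check adjacency, as described in the quoted text.
    """
    words = tokenize(phrase)
    if not words:
        return []
    candidates = set.intersection(*(expand(index, w) for w in words))
    hits = []
    for num in sorted(candidates):
        tokens = tokenize(messages[num])
        last_start = len(tokens) - len(words)
        if any(tokens[i:i + len(words)] == words for i in range(last_start + 1)):
            hits.append(num)
    return hits


# Hypothetical sample data, made up for this example.
messages = [
    "search index design",
    "index of messages by thread",
    "thread view without search",
    "index search is slow",
]
index = build_index(messages)

# word1 AND word2 AND NOT word3
matches = (expand(index, "index") & expand(index, "search")) - expand(index, "thread")
phrase_hits = search_phrase(index, messages, "search index")
```

With this layout, boolean queries reduce to set operations on the decoded runs, while a phrase query still has to read the candidate bodies. A real index would presumably intersect the runs directly rather than expanding them into sets, and would need on-disk storage; both are left out here.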