From mboxrd@z Thu Jan 1 00:00:00 1970 From: guillaume.quintard@gmail.com (Guillaume Quintard) Date: Sat, 01 Aug 2009 20:31:55 +0200 Subject: [sup-talk] Fwd: xapian merged into next In-Reply-To: <1249149855-sup-2211@pion.club.cc.cmu.edu> References: <1248711109-sup-7061@entry> <1e5fdab70907270916o2f8e1768vbe7e3bcc1c807e39@mail.gmail.com> <1248711777-sup-9329@entry> <1e5fdab70907270931t7dfbe285h67197a7355b611d6@mail.gmail.com> <1248712876-sup-1446@entry> <1e5fdab70907270947n1866decdoc8a568cc9a2733ae@mail.gmail.com> <1248713360-sup-5448@entry> <1e5fdab70907271009v46639384w4bb3461ccaccf0cc@mail.gmail.com> <1248716073-sup-7443@masanjin.net> <1e5fdab70908010934l30373447r4a405c5ca0e406f9@mail.gmail.com> <1e5fdab70908011044q6743d213o554a7bd039e237c2@mail.gmail.com> <1249149855-sup-2211@pion.club.cc.cmu.edu> Message-ID: <1249151189-sup-3864@altis> Excerpts from Rich Lane's message of Sat Aug 01 20:14:34 +0200 2009: > I think we'd be safe not adding terms for email addresses longer than > 244 characters on the assumption that the user isn't going to want to > search for them. http://files.getdropbox.com/u/155904/grepped I did a simple grep, tell me if it's not enough (I'd rather not dive into the humongous mbox file). The mails come from the mailing-list admin tool (sympa), encoding problem it looks, the mangled part is "Propri?taires de liste" (list owners in french) -- Guillaume