From mboxrd@z Thu Jan 1 00:00:00 1970 Received: by 10.42.229.71 with SMTP id jh7cs113416icb; Wed, 26 Jan 2011 05:17:48 -0800 (PST) Received: by 10.42.166.138 with SMTP id o10mr441849icy.145.1296047868052; Wed, 26 Jan 2011 05:17:48 -0800 (PST) Return-Path: Received: from rubyforge.org (rubyforge.org [205.234.109.19]) by mx.google.com with ESMTP id 12si29214355qcd.178.2011.01.26.05.17.47; Wed, 26 Jan 2011 05:17:47 -0800 (PST) Received-SPF: pass (google.com: domain of sup-talk-bounces@rubyforge.org designates 205.234.109.19 as permitted sender) client-ip=205.234.109.19; Authentication-Results: mx.google.com; spf=pass (google.com: domain of sup-talk-bounces@rubyforge.org designates 205.234.109.19 as permitted sender) smtp.mail=sup-talk-bounces@rubyforge.org Received: from rubyforge.org (rubyforge.org [127.0.0.1]) by rubyforge.org (Postfix) with ESMTP id 3F39419782F8; Wed, 26 Jan 2011 08:17:47 -0500 (EST) Received: from steel.pruts.nl (pruts.nl [82.94.235.106]) by rubyforge.org (Postfix) with ESMTP id D1DD718583C7 for ; Wed, 26 Jan 2011 08:12:09 -0500 (EST) Received: from localhost (localhost.localdomain [127.0.0.1]) by steel.pruts.nl (Postfix) with ESMTP id 2DB22E20B9 for ; Wed, 26 Jan 2011 14:12:09 +0100 (CET) X-Virus-Scanned: amavisd-new at pruts.nl Received: from steel.pruts.nl ([127.0.0.1]) by localhost (steel.local [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 9j1QIViJbkzC for ; Wed, 26 Jan 2011 14:12:03 +0100 (CET) Received: by steel.pruts.nl (Postfix, from userid 1000) id 2355DE20BB; Wed, 26 Jan 2011 14:12:03 +0100 (CET) From: Ico Doornekamp To: sup-talk Date: Wed, 26 Jan 2011 14:12:03 +0100 Message-Id: <1296046853-sup-2234@pruts.nl> User-Agent: Sup/0.12 Subject: [sup-talk] Non-english outlook block quote regexp X-BeenThere: sup-talk@rubyforge.org X-Mailman-Version: 2.1.12 Precedence: list List-Id: User & developer discussion of Sup List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Sender: sup-talk-bounces@rubyforge.org Errors-To: sup-talk-bounces@rubyforge.org Hi, I'm unfortunate enough to have regular correspondence with Dutch outlook-users, which is mostly annoying because of the way outlook handles quoting of original messages. I found that Sup is able to handle block quotes from English outlook users where the regexp ^-----\s*Original Message\s*----+$ is used to find out where the quote starts. This fails unfortunately for other languages, because the 'Original Mesage' text seems to be localized. In dutch for example, the text 'Oorspronkelik Bericht' is used instead. Would it be an impovement to change this to a more generic regexp to match more languages. I was not able to find a complete list of possible strings used here, so some heuristics would be necassery. Any opinions on matching the exact number of hashes, two uppercase words and again the exact number of hashes, something like: ^-----\s*([A-Z][a-z]+\s*){2}----+$ Would that be safe to do ? -- :wq ^X^Cy^K^X^C^C^C^C _______________________________________________ sup-talk mailing list sup-talk@rubyforge.org http://rubyforge.org/mailman/listinfo/sup-talk