Archive of RubyForge sup-talk mailing list
 help / color / mirror / Atom feed
* [sup-talk] Non-english outlook block quote regexp
@ 2011-01-26 13:12 Ico Doornekamp
  2011-01-27 13:29 ` Tero Tilus
  0 siblings, 1 reply; 4+ messages in thread
From: Ico Doornekamp @ 2011-01-26 13:12 UTC (permalink / raw)
  To: sup-talk

Hi,

I'm unfortunate enough to have regular correspondence with Dutch
outlook-users, which is mostly annoying because of the way outlook
handles quoting of original messages.

I found that Sup is able to handle block quotes from English outlook
users where the regexp

  ^-----\s*Original Message\s*----+$

is used to find out where the quote starts. This fails unfortunately for
other languages, because the 'Original Mesage' text seems to be
localized. In dutch for example, the text 'Oorspronkelik Bericht' is
used instead.

Would it be an impovement to change this to a more generic regexp to
match more languages. I was not able to find a complete list of possible
strings used here, so some heuristics would be necassery.

Any opinions on matching the exact number of hashes, two uppercase words
and again the exact number of hashes, something like:

  ^-----\s*([A-Z][a-z]+\s*){2}----+$

Would that be safe to do ?



-- 
:wq
^X^Cy^K^X^C^C^C^C
_______________________________________________
sup-talk mailing list
sup-talk@rubyforge.org
http://rubyforge.org/mailman/listinfo/sup-talk


^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2011-01-28  9:26 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-01-26 13:12 [sup-talk] Non-english outlook block quote regexp Ico Doornekamp
2011-01-27 13:29 ` Tero Tilus
2011-01-27 15:00   ` Michael Stapelberg
2011-01-28  9:24     ` Ico Doornekamp

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox