Re: find and replace with regular expressions
- From: Mensanator <mensanator@xxxxxxx>
- Date: Thu, 31 Jul 2008 13:56:04 -0700 (PDT)
On Jul 31, 3:07 pm, chrispoliq...@xxxxxxxxx wrote:
I am using regular expressions to search a string (always full
sentences, maybe more than one sentence) for common abbreviations and
remove the periods. I need to break the string into different
sentences but split('.') doesn't solve the whole problem because of
possible periods in the middle of a sentence.
So I have...
----------------
import re
middle_abbr = re.compile('[A-Za-z0-9]\.[A-Za-z0-9]\.')
# this will find abbreviations like e.g. or i.e. in the middle of a
sentence.
# then I want to remove the periods.
----------------
I want to keep the ie or eg but just take out the periods. Any
ideas? Of course newString = middle_abbr.sub('',txt) where txt is the
string will take out the entire abbreviation with the alphanumeric
characters included.
'A test, ie, an example.'middle_abbr = re.compile('[A-Za-z0-9]\.[A-Za-z0-9]\.')
s = 'A test, i.e., an example.'
a = middle_abbr.search(s) # find the abbreviation
b = re.compile('\.') # period pattern
c = b.sub('',a.group(0)) # remove periods from abbreviation
d = middle_abbr.sub(c,s) # substitute new abbr for old
d
.
- References:
- find and replace with regular expressions
- From: chrispoliquin
- find and replace with regular expressions
- Prev by Date: Re: Python parsing iTunes XML/COM
- Next by Date: Re: Boolean tests [was Re: Attack a sacred Python Cow]
- Previous by thread: find and replace with regular expressions
- Next by thread: Newbie having issues with threads
- Index(es):
Relevant Pages
|