Re: Regex with ASCII and non-ASCII chars
- From: Peter Otten <__peter__@xxxxxx>
- Date: Wed, 31 Jan 2007 17:00:43 +0100
TOXiC wrote:
Thx it work perfectly.
If I want to query a file stream?
file = open(fileName, "r")
text = file.read()
file.close()
Convert the bytes read from the file to unicode. For that you have to know
the encoding, e. g.
file_encoding = "utf-8" # replace with the actual encoding
text = text.decode(file_encoding)
regex = re.compile(u"(ÿÿ‹ð…öÂ)", re.IGNORECASE)
match = regex.search(text)
if (match):
result = match.group()
print result
WritePatch(fileName,text,result)
else:
result = "No match found"
print result
It return "no match found" (the file contain the string "ÿÿ‹ð…öÂ"
but...).
Thanks in advance for the help!
Peter
.
- Follow-Ups:
- Re: Regex with ASCII and non-ASCII chars
- From: TOXiC
- Re: Regex with ASCII and non-ASCII chars
- References:
- Regex with ASCII and non-ASCII chars
- From: TOXiC
- Re: Regex with ASCII and non-ASCII chars
- From: Peter Otten
- Re: Regex with ASCII and non-ASCII chars
- From: TOXiC
- Regex with ASCII and non-ASCII chars
- Prev by Date: Re: "Correct" db adapter
- Next by Date: Re: error messages containing unicode
- Previous by thread: Re: Regex with ASCII and non-ASCII chars
- Next by thread: Re: Regex with ASCII and non-ASCII chars
- Index(es):
Relevant Pages
|