Re: Using regular expressions in internet searches



Python would be good for this, but if you just want a chuck-and-rumble
solution, this might do:


bash $ wget -r -e robots=off -l 0 -c -t 3 http://www.cnn.com/
bash $ grep -r "Micheal.*" ./www.cnn.com/*

Or you could do a wget/Python mix, like:

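import os
import re
import subprocess

# Mirror the site first (-e robots=off tells wget to ignore robots.txt).
subprocess.call(["wget", "-r", "-e", "robots=off", "-l", "0", "-c", "-t", "3",
                 "http://www.cnn.com/"])
# Note: the lazy .+? grabs only one character after "iraq "; use .+ for more.
re_iraq = re.compile(r"iraq .+?", re.IGNORECASE)

# Walk every file under ./www.cnn.com/ and print the matches found in each.
for root, dirs, files in os.walk("./www.cnn.com/"):
    for name in files:
        with open(os.path.join(root, name), errors="ignore") as f:
            iraqs = re_iraq.findall(f.read())
        print(iraqs)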
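
If you also want to know which file and line each hit came from (closer to what grep -r gives you), something along these lines might work. It is only a sketch: grep_tree is a name I made up, and it assumes wget has already mirrored the site into ./www.cnn.com/.

```python
import os
import re


def grep_tree(root, pattern, flags=0):
    """Yield (path, line_number, line) for every line under root matching pattern."""
    regex = re.compile(pattern, flags)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            # Mirrored pages can be in any encoding, so skip undecodable bytes.
            with open(path, encoding="utf-8", errors="ignore") as handle:
                for number, line in enumerate(handle, start=1):
                    if regex.search(line):
                        yield path, number, line.rstrip("\n")


if __name__ == "__main__":
    # Assumption: the mirror already exists in ./www.cnn.com/
    for path, number, line in grep_tree("./www.cnn.com/", r"iraq", re.IGNORECASE):
        print("%s:%d: %s" % (path, number, line))
```

Reading line by line keeps memory use low on big pages, and because grep_tree is a generator you can stop after the first few hits without scanning the whole mirror.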
