Re: text parsing



McKirahan wrote:
"Carolyn Marenger" <cajunk@xxxxxxxxxxxx> wrote in message
news:74fb1$479501d1$cf70133e$7458@xxxxxxxxxxxx
Can someone point me in the direction of some good documentation on text
parsing?

I want to take a bunch of text files (rtf), read them in and dump the
contents in a database. The files are effectively a flat file database,
with I suspect some fairly intricate programming needed to process the
files. Unfortunately, they are laid out for human readability, not data
conversion.

A few questions.

How many is a "bunch"?
What would the target database be -- MySQL?
What table and column structures do you envision?
Perhaps simply a single table with two columns:
filename (key) and a memo field containing the data?
What is the purpose behind doing this?

A few answers

A bunch is about a dozen. Basically one large file that was broken into sixteen subsets, following the initial letter for each record.

The target database would be MySQL

I haven't looked too closely at the data, but I think one main table with a few linked tables for those cases where there may be more than one piece of data for a category. There are about 25 categories to each record. Eventually there would be additional structure added around the imported data, but that isn't relevant to importing the data itself. (I will confirm this before beginning to code.

The purpose: I am a D&D fan and I run games. I would like to be able to reference the material and automate much of the process so I don't have to lug and reference 20lbs of books.

Thanks, Carolyn

.



Relevant Pages

  • Re: huge box of ram?
    ... LVS load balancer pointing to a bunch of Squid caches on PC's equipped ... of mirrored 36GB 15K SCSI drives on separate controllers. ... Large SATA drives aren't designed for high-access database activities. ...
    (comp.arch.storage)
  • Re: lot of hits in one go - server overloading questions
    ... >> c. are there any techniques to reduce the load on the server (e.g. ... So I would avoid hitting the database ... is there a way to measure server load and calculate expected load ... > and opening up a bunch of browser windows. ...
    (comp.lang.php)
  • Re: Check for valid email address
    ... Denzil Kruse wrote: ... > I have a bunch of old email addresses in a database and would like ... Because you are not going to send any unsolicited crap anyway, ...
    (perl.beginners)
  • Re: Controlling PAM modules
    ... with out changing the service conf file. ... _are_ the database. ... There are a bunch of strategies that you could use to, e.g., maintain your alterations as a diff to the base-system config so to make upgrades easier, but a) to answer your question, no, there's nothing standard for that, and b) that is an especially risky approach - you could completely break your security, letting anyone in, or locking legitimate users out, etc. ...
    (FreeBSD-Security)
  • Re: Small Projects
    ... Well, off the top of my head, you would need to set up a small database, and ... create a Windows Forms application with a bunch of buttons, dropdown lists, ...
    (microsoft.public.dotnet.framework.adonet)