Re: best method to perform operations on word lists



Thanks Bart,

Well, the issue is really a matter of pragmatism. If I do the work
manually or with some VBA macros it will take ages. The situation I am
trying to address is not so uncommon to people working on glossary
issues. Therefore I am trying to find a language-independant solution
which works, say, for 90% of the words. It won't work in situations
with irregular plurals like the one you mention (or e.g. French for
"Work/works": "Travail / travaux", German for "House/Houses":
"Haus/Häuser") or with character swaps in suffixes but at least it
would reduce substantially the number of cases to deal with.

I can define the length of a suffix with something like this:
for ($suffix=0 ; $suffix <= 1 ; $suffix++) {
or as a length difference between 2 words

I can also find out what is the root and the suffix of terms with
something like this:
$wordend = substr ($term,-$suffix);
$startposition = rindex ($term,$wordend); # position of suffix from
the end
$root = substr ($term,0,$startposition);
But for the moment I am struggling getting the value of one term and
the next one in order to compare them...Hope Dies Last !

Francois

.



Relevant Pages

  • Re: Why Agnus Dei, not Agne Dei?
    ... the consonant stems have undergone massive levelling from ... PIE compound nouns (root + suffix) could be stressed on the ... roots, so the paradigm is: ...
    (sci.lang)
  • Re: [OT] OED OL C/P [was: Re: -ward]
    ... -weard, primarily forming adjs., with the sense ... the suffix the primary sense of the root is preserved, ... root), which in sense coincides with OTeut. ... There is no secret, or if there is I've not been let in on it. ...
    (alt.usage.english)
  • Re: An invention called the Jewish people
    ... the root is accented throughout the paradigm, ... and both shift to the suffix in the weak ... the accent shifting to the ending in the weak cases (nom. ... for IE primary l-stems or root nouns in *-l, ...
    (sci.lang)
  • Re: stem or root?
    ... Stanley Newman in Yokuts Language of California uses ... "base" for a unit intermediate between root and stem. ... a stem is derived from a base by adding a derivational suffix. ...
    (sci.lang)