Re: split, no repeat- Regular expression
- From: anno4000@xxxxxxxxxxxxxxxxxxxxxxx (Anno Siegel)
- Date: 31 Aug 2005 10:03:04 GMT
Nina <tin_tint@xxxxxxxxxxx> wrote in comp.lang.perl.misc:
> I have a file, the content is like this:
>
> ATATTTGATTGGCCAGCCCTGCGTTTGCGGTTTTTTTTTGTTTTTTTATTTCCTGTATTTTTTTTGGGGGGGAAAAATTGCAGTTCCACGGA
> 4f-rnp Gene 204:267
> ACCTTATCGACTAGTATAAAAGGCACTGTCAGCTCTCCAGCCCGAACAAAATCGATCAAAATGCGCCCGCAATCAGCTGCGTGTCTATTACT
> 44D JMB 166:101
> ATGGGAGCGGTATGCTTAAATAGGGGCACCTTTTAATCCCTCTGGCCATTGGCAATCGATCCATTTAGTGGGAGCCATGTTCAAGTTGCTGG
> 44L JMB 166:101
> AACTTATGTAATCATATAGATTCTATAATAAACAAAGAAACAAAACTAGTTGTAAAACAAACACGATTCCTGTGTGTCATTGCGGGATATGG
> 74F EMBO 3:289
> TTTCCACACGATCGTGCTGCCTCCCAATAAACCCGGTGCAGTGAGTCAGTGTGTTGTGTGCCCCAGTCGCGAGCGGACGATCCGTGGAGATC
> Abdb EMBO 7:3223
> TGCGGATCAATTAAACCGTAAAAAACAGAGCAGGCGAGCGTAAGCAAGAGAGAGAGGTGAAGCCAGAGGCGGAGGCGCAAGACAAAGTGCAT
> abl p1 Oncogene 3:33
> AAAAAACAGAGCAGGCGAGCGTAAGCAAGAGAGAGAGGTGAAGCCAGAGGCGGAGGCGCAAGACAAAGTGCATTTTCAGGGCGTGTTTTTGA
> abl p2 Oncogene 3:33
> TAATAGTCGCTCAAAAGCTGTCGAGAGAGAGGGAGAGAAAAGAGAGAGTGAAAGCATAGTCCCGCTATTTTGCCGAGAGAAATAAAGAGCAG
> ace JMB 210:15
>
> for example, the first sequence, what I want is after sequence: 4f-rnp;
> AND then collect all this name to a new file.
> so the new file is like:
> 4f-rnp
> 44D
> 44L JMB
> 74F
> Abdb
> abl
> *here I don;t want another alb, so the output should not be repeated.*
> ace
>
> I know how to make script to split and get the name, but How can I
> avoid this repeatment?
Use a hash. See the FAQ "How can I remove duplicate elements from a
list or array?". It talks about arrays, not files, but the technique
is the same.
Anno
--
If you want to post a followup via groups.google.com, don't use
the broken "Reply" link at the bottom of the article. Click on
"show options" at the top of the article, then click on the
"Reply" at the bottom of the article headers.
.
- References:
- split, no repeat- Regular expression
- From: Nina
- split, no repeat- Regular expression
- Prev by Date: FAQ 4.25 How do I expand tabs in a string?
- Next by Date: Re: split, no repeat- Regular expression
- Previous by thread: split, no repeat- Regular expression
- Next by thread: Re: split, no repeat- Regular expression
- Index(es):
Relevant Pages
|
|