
kasuals
New User
Jul 30, 2011, 10:41 AM
Post #6 of 7
(2208 views)
Re: [albatros] HTML::Parser to strip HTML
"The solution you suggested (stripping the text after the file is created) is not a good idea for me. I could just as well use sed, Perl, or another program to strip part of the text from the created file, but I want a solution within my Perl program to simplify things. Thanks anyway for your interest and for answering my questions."

CPAN can release new modules all day, but I still stand by regex replacement. Regular expressions can and will remove any HTML tags from a file when done properly: read each line and apply a regular expression that removes the accompanying HTML markup.

*edit* Or rather, in your case: can and will remove any HTML tags from a string?

*posteditedit* Scripts for this already exist. I used one provided back in early 2000, so I would assume they are in-depth enough at this point to remove anything HTML-based from an HTML file.
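For what it's worth, here is a rough sketch of the kind of regex replacement I mean, done on a string inside the Perl program itself. The sub name strip_tags and the sample string are made up for illustration, and this is only a minimal version: it assumes reasonably simple markup, so a literal ">" inside an attribute value, or the contents of script and style blocks, would not be handled correctly.

```perl
use strict;
use warnings;

# Hypothetical helper (not from any module): removes anything that
# looks like an HTML comment or tag from the given string.
sub strip_tags {
    my ($text) = @_;
    $text =~ s/<!--.*?-->//gs;   # drop comments first; /s lets . match newlines
    $text =~ s/<[^>]*>//g;       # then drop tags, including ones split across lines
    return $text;
}

my $html = "<p>Hello, <b>world</b>!</p>";
print strip_tags($html), "\n";   # prints "Hello, world!"
```

Running it over the whole string rather than line by line matters when a tag is broken across two lines, since a per-line regex would never see both the "<" and the ">".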
(This post was edited by kasuals on Jul 30, 2011, 10:47 AM)