Jul 30, 2011, 10:41 AM
Post #6 of 7
Re: [albatros] HTML::Parser to strip HTML
[In reply to]
The solution you suggested (stripping the text after the file is created) is not a good idea for me. I could just as well use sed, Perl, or another program to strip part of the text from the created file, but I want a solution within my Perl program to keep things simple.
Thanks anyway for your interest and answers to my questions.
CPAN can release new modules all day, but I still stand by regex replacement. Regular expressions can and will remove any HTML tags from a file when done properly. Read each line, and use a regex that removes the HTML markup it contains.
*edit* Or, in your case: can and will remove any HTML tags from a string.
*posteditedit* There are existing scripts for this. I used one back in early 2000, so I would assume they are thorough enough by now to remove anything HTML-based from an HTML file.
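For what it's worth, here is a rough sketch of the regex approach applied to a string inside the Perl program. The sub name strip_html is just something I made up for illustration, and the sketch works on the whole string rather than line by line, since a tag or comment can span several lines. It assumes reasonably well-formed markup: a literal ">" inside an attribute value will trip it up, and entities such as &amp; are left as they are.

```perl
use strict;
use warnings;

# Hypothetical helper (the name is illustrative, not from any module).
# Removes comments, script/style blocks, and then any remaining tags.
sub strip_html {
    my ($html) = @_;
    $html =~ s/<!--.*?-->//gs;                            # comments, may span lines
    $html =~ s/<(script|style)\b[^>]*>.*?<\/\1\s*>//gis;  # script/style plus their content
    $html =~ s/<[^>]*>//g;                                # every remaining tag
    $html =~ s/[ \t]+/ /g;                                # collapse runs of spaces and tabs
    $html =~ s/^\s+|\s+$//g;                              # trim both ends
    return $html;
}

my $text = strip_html('<p>Hello <b>world</b>!</p>');
print "$text\n";    # prints "Hello world!"
```

If the input gets messier than that, this is probably the point where a real parser earns its keep.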
(This post was edited by kasuals on Jul 30, 2011, 10:47 AM)