
romild0
Novice
Jan 7, 2009, 1:13 AM
Post #1 of 15
(4264 views)
|
recursively find non-ascii characters in file
|
Can't Post
|
|
Hi! I have some nasty, non-ascii character in some files that contains php code. What I want to do here is to recursively find all the files that contains a specific non-ascii character in the file. And most importantly - i need to know the name of that file. So far, I found a script that looks into a file for non-ascii characters:
while (<>) { s/([\x80-\xff])/sprintf "\\x{%02x}",ord($1)/eg; print; } Ok, this is good, the non-ascii character that I'm looking for is: The problem here is that i can can't run this script to run recursively and I don't get the name of the file that contains this characters. I've tried with bash, but since it's standard output, I can't get any resault on this. Here is what I've tried:
find |xargs /usr/local/bin/check_for_non-ascii_characters.sh |grep -l 'x{ef}\\x{bb}\\x{bf}' So, I need a way to recursively find non-ascii characters (a specific pattern, mentioned before) in all files and I need the name of the files containing it. Thanks
(This post was edited by romild0 on Jan 7, 2009, 1:16 AM)
|