Aug 3, 2000, 4:12 AM
Post #2 of 4
Google is not programmed in perl.
It was programmed in c++ and delphi.
I asked them a while back.
Yahoo was programmed in c++, and python.
Perl is good when it comes to data handling but perl can only do so much. Perl couldn't preform the same functions as google but it can get really close.
You could mimic yahoo easily with perl. Same with altavista.
I'd search for some tutorials on building a search engines. There every where. Use google. or go to
That's where I learned
As for the spyders, I'm working on one right now. So I don't know all of the details yet. But first you have to figure out wether you want to use the sites' meta tags to get the info and have it index a data base full of sites that way or you want to do searches at popular engines and index their results.
The first method is easier, what it would do is loop through a data base full of urls and go to each site and split the html to get the meta tags and return the data. I'm sorry I can't just explain the code because I'd be here for a long time. You will learn eventually.
for a decent spyder go to
It will pack a data base with urls and descriptions and titles and stuff. It runs off your desktop.
But I know you want to right your own. So I'd start by reading docs (Here on the right) and just start tinkering around learning how to open data bases and basic regex and how to parse web pages. Then you won't have any problems. But if your a begginer to perl there is nothing I can just tell you that will enable you to wite search engines and spyders. You'll just have to learn like the rest of us. I don't think that it will take you long. Were here to help you after all.
I wasn't trying to tell you how to write the engines in this post but instead lead you into the direction I came from because it's the easiest way to learn.
Another great way to learn is to go to
and download search engines to see how other programmers did it. Then post about the code you don't understand.
Hope you didn't take offense to my pointing around.