Sorry to triple post .. but this program is stupid simple to use.
I had no issues with licenses ... in fact, to transfer to my ubuntu box all I did was copy the files to my ext. hard drive and then open the right file on ubuntu. After about 30 minutes of it running I've got 200 working anon proxies (and growing) out of a 19k list.
Fucking awesome Matt ... would pay 3x your cost for this product. Great work!
Suggestion: Timeout thresholds would be nice ... like if the proxy doesn't work in < x seconds don't include it in the list.
 . Can I execute this program from my web host? I remember seeing somebody say something about doing that over SSH. I downloaded Putty yesterday and setup SSH over at my shared Host Gator account.
. Can I execute this program from my web host? I remember seeing somebody say something about doing that over SSH. I downloaded Putty yesterday and setup SSH over at my shared Host Gator account.Did you use the default list that was included?? I let this sucker run on my Vista laptop for over 20 minutes without finding any Proxies. This was with the newest version that was posted to this thread.
Rage9, the more often you update the list, generally the better the alive/dead ratio will be. PMing you about dodgy proxies.
Yeah but scraping huge lists of random proxies you're bound to run into these problems. It's actually pretty expected.
I just wanted to put the word out that you need to be more vigilent in how you use them. For example I'm writing a scraper that has proxy support and if the data I expect is not being returned you have to be able to skip those dodgy proxies.
The way I'd solve it is when checking to see if the proxy is any good, query a website or server and check the output to see if it's good. For example you could setup a simple PHP script (or whatever language you choose) on a server and it just returns a small amount of data. If you get that data back from the proxy it would mean it isn't being used in another way. At the same time having a bunch of us constantly bashing that server may be another problem.
That's why this is a proxy harvester not a proxy in depth elite tester.

*cough* scrapebox with certain footprints, been mentioned on wf by gutterseo *cough*
it now does additional checks to strip out planetlab and codeen proxies.
 
	