Shooting The Shit: This is where the action is for all webmasters alike. Anything goes, seriously. Come meet and network with your peers; it's a fun way to take a break from your busy day of posting at other boring forums.
#2 (permalink)
Join Date: Sep 2007
Location: Beantown, MA
Posts: 1,481
quick log on your second account and promote your product
__________________
Nickycakes.com: Reformed Blackhat Nickycakes' Newbie Guide #cakes irc.freenode.net

#5 (permalink)
Senior Member
Join Date: Aug 2006
Location: WA
Posts: 494
I'm feeling generous today. Create two files. One with a list of URLs (one per line) and one with a list of keywords to scan for. Have fun.
Code:
import re
import sys
from io import BytesIO

import pycurl


def getpage(url):
    """Fetch url and return the body as text, or None if the request fails."""
    c = pycurl.Curl()
    c.setopt(pycurl.URL, url)
    resp = BytesIO()
    c.setopt(pycurl.WRITEFUNCTION, resp.write)
    try:
        c.perform()
    except pycurl.error:
        return None
    finally:
        c.close()
    return resp.getvalue().decode('utf-8', 'replace')


def scanpage(html, keywords):
    """Count case-insensitive matches of each keyword (treated as a regex)."""
    count = {}
    for kwd in keywords:
        kwd = kwd.strip()
        if kwd:  # skip blank lines in the keyword file
            count[kwd] = len(re.findall(kwd, html, re.I))
    return count


if __name__ == '__main__':
    if len(sys.argv) < 3:
        print('Usage: python scanner.py [url_file] [kwd_file]')
        sys.exit(1)
    with open(sys.argv[1]) as f:
        urls = [line.strip() for line in f if line.strip()]
    with open(sys.argv[2]) as f:
        keywords = f.readlines()
    res = {}
    for url in urls:
        html = getpage(url)
        if html is None:
            continue  # skip a URL that fails instead of aborting the whole run
        res[url] = scanpage(html, keywords)
    print(res)
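
One thing to watch in the scanner above: each keyword is handed to re.findall as a regular expression, so a keyword containing characters like "." or "+" is read as a pattern, not as literal text. Here is a minimal sketch of a literal-match variant, assuming the keywords are meant as plain text. The name count_literal and the sample string are made up for illustration and are not part of the original post.

```python
import re


def count_literal(html, keywords):
    """Count case-insensitive literal occurrences of each keyword."""
    counts = {}
    for kwd in keywords:
        kwd = kwd.strip()
        if not kwd:
            continue  # ignore blank lines from the keyword file
        # re.escape makes regex metacharacters match themselves
        counts[kwd] = len(re.findall(re.escape(kwd), html, re.I))
    return counts


# Hypothetical sample text, standing in for a downloaded page
sample = "Only 9.99 today, was 9x99. FREE shipping, free returns."
print(count_literal(sample, ["9.99\n", "free\n", "\n"]))
# → {'9.99': 1, 'free': 2}
```

Without the escape, the pattern "9.99" would also match "9x99" in the sample, because "." matches any character.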
#9 (permalink)
Vista Ready
Join Date: Nov 2007
Location: nashville
Posts: 379
Do this for each url in the list:
Code:
$url = 'your url here';
$term = 'what to look for';
// Note: $url holds the address itself, so this counts matches in the URL string.
$count = substr_count(strip_tags($url), $term);
if ($count > 0) {
    // do stuff here
}
__________________
http://www.faking.net - social engineering at its finest

#10 (permalink)
Join Date: Sep 2007
Location: Beantown, MA
Posts: 1,481
@ everyone except davidr:
pretty sure he means get the text from the page AT each url =P
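
To make that point concrete, here is a rough Python sketch of "download the page at the URL, strip the tags, then count the term". It is only one way to do it, using the standard library. The names TextExtractor, visible_text, count_term and count_term_at are hypothetical, UTF-8 decoding is an assumption, and the match is case-insensitive, which is a choice made here and not something the thread specifies.

```python
import urllib.request
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect text nodes, skipping the contents of script and style tags."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.parts.append(data)


def visible_text(html):
    """Return the page text with markup removed."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)


def count_term(html, term):
    """Count case-insensitive occurrences of term in the visible text."""
    return visible_text(html).lower().count(term.lower())


def count_term_at(url, term, timeout=10):
    """Fetch the page at url (assumed UTF-8) and count term in its text."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        html = resp.read().decode("utf-8", "replace")
    return count_term(html, term)


# Offline check on a literal HTML string, no network needed
page = ('<html><body><a href="/widgets">Blue Widgets</a> and more widgets'
        '<script>var widgets = 1;</script></body></html>')
print(count_term(page, "widgets"))  # → 2
```

The href attribute and the script body both contain "widgets" but are not counted, since only visible text is scanned.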
#11 (permalink)
Member
Join Date: Sep 2007
Posts: 42
Quote:
@mike82 bust open a unix cmd line and enter:
for url in `cat urls.txt`; do lynx $url -dump | grep "my_keyword" | wc -l && echo $url; done
*that'll be $37.00, send it via paypal
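
Worth noting about the one-liner: grep piped into wc -l counts the lines that contain the keyword, not the total number of occurrences, so a line with the keyword twice still counts once. A small Python sketch of the difference, using a made-up text dump; matching_lines and occurrences are hypothetical helper names.

```python
def matching_lines(text, keyword):
    """Number of lines containing keyword, like `grep keyword | wc -l`."""
    return sum(1 for line in text.splitlines() if keyword in line)


def occurrences(text, keyword):
    """Total number of non-overlapping occurrences of keyword."""
    return text.count(keyword)


# Hypothetical text dump of a page
dump = "widgets on sale\nblue widgets, red widgets\nnothing here\n"
print(matching_lines(dump, "widgets"))  # → 2
print(occurrences(dump, "widgets"))     # → 3
```

On the shell side, adding -o to grep prints each match on its own line, so grep -o "my_keyword" | wc -l gives the occurrence count instead.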