I'm looking for a web crawler that I can use to crawl website's source codes. Has to be able to do the following.
-Search websites source code and be able to detect if a certain piece of code is in the HTML then pull the URL into a TXT file
-If it finds a website with the defined piece of code it then crawls the source for a different piece of code and if that is present it will then not pull the URL.
-Can input a list of URL's for it to crawl.
-A not manual method of finding websites to crawl.
Anything like this exist or will I have to get a coder to code it for me? What would be the best language to get it coded in?
-Search websites source code and be able to detect if a certain piece of code is in the HTML then pull the URL into a TXT file
-If it finds a website with the defined piece of code it then crawls the source for a different piece of code and if that is present it will then not pull the URL.
-Can input a list of URL's for it to crawl.
-A not manual method of finding websites to crawl.
Anything like this exist or will I have to get a coder to code it for me? What would be the best language to get it coded in?