Fun With Finding Unique Content

Status
Not open for further replies.

Aequitas

New member
Feb 19, 2007
Canada
Fuck, I was so bored about 2 hours ago that I decided to start work on an automated unique content finder. I figured I'd create a quick one and give it away for free or some shit. This is what I've got so far; I put it up on an empty domain name of mine, which is located here.

Basically, you toss in a URL, it grabs some content off that page and checks whether it's unique in the eyes of Google. If it is, it tells you in bold that it's unique content; if it's not, it tells you in bold that it's not.

It probably won't work with http(s) sites, but I haven't tried it yet. Maybe if I've got time tomorrow I'll clean it up a bit, automate it, and see where it goes from there.

This is nothing special, just something to play around with, so you'll notice a lot of crap I haven't filtered out. But there it is. I'll bundle it up and give it away for free whenever I get the time to play with it some more.

EDIT: It works best with blog URLs.
 


Cool little tool, although it didn't work on some of my domains.

Still, I'd be interested in using it when you get it 100% done. :)

Yeah, it's pretty basic right now. It was a quick job, but I'll get around to making it work with the majority of sites, make it test the content to make sure it's a decent length, and filter out the dates, footers and other crap.
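For anyone wondering what that planned cleanup could look like, here is a rough sketch in Python. It is not the tool's actual code: the `MIN_WORDS` threshold, the date and footer patterns, and the `usable_snippets` name are all made up for illustration, and a real page would need its own tuning.

```python
import re

# Hypothetical thresholds and patterns; tune them for the pages being checked.
MIN_WORDS = 8
DATE_PATTERN = re.compile(
    r"\b(?:jan|feb|mar|apr|may|jun|jul|aug|sep|oct|nov|dec)[a-z]*\.? \d{1,2}, \d{4}\b",
    re.IGNORECASE,
)
FOOTER_PATTERN = re.compile(r"copyright|all rights reserved|powered by", re.IGNORECASE)


def usable_snippets(paragraphs):
    """Return only the paragraphs worth checking for uniqueness."""
    kept = []
    for text in paragraphs:
        # Drop dates such as "Feb 19, 2007", then collapse leftover whitespace.
        text = " ".join(DATE_PATTERN.sub("", text).split())
        # Skip anything that looks like a site footer.
        if FOOTER_PATTERN.search(text):
            continue
        # Skip snippets too short to say anything about uniqueness.
        if len(text.split()) < MIN_WORDS:
            continue
        kept.append(text)
    return kept
```

Short snippets are dropped because a two or three word phrase will match somewhere on the web no matter how original the post is.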
 
If it tells me my content isn't unique, how can I figure out where it was copied from so I can bust some ass?
 
Well Google said it was from 2 directories, the index and the /feed folder. So no big deal, false alarm.
 

Hey, I didn't even know they existed haha. Yeah, it's pretty much like that tool.

Does it only read within the body tags?

No, it doesn't read everything in the body tags. It grabs all of the paragraph tags, then strips out all HTML elements, leaving only the text. Most properly formatted blogs will have the gist of the information between the paragraph tags, but that's not always the case.
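The paragraph-grabbing step described above can be sketched with Python's standard `html.parser` module. This is only a guess at the approach, not the tool's real code; the `ParagraphText` class and `extract_paragraphs` function are names invented for this example.

```python
from html.parser import HTMLParser


class ParagraphText(HTMLParser):
    """Collects the text inside <p> elements and drops all markup."""

    def __init__(self):
        super().__init__()
        self.inside = False    # True while we are inside a <p> element
        self.current = []      # text fragments of the paragraph being read
        self.paragraphs = []   # finished paragraphs as plain text

    def flush(self):
        # Join the fragments and collapse runs of whitespace.
        text = " ".join("".join(self.current).split())
        if text:
            self.paragraphs.append(text)
        self.current = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self.flush()       # a new <p> implicitly ends an unclosed one
            self.inside = True

    def handle_endtag(self, tag):
        if tag == "p":
            self.flush()
            self.inside = False

    def handle_data(self, data):
        # Text inside nested tags such as <b> or <a> still counts.
        if self.inside:
            self.current.append(data)


def extract_paragraphs(html):
    parser = ParagraphText()
    parser.feed(html)
    parser.close()
    parser.flush()             # keep a trailing paragraph with no </p>
    return parser.paragraphs
```

Anything outside a paragraph tag (headings, sidebars wrapped in divs) is ignored, which is why pages that don't put their content in paragraph tags give poor results.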

Once it grabs the content, it goes to Google and does a search with double quotes around it to see if any results are found. If it finds results, the content is not unique to Google; if it doesn't find any, it's unique to Google.
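As a rough illustration of that check, the sketch below builds the quoted query and applies the "no results means unique" rule. It deliberately does not fetch or parse Google's results page. The function names and the `max_words` cutoff are assumptions made for this example, since the post doesn't say how much text the tool actually quotes.

```python
from urllib.parse import urlencode


def exact_match_query(text, max_words=12):
    """Wrap the first few words of the text in double quotes."""
    # Remove embedded quotes so they can't break the quoted phrase.
    words = text.replace('"', " ").split()[:max_words]
    return '"' + " ".join(words) + '"'


def search_url(text):
    """Build a Google search URL for the quoted phrase."""
    return "https://www.google.com/search?" + urlencode({"q": exact_match_query(text)})


def is_unique(result_count):
    """The rule described above: zero results means unique to Google."""
    return result_count == 0
```

For example, `search_url("hello world")` gives `https://www.google.com/search?q=%22hello+world%22`. How the result count is obtained is left out on purpose.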
 
Don't count on putting quotes around stuff forcing Google to bring back exact matches anymore.
 
But isn't it a matter of time before that content is considered unique by Google? Specifically, to that site?
 
You took it down! Damn you! Put it back! :)

I needed to use that site for testing other shit. My fucking localhost went to hell and back on me yesterday, so I said fuck it, I'll fix it later and test with an empty domain name hahaha.

But isn't it a matter of time before that content is considered unique by Google? Specifically, to that site?

I'm not 100% sure if this is true but I explained it all on Principle Of Marketing
 
Forbidden

You don't have permission to access / on this server.

Additionally, a 500 Internal Server Error error was encountered while trying to use an ErrorDocument to handle the request. Apache/1.3.33 Server at professionalwordpresstheme.com Port 80



Hmmm I think this is not unique content. Seen that elsewhere.
 

Haha, what are you talking about man, that shit is unique, unique to me. Fine, I'll put it back up for more people to play with.
 