Hi,
I am scraping the contents of a page (this isn't for scraping for spammy sites or anything. 2 seperate servers need to share information sometimes, quickest/easiest way is to just scrape it).
But everytime there is a bullet point, em dash or anything, it puts in extra characters like ”.
How can i stop this? It displays fine on the page it is being scraped from. And they have the same character encoding on both pages.
I am doing it in php, btw.
I am scraping the contents of a page (this isn't for scraping for spammy sites or anything. 2 seperate servers need to share information sometimes, quickest/easiest way is to just scrape it).
But everytime there is a bullet point, em dash or anything, it puts in extra characters like ”.
How can i stop this? It displays fine on the page it is being scraped from. And they have the same character encoding on both pages.
I am doing it in php, btw.