AuthorPosts
-
September 2, 2006 at 6:27 am #596629AnonymousInactive
OK, I’m trying to learn about scrapers. I use Google Alerts and this one keeps coming to my inbox:
xxhttp://2512.butigdw.org/
If you scroll down, you can see that he/she has added a bunch of info taken from other sites (is this what scraping means?), followed by a block of meaningless text, probably there just to fill the page and raise the keyword density. My URL is in there, but it's not hyperlinked.
Is this what you call scraping? I assume they use some kind of software to gather all this info and create the pages. Is this the kind of thing you guys report to the affiliate managers?
September 2, 2006 at 1:44 pm #704334AnonymousInactive
Yes.
It is a program that comes to your site, steals some of your content, and uses it to make a new site. That makes your site subject to penalties for duplicate content, while the scraper rides on your work and ranking.
It’s theft and it hurts you.
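For anyone who wants a rough number for how much of a page was lifted, here is a minimal sketch in Python. It is not a tool anyone in this thread used; the function names and the five-word window are assumptions made for illustration. It compares overlapping word sequences ("shingles") from your own text against the text of the suspect page.

```python
import re

def shingles(text, size=5):
    """Return the set of overlapping word n-grams ("shingles") in text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < size:
        # Text shorter than one window: treat the whole thing as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}

def overlap_ratio(original, suspect, size=5):
    """Fraction of the original's shingles that also appear in the suspect text."""
    mine = shingles(original, size)
    if not mine:
        return 0.0
    return len(mine & shingles(suspect, size)) / len(mine)
```

A ratio near 1.0 means almost all of your wording shows up on the other page; a ratio near 0.0 means little or none of it does. You would paste in the visible text of both pages yourself; the sketch does not fetch anything.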
September 2, 2006 at 3:07 pm #704344AnonymousInactive
I agree with Dominique, this is your typical SERP scrape. It runs on a subdomain, and I would also bet the root domain butigdw.org/ returns a 404. The culprit's first course of action would be to get butigdw.org/ indexed with a legitimate site. After that, the culprit will operate solely on subdomains.
The number 2512 is the number of subdomains this person is operating under, perhaps even more. For instance, if you typed in xxx2512.butigdw.org you would pull up another spam site.
For xxx2512.butigdw.org there is hidden text worth viewing.
greek39
September 2, 2006 at 4:36 pm #704354AnonymousInactive
If the root domain isn't even set up yet and returns a 404, how can all the subdomains show up? And is this little bit of duplicate content even worth reporting?
Also, how are you viewing the hidden text? I am using Firefox with the webmasters toolbar, but I cannot see how to view these things. If I look at the code, there isn't much to see at all.
Can you help me to understand?
September 2, 2006 at 4:46 pm #704356AnonymousInactive
To view hidden text, scroll down and hit Ctrl+A. This is not just one person, but many. View the hidden text carefully, then view the source. Next, locate the IP for butigdw.org. Just trying to help people learn to spot this garbage on their own.
greek39
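The Ctrl+A trick works because selecting everything highlights text that has been colored to match the page background. If you would rather check a saved copy of the page source, a rough sketch along these lines might help. It is only a heuristic under stated assumptions: it looks at inline `style` attributes and old-style `color`/`bgcolor` attributes, assumes a white page when the body sets no background, ignores external stylesheets, and can be thrown off by unclosed tags. All names in it are made up for illustration.

```python
from html.parser import HTMLParser
import re

HIDING_STYLE = re.compile(r"display\s*:\s*none|visibility\s*:\s*hidden", re.I)
COLOR_IN_STYLE = re.compile(r"(?<![-\w])color\s*:\s*([^;]+)", re.I)
BG_IN_STYLE = re.compile(r"background(?:-color)?\s*:\s*([^;]+)", re.I)
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}
# Only a few common spellings are normalized; this table is an assumption.
COLOR_ALIASES = {"white": "#ffffff", "#fff": "#ffffff",
                 "black": "#000000", "#000": "#000000"}

def normalize(color):
    c = color.strip().lower()
    return COLOR_ALIASES.get(c, c)

class HiddenTextFinder(HTMLParser):
    def __init__(self):
        super().__init__()
        self.page_bg = "#ffffff"  # assumed default when <body> sets nothing
        self.stack = []           # one flag per open element: hides its text?
        self.hidden = []          # text fragments found inside hidden elements

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attr = dict(attrs)
        style = attr.get("style") or ""
        if tag == "body":
            bg = attr.get("bgcolor") or ""
            match = BG_IN_STYLE.search(style)
            if match:
                bg = match.group(1)
            if bg:
                self.page_bg = normalize(bg)
        fg = attr.get("color") or ""
        match = COLOR_IN_STYLE.search(style)
        if match:
            fg = match.group(1)
        same_as_bg = fg != "" and normalize(fg) == self.page_bg
        self.stack.append(bool(HIDING_STYLE.search(style)) or same_as_bg)

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self.stack:
            self.stack.pop()

    def handle_data(self, data):
        if any(self.stack) and data.strip():
            self.hidden.append(data.strip())

def find_hidden_text(html):
    finder = HiddenTextFinder()
    finder.feed(html)
    finder.close()
    return finder.hidden
```

Calling `find_hidden_text(page_source)` on the saved source returns the text fragments the heuristic thinks a visitor would not normally see. An empty list does not prove the page is clean, since hiding done through a stylesheet or script is not detected here.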
September 2, 2006 at 4:59 pm #704357AnonymousInactive
Next, do a whois lookup on the domain and start the complaints runaround. If the whois is fake, submit the URL to the host and complain. If they follow up on this stuff, they will take it down. *fingers crossed*
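If you prefer to do the whois lookup from a script rather than a web form, a minimal sketch could look like the following. It speaks the plain WHOIS protocol (a query line over TCP port 43) and then pulls out any e-mail addresses in the reply so you know where a complaint might go. The default server `whois.iana.org` and both function names are assumptions for illustration; as far as I know, IANA's reply usually just refers you to the registry's own WHOIS server, which you would then query the same way.

```python
import re
import socket

def whois_query(domain, server="whois.iana.org", timeout=10):
    """Send a plain WHOIS query (TCP port 43) and return the raw reply text."""
    with socket.create_connection((server, 43), timeout=timeout) as conn:
        conn.sendall(domain.encode("ascii") + b"\r\n")
        chunks = []
        while True:
            data = conn.recv(4096)
            if not data:  # the server closes the connection when it is done
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")

EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")

def contact_emails(whois_text):
    """Collect the distinct e-mail addresses in a WHOIS reply, in order of appearance."""
    seen = []
    for address in EMAIL.findall(whois_text):
        address = address.lower()
        if address not in seen:
            seen.append(address)
    return seen
```

Typical use would be `contact_emails(whois_query("butigdw.org"))`. The addresses it finds are only as trustworthy as the whois record itself, so a fake record will give you fake contacts, and the host's abuse address is then the better bet.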
September 2, 2006 at 5:01 pm #704358AnonymousInactive
Thanks, it looks like the page has already changed. All the stuff he scraped from other websites is gone already. Hmmmm…..
September 2, 2006 at 5:04 pm #704359AnonymousInactive
The site has changed already? Well, that is good news, I would say.
greek39
September 2, 2006 at 5:15 pm #704361AnonymousInactive
ewhitaker, are you happy with what you are seeing now?
greek39