Get exclusive CAP network offers from top brands

View CAP Offers

Is this scraping?

[bsa_pro_ad_space id=2]
  • This topic is empty.
Viewing 9 posts - 1 through 9 (of 9 total)
  • Author
    Posts
  • #596629
    Anonymous
    Inactive

    OK, I’m trying to learn about scrapers. I use Google Alerts and this one keeps coming to my inbox:

    xxhttp://2512.butigdw.org/

    If you scroll down, you can see that he/she added a bunch of info off of other sites (is this what scraping means?), then there is a bunch of text that means nothing, probably just to fill the page to get the keyword density. My URL is in there, but its not hyperlinked.

    Is this what you call scraping and I assume they use some certain software to gather all this info and create the pages? Is this the kind of things you guys report to the affiliate managers?

    #704334
    Anonymous
    Inactive

    Yes.

    It is a program that cmes to your site and steals some of your content and uses it to make a new site, making your site subject to penalties for duplicate content and riding on your work and ranking.

    It’s theft and it hurts you.

    #704344
    Anonymous
    Inactive

    I agree with Dominique, this is your typical serp scrape. A subdomain and I would also bet the domain butigdw.org/ is 404. First course of action would be for butigdw.org/ to get indexed with a legitimate site. After the culprit will operate soley on subdomains.

    The number 2512 is the amount of subdomains this person is operating under, perhaps even more. For instance if your typed in xxx2512.butigdw.org you would pull up another spam site.

    For xxx2512.butigdw.org there is hidden text worth viewing.

    greek39

    #704354
    Anonymous
    Inactive

    If the original subdomain isn’t even registered yet and is a 404, how can all the subdomains show up? And is this little bit of duplicate content even worth reporting?

    Also – how are you viewing the hidden text? I am using Firefox with the webmasters toolbar, but I cannot see how to view these things. If I look at the code, there isnt much to see at all.

    Can you help me to understand?

    #704356
    Anonymous
    Inactive

    To view hidden text scroll down hit Ctrl+a . This is not just one person, but many. View the hidden text text carefully then view the source. Next locate the IP for butigdw.org . Just trying to help people to learn to spot this garbage on their own. greek39

    #704357
    Anonymous
    Inactive

    Next do a whois lookup with the domain and start the complaints runaround. If the whois is fake submit the url to the host and complain and if they follow up on this stuff they will take it down. *fingers crossed*

    #704358
    Anonymous
    Inactive

    Thanks – it looks like the page has already changed – all the stuff he scraped from other websites is gone already. Hmmmm…..

    #704359
    Anonymous
    Inactive

    The site has changed already? well that is good news I would say. greek39

    #704361
    Anonymous
    Inactive

    ewhitaker are you happy with what you are seeing now? greek39

Viewing 9 posts - 1 through 9 (of 9 total)