Author Topic: Software download sites - scrapers and non-scrapers  (Read 3654 times)

suleika

  • Supporting Member
  • Joined in 2007
  • Posts: 117
Software download sites - scrapers and non-scrapers
« on: August 07, 2007, 01:12 PM »
Inspired by some comments in the thread "Article: Please don't steal this Web content":

It can be a pretty discouraging situation to be in when someone steals your content like this. It really doesn't do much for encouraging the victim to keep working hard and producing new content.

And I have some mixed feelings about software download sites that do this. On the one hand, they are promoting my software.

On the other, they are copying partial content from my web pages, leaving out important information or presenting it in a way that makes no sense. This is unfair and can make me and my applications look bad.

Most of the time it is for applications I don't have PAD files for. If I had wanted these applications to be included on software sites, I would have created PAD files for them and submitted them myself.

Is there a thread somewhere on this site on the good, average and less good software sites, based on how they treat the authors, link back (or not), nick copy without asking (or not)? I'd be curious.

Whenever I googled for software I got very fed up with pages of results from these scraping sites. They all include the same short descriptions of the software, with no reviews, discussion or comparison, and sometimes no links to the software home pages. I got so fed up that a couple of weeks ago I created a custom Google search which excludes these sites. I can modify it any time by "including" sites (which are then emphasized in the search results) or by adding more excluded sites. It works very well for me.

Here are my excluded sites so far:

   download3000.com
   topshareware.com
   softsia.com
   cleansofts.com
   softlandmark.com
   shareme.com
   bestfreewaredownload.com
   safesite.com
   freewarepub.org
   hotlib.com
   freesoft411.com
   freeware-guide.com
   bluesofts.com
   freeware-market.com
   softslist.com
   dirfile.com
   brothersoft.com
   filebuzz.com
   softplatz.com
   redsofts.com
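
For anyone without a custom search engine set up, a similar effect can be roughly approximated in an ordinary Google query with the `-site:` operator. This is a different technique from the custom search described above, and the sketch below is only an illustration: the function names `build_query` and `build_search_url` are made up for this example, and only the first few sites from the list are included for brevity. Very long exclusion lists may run into query length limits.

```python
from urllib.parse import urlencode

# A few of the excluded sites from the list above (shortened for brevity).
EXCLUDED_SITES = [
    "download3000.com",
    "topshareware.com",
    "softsia.com",
]


def build_query(terms, excluded_sites):
    """Return a search string with one -site: filter per excluded domain."""
    exclusions = " ".join(f"-site:{domain}" for domain in excluded_sites)
    return f"{terms} {exclusions}".strip()


def build_search_url(terms, excluded_sites):
    """Return a Google search URL carrying the filtered query (hypothetical helper)."""
    query = build_query(terms, excluded_sites)
    return "https://www.google.com/search?" + urlencode({"q": query})


if __name__ == "__main__":
    # Example search terms are arbitrary.
    print(build_search_url("file renamer freeware", EXCLUDED_SITES))
```

Unlike the custom search, this only excludes sites; it has no equivalent of "including" sites so that they are emphasized in the results.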




iphigenie

  • Supporting Member
  • Joined in 2006
  • Posts: 1,170
Re: Software download sites - scrapers and non-scrapers
« Reply #1 on: August 08, 2007, 02:53 PM »
Ah, none of those are in my bookmarks  8)


Darwin

  • Charter Member
  • Joined in 2005
  • Posts: 6,984
Re: Software download sites - scrapers and non-scrapers
« Reply #2 on: August 08, 2007, 03:00 PM »
Excellent idea, suleika, and thanks for the list of sites. This will help me to do the same - the scraping sites make me a bit crazy, too (not that I'm calling you crazy, mind!).

mouser

  • First Author
  • Administrator
  • Joined in 2005
  • Posts: 40,896
Re: Software download sites - scrapers and non-scrapers
« Reply #3 on: August 08, 2007, 03:11 PM »
nice idea suleika, i like the idea of creating a custom google software search thing which excludes bad sites.

let me say i don't fully understand the objection to software listing sites taking info from public web pages describing the software.  to me this isn't really something i expect to be asked for permission about.

however for me this feature is key:
"sometimes no links to the software home pages"
this is completely unacceptable to me.  any software listing site that doesn't have a prominent link to the original web page of the program is not one to be trusted.