• strawberry@kbin.run
    link
    fedilink
    arrow-up
    21
    ·
    7 months ago

    you think either of those companies pays attention to robots.txt? its not legally binding or anythjng

    • Skull giver@popplesburger.hilciferous.nl
      link
      fedilink
      English
      arrow-up
      15
      arrow-down
      1
      ·
      7 months ago

      They generally do, because very few people bother to block them anyway. Complying with robots.txt is a good way to show in court that you did put effort into complying with the usual standards, without it ever impacting the useful information you’re scraping.

    • MrMcGasion@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      ·
      7 months ago

      At the executive level, no I don’t think they care or pay attention, but considering both have said “here’s how to block our crawler,” I do hope that that some mistreated developer did actually program a check in to the crawler. I still think it’s worth doing, even though I don’t fully trust them.