• 0_o7@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      5
      ·
      1 month ago

      Or they haven’t been caught yet.

      The article explains PerplexityBot respects robots.txt, but then sends a different request with a different IP and different user-agent. They could very well be using a different method to walk around it.

  • Kay Ohtie@pawb.social
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 month ago

    Perplexity’s firing back assumes website owners distinguish between automated scraping and on-demand scraping.

    I don’t think most people make that distinction.

    And that falls in line perfectly with the typical “assumption of access” all of these “AI” companies make.