• roguetrick
      arrow-up
      30
      arrow-down
      2
      ·
      8 months ago

      Well, they can (and will) still scrape us if they want. Just nobody’s making a buck off of it.

      • stoly
        arrow-up
        1
        arrow-down
        0
        ·
        8 months ago

        That’s going to be a lot more work, since comments and posts are decentralized here. You can probably get some of it easily, but it will be hard to get all of it.

        • roguetrick
          arrow-up
          1
          arrow-down
          0
          ·
          8 months ago

          It’s actually even easier than that. Instead of setting up a tool to make requests to the API, you can just set up a bridge that will dump everything right into your database. The wonders of federation.

          • LWD
            arrow-up
            1
            arrow-down
            0
            ·
            8 months ago

            If you can set up a Lemmy instance and apply a little elbow grease to manually follow a few instances, that’s pretty much all you need to have the data come in automatically. You’d probably need more knowledge about how to actually get the data out of the DB than the initial setup, which could be done by somebody just copying and pasting text.
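
To make the "data comes in automatically" point concrete, here is a rough sketch of the storage half of such a bridge. It assumes incoming activities look like ActivityPub `Create` activities carrying a `Note` (comment) or `Page` (post), possibly wrapped in an `Announce` relayed by a community. The table layout and the names `open_store` and `store_activity` are made up for illustration. A real bridge would also need an HTTP inbox endpoint, signature verification, and the follow handshake, none of which is shown here.

```python
import json
import sqlite3


def open_store(path=":memory:"):
    """Open (or create) a SQLite store with one hypothetical table for received items."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS items ("
        "id TEXT PRIMARY KEY, kind TEXT, author TEXT, content TEXT, raw TEXT)"
    )
    return conn


def store_activity(conn, activity):
    """Store the post or comment carried by one activity.

    Returns True if a new row was written, False if the activity was
    skipped or the item was already stored.
    """
    # Assumption: communities relay member activity wrapped in an Announce.
    if activity.get("type") == "Announce" and isinstance(activity.get("object"), dict):
        activity = activity["object"]
    if activity.get("type") != "Create":
        return False

    obj = activity.get("object")
    if not isinstance(obj, dict) or obj.get("type") not in ("Note", "Page"):
        return False
    if "id" not in obj:
        return False

    # attributedTo is usually a URL string; serialize anything else as JSON.
    author = obj.get("attributedTo")
    if author is not None and not isinstance(author, str):
        author = json.dumps(author)

    cur = conn.execute(
        "INSERT OR IGNORE INTO items VALUES (?, ?, ?, ?, ?)",
        (obj["id"], obj["type"], author, obj.get("content"), json.dumps(obj)),
    )
    conn.commit()
    return cur.rowcount == 1
```

Getting the data back out is then an ordinary query, for example `conn.execute("SELECT author, content FROM items WHERE kind = 'Note'")`. Keying on the object `id` means the same comment arriving from several instances is only stored once.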

    • The Bard in Green
      arrow-up
      13
      arrow-down
      1
      ·
      8 months ago

      The reality though is I can train LLMs off Lemmy data all I want and I don’t have to pay ANYONE a dime