cross-posted from: https://lemmy.dbzer0.com/post/27579423

This is my first try at creating a map of lemmy. I based it on the overlap of commentors that visited certain communities.

I only used communities that were on the top 35 active instances for the past month and limited the comments to go back to a maximum of August 1 2024 (sometimes shorter if I got an invalid response.)

I scaled it so it was based on percentage of comments made by a commentor in that community.

Here is the code for the crawler and data that was used to make the map:

https://codeberg.org/danterious/Lemmy_map

  • DanteriousEnglish
    arrow-up
    5
    arrow-down
    1
    ·
    1 month ago
    link
    fedilink

    Yeah I’ve noticed there aren’t many clusters that encode specific ideas (there are a few like the anime, nsfw, or sometimes instance level clusters). Most of it just seems to be a blend. Sorta disappointing.

    Anti Commercial-AI license (CC BY-NC-SA 4.0)

    • AsidonhopoEnglish
      arrow-up
      1
      arrow-down
      0
      ·
      1 month ago
      link
      fedilink

      Are they clustered based on shared userbase?

    • CanadaPlusEnglish
      arrow-up
      1
      arrow-down
      0
      ·
      1 month ago
      edit-2
      1 month ago
      link
      fedilink

      There’s not enough data yet for the noise to cancel itself out, I think.

      Place and language-specific clusters are pretty coherent, if you go looking.