• MrKaplan@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    11 hours ago

    we’re currently aware of delayed federation from lemmy.ml towards lemmy.world and still working identifying the root cause - see https://lemmy.world/post/22196027 (still needs updating that it’s happening again).

    aussie.zone has been about 6 weeks behind lemmy.world for a few weeks i think at this point, which at least means they’re no longer losing activities, but it’s still taking ages to reduce the lag.

    i don’t know what issue there might be with discuss.online right now, but for startrek.website the explanation is rather simple. as you can see in the sidebar, there are 0 local subscribers for the community. when there aren’t any subscribers to a community on an instance, the instance will not receive any updates for posts in that community. this includes posts, comments, as well as votes.

    startrek.website also had federation issues over the last weeks due to accidentally blocking lemmy instances in some situations.

    lemdro.id has recently had some db performance issues that caused it to get around 3d behind lemmy.world, they’ve been slowly catching up again over the last days.

    • OpenStars@piefed.social
      link
      fedilink
      English
      arrow-up
      1
      ·
      11 hours ago

      Thanks for the info! If discuss.online was out for a few hours, would that explain the missing content, if it happened during the outage and so it just gets lost forever, or given enough time will it catch up?

      Do you know when Lemmy.World plans to update to 0.19.6 or 0.19.7? I really hope that helps bring stability! Although I can understand not wanting to do it at the same time as the sync issue with lemmy.ml still happening.

      • MrKaplan@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        10 hours ago

        downtime should not result in missing content when the sending instance is lemmy 0.19.0 or newer. 0.19.0 introduced a persistent federation queue in lemmy, which means it will retry sending the same stuff until the instance is available. depending on the type of down, it can also be possible that there is a misconfiguration (e.g. “wrong” http status code on a maintenance page) that could make the sending instance think it was successfully sent. if the sending instance was unreachable (timeout) or throwing http 5xx errors, everything should be preserved.

        we are planning to post an announcement about the current situation with lemmy updates and our future plans in the coming days, stay tuned for that. you can find some info in my comment history already if you are curious.

        • OpenStars@piefed.social
          link
          fedilink
          English
          arrow-up
          1
          ·
          10 hours ago

          Ah, then something is indeed very wrong if discuss.online is missing so much content from a week ago (I thought after something like 7 days it will simply give up and stop trying), and startrek.website is doing far worse than that even.

          Though sh.itjust.works caught up even as we were talking about it so… there’s some hope I suppose. And either way, thanks for any efforts you are doing to help with it - well, on the LW side at least:-).

          • MrKaplan@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            10 hours ago

            there is indeed a cutoff. there is exponential delay for retrying and at some point lemmy will stop trying until it sees the instance as active again.

            there is also a scheduled task running once a week that will delete local activities older than a week. downtimes of a day or two can generally be easily recovered from, depending on latency it can take a lot more time though. if an instance is down for an extended time it shouldn’t expect to still get activities from the entire time it was offline.

            • OpenStars@piefed.social
              link
              fedilink
              English
              arrow-up
              1
              ·
              edit-2
              49 minutes ago

              Although the example I mentioned earlier is only 6 days old, so if the cutoff is 7 days then that point should not have been reached yet.

              That post also says “Comments are disabled.”, so now I wonder if something special is going on with it, which either caused the comments to be disabled or perhaps the disabling of comments (for whatever reason? I didn’t see anything happening inside that looked remotely contentious, but all the posts submitted around that time seem to have had that happen to them? perhaps a new mod learning the ropes, maybe even fat-fingering buttons unintentionally?) caused federation issues specifically with that content. On the other hand, many other examples abound, even if this were not a great example on its own.

              But it is a quite striking one: the original post still has 185 upvotes, when viewed from Lemmy.World it has a handful more, but when viewed from discuss.online it is still far behind at merely 97 upvotes, and when viewed from StarTrek.website still has just the default 1 upvote… and zero comments visible underneath it. At this point if looks like if the delay lasts for one more day the situation will become permanent for that post.

              And I’ve already decided to abandon the Startrek.website instance due to constant issues like this - especially if a post takes a day or two to federate, or almost worse it does federate but then the comments cannot make it back from LW to the STW instance for me to be able to respond to them (as has happened before), then on that instance I am essentially out of the loop wrt my own postings on Lemmy, unable to respond to what I can’t even see, at least there on STW where I have a login account, although I can see it on LW where I do not.

              I really hope that the upcoming 0.19.6-7 helps make usage of smaller instances more viable as we continue this experiment in federated Lemmy.:-)