I have been using a company computer running Ubuntu 22.04. There are frequent and unexplained problems, like segmentation faults, stack errors, files disappearing, the computer freezing or not booting, or turning off immediately after I turn it on. I don’t know what to do. The IT staff came to my office to check the computer and said it was “all good.” I am not allowed to boot from a USB stick, enter the BIOS, or open the case. I ran a command-line memory check several times with no errors. There is an NVIDIA card, but it’s running X.org and usually headless. I mostly set up tasks via SSH.

What would you do?

  • TimeSquirrel
    4 months ago (edited)

    Segmentation faults and stack errors are most certainly bad memory; I’m 99% sure of it. Reboot and run memtest from GRUB if you have the option. The “stack” is the non-dynamically allocated space your program is assigned to run in. Stack errors mean some pointers somewhere are likely getting corrupted and the program is trying to access addresses beyond what it’s allowed to access.

    • LogarithmicCamel (OP)
      4 months ago

      I can’t run memtest unfortunately. The option isn’t there and I don’t have permission to boot from a USB stick.

      • Avid Amoeba
        4 months ago (edited)

        If you have root, you could theoretically add Memtest86+ to the boot order. There are tools that allow adding boot entries in EFI. You could probably place a Memtest86+ binary in your EFI partition and register it with the EFI firmware. But I’m not suggesting you do it, since you could make the machine unbootable and the problem might be on the storage path. I’m just thinking of what should be possible.

        • LogarithmicCamel (OP)
          4 months ago

          I can sudo. Last time I looked into this, Memtest86+ version 6 was required to work with UEFI but it wasn’t available for Ubuntu 22.04. Now it seems that 24.04 has it, so I might update and see if I can get the test running. Thanks for the suggestion!

          • Avid Amoeba
            4 months ago

            You can get the binary from the project’s website. Still not suggesting to f around with it.

            • Dave.
              4 months ago

              They should be able to put memtest on the boot partition and then break to an EFI shell on boot and load it manually.

              There will be a bit of swearing and googling required but it’s doable in a way that doesn’t mess with the current boot arrangement.
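
For the idea discussed in this branch of registering a Memtest86+ binary with the EFI firmware, here is a minimal sketch that only builds and prints an `efibootmgr` command; it does not run anything. The disk, partition number, and loader path are assumptions for illustration and must be checked against the actual machine. It uses `--create-only`, which creates an entry without adding it to the boot order, in the spirit of not disturbing the current boot arrangement.

```python
import shlex


def memtest_boot_entry_cmd(disk, part,
                           loader=r"\EFI\memtest\memtest.efi",
                           label="Memtest86+"):
    """Build (but do not run) an efibootmgr command for a new boot entry.

    disk/part identify the EFI system partition; loader is the path of the
    binary on that partition. All default values here are hypothetical.
    """
    return [
        "efibootmgr",
        "--create-only",      # create the entry, leave BootOrder untouched
        "--disk", disk,
        "--part", str(part),
        "--label", label,
        "--loader", loader,
    ]


if __name__ == "__main__":
    # Hypothetical device and partition; verify with lsblk before use.
    cmd = memtest_boot_entry_cmd("/dev/nvme0n1", 1)
    print(shlex.join(cmd))
```

The printed command would still need root to run, and the new entry would have to be selected explicitly for a single boot, for example via `efibootmgr --bootnext` with the entry number that `efibootmgr` reports.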

      • exu
        4 months ago

        You could download stressapptest and run that memory benchmark in the normal system.

        I’m not sure how well the current version of Memtest does, but when I was overclocking I was told not to use it as it couldn’t reliably get memory to crash. (Funny problem to have). The two recommended tools are Windows only, so I found stressapptest as the best alternative.

        • LogarithmicCamel (OP)
          4 months ago

          I did run a similar test and there were no errors detected. Thanks anyway!
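
For anyone who wants to try the stressapptest suggestion from within the running system, a rough sketch of a wrapper follows. The memory size and duration are arbitrary example values, and the check for a `Status: PASS` line is an assumption about the tool's summary output rather than a documented contract.

```python
import shutil
import subprocess


def stressapptest_cmd(megabytes, seconds, cpu_heavy=True):
    """Build a stressapptest command: -M is MB to test, -s is seconds to run."""
    cmd = ["stressapptest", "-M", str(megabytes), "-s", str(seconds)]
    if cpu_heavy:
        cmd.append("-W")  # use the more CPU-stressful memory copy
    return cmd


def passed(output):
    """True only if the output contains a 'Status: PASS' line (assumed format)."""
    return any("Status: PASS" in line for line in output.splitlines())


if __name__ == "__main__":
    if shutil.which("stressapptest") is None:
        print("stressapptest is not installed")
    else:
        # Example values only: 2 GiB for 10 minutes.
        result = subprocess.run(stressapptest_cmd(2048, 600),
                                capture_output=True, text=True)
        print(result.stdout)
        print("PASS" if passed(result.stdout) else "no PASS line found")
```

A clean run does not rule out bad memory, as the thread already shows; it only fails to reproduce it.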

  • Strit
    4 months ago

    As it’s running Ubuntu you could provide your IT department with the logs from the crashes, so they can see there is a problem.

    If they provided the Ubuntu install it’s their job to support it.

    • LogarithmicCamel (OP)
      4 months ago

      That’s a good idea. If I can get it to boot today, I will check the logs, thanks!
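
To make the logs easier to hand to IT, a small script along these lines could count the kinds of kernel messages that usually accompany hardware trouble. The patterns are heuristic guesses, not an exhaustive or authoritative list, and the file name in the usage note is hypothetical.

```python
import re
import sys

# Heuristic patterns (assumptions): adjust to what the logs actually contain.
PATTERNS = {
    "segfault": re.compile(r"segfault at", re.I),
    "machine_check": re.compile(r"machine check|\bmce:|hardware error", re.I),
    "memory_ecc": re.compile(r"\bEDAC\b", re.I),
    "storage": re.compile(r"I/O error", re.I),
    "oom": re.compile(r"out of memory", re.I),
}


def summarize(lines):
    """Count how many lines match each pattern category."""
    counts = {name: 0 for name in PATTERNS}
    for line in lines:
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


if __name__ == "__main__":
    if len(sys.argv) > 1:
        with open(sys.argv[1], errors="replace") as fh:
            for name, count in summarize(fh).items():
                print(f"{name}: {count}")
```

One way to use it: save the previous boot's kernel messages with `journalctl -k -b -1 > prev-boot.log`, then run the script with `prev-boot.log` as its argument.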

  • NegativeLookBehind
    4 months ago

    Do you have a way to reproduce the problem so IT can see it? Have you taken screenshots (or just pictures with your phone) of the problem if it’s hard to reproduce?

    • LogarithmicCamel (OP)
      4 months ago

      It’s not hard to reproduce, but it’s annoying that when they finally came here to check it, no problems happened. I had to bug them so much to even get them to have a look.

      • connaisseur
        4 months ago

        Have a smartphone ready and record a video clip when/if it happens again.

  • absGeekNZ
    4 months ago

    I was having a lot of random crashes and weird errors on my Mint install. Using the logs, I tracked it down to an SSD fault.

    I really didn’t want to send it back, since I got it from Amazon and I’m in NZ. So after a bit of checking I found that the firmware on the SSD was not the latest. I updated the firmware and went from at least 1 crash per workday to no crashes in the last 6 months.

    My SSD is a WD SN850X 4TB
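
A quick way to look at the drive's firmware version and error counters is smartmontools' JSON output. The sketch below reads a saved report; the key names are assumptions based on typical NVMe reports and may differ by drive or smartmontools version, so missing keys simply come back as `None`.

```python
import json
import sys


def nvme_health(report):
    """Pull a few fields out of a parsed `smartctl --json -a` report.

    Key names are assumed from typical NVMe output, not guaranteed.
    """
    log = report.get("nvme_smart_health_information_log", {})
    return {
        "model": report.get("model_name"),
        "firmware": report.get("firmware_version"),
        "critical_warning": log.get("critical_warning"),
        "media_errors": log.get("media_errors"),
        "error_log_entries": log.get("num_err_log_entries"),
    }


if __name__ == "__main__":
    if len(sys.argv) > 1:
        with open(sys.argv[1]) as fh:
            print(nvme_health(json.load(fh)))
```

The report could be produced with something like `sudo smartctl --json -a /dev/nvme0n1 > smart.json`, where the device path is a placeholder for the actual drive.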

    • Cyborganism
      4 months ago

      I’ve been running Linux on Nvidia graphics for 24 years. It’s never been an issue.

        • Avid Amoeba
          4 months ago

          Most machines I owned that had kernel panics had either an NVIDIA or an AMD GPU graphics adapter, along with bad memory.

          FTFY

        • prole
          4 months ago

          Aren’t those, like, the only two brands of GPU?

          • Hawke
            4 months ago

            There’s Intel as well. Probably a few other small players. Is Matrox still around?