Server upgrade! 4x RAM + GPU! Including CUDA+ollama!

  ___                               
 / __|  ___   _ _  __ __  ___   _ _ 
 \__ \ / -_) | '_| \ V / / -_) | '_|
 |___/ \___| |_|    \_/  \___| |_|  
                                    

                                       _         _ 
  _  _   _ __   __ _   _ _   __ _   __| |  ___  | |
 | || | | '_ \ / _` | | '_| / _` | / _` | / -_) |_|
  \_,_| | .__/ \__, | |_|   \__,_| \__,_| \___| (_)
        |_|    |___/                               

  _ _        
 | | |  __ __
 |_  _| \ \ /
   |_|  /_\_\
             

  ___     _     __  __ 
 | _ \   /_\   |  \/  |
 |   /  / _ \  | |\/| |
 |_|_\ /_/ \_\ |_|  |_|
                       

    _   
  _| |_ 
 |_   _|
   |_|  
        

   ___   ___   _   _   _ 
  / __| | _ \ | | | | | |
 | (_ | |  _/ | |_| | |_|
  \___| |_|    \___/  (_)
                         

  ___               _             _   _               
 |_ _|  _ _    __  | |  _  _   __| | (_)  _ _    __ _ 
  | |  | ' \  / _| | | | || | / _` | | | | ' \  / _` |
 |___| |_||_| \__| |_|  \_,_| \__,_| |_| |_||_| \__, |
                                                |___/ 

   ___   _   _   ___      _       _           _   _                         _ 
  / __| | | | | |   \    /_\    _| |_   ___  | | | |  __ _   _ __    __ _  | |
 | (__  | |_| | | |) |  / _ \  |_   _| / _ \ | | | | / _` | | '  \  / _` | |_|
  \___|  \___/  |___/  /_/ \_\   |_|   \___/ |_| |_| \__,_| |_|_|_| \__,_| (_)
                                                                              

╔─*──*──*──*──*──*──*──*──*──*──*──*──*──*──*──*─╗
║1   ........................................   1║
║2*  ........................................  *2║
║3   ........................................   3║
║1   ...........Posted: 2024-12-04...........   1║
║2*  Tags: sysadmin linux hardware ai debian   *2║
║3   ........................................   3║
║1   ........................................   1║
╚────────────────────────────────────────────────╝

This is both a server announcement and a guide!

I quadrupled my server memory and installed a GPU.

Quadro P620

## Lenovo ST50 "Unqualified DIMM"

## Making the most out of my Nvidia P620 (?)

### Getting the GPU working

Time to install the appropriate drivers.

```
sudo apt update
sudo apt install nvidia-driver
```

I got some warnings about driver bugs affecting certain Linux kernels, but after
checking my kernel version with `uname -r` and confirming I wasn't affected, I
just accepted and moved on. My display went blank during the install process.

After reboot, check if it's working:

```
nvidia-smi
```

I got:

```
baudrillard@simulacra ~ % nvidia-smi
Wed Dec  4 16:18:35 2024       
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.183.01             Driver Version: 535.183.01   CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  Quadro P620                    On  | 00000000:02:00.0  On |                  N/A |
| 34%   44C    P8              N/A /  N/A |    188MiB /  2048MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+
                                                                                         
+---------------------------------------------------------------------------------------+
| Processes:                                                                            |
|  GPU   GI   CI        PID   Type   Process name                            GPU Memory |
|        ID   ID                                                             Usage      |
|=======================================================================================|
|    0   N/A  N/A     10606      G   /usr/lib/xorg/Xorg                          185MiB |
+---------------------------------------------------------------------------------------+
```
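If you'd rather have those numbers in a script-friendly form than read the big
table, `nvidia-smi` can print selected fields as CSV. This is a minimal sketch,
not something from my original setup: `vram_report` is a made-up helper name,
and the sample line in the comment is what I'd expect given the table above.

```shell
#!/bin/sh
# vram_report (hypothetical helper): reads "name, used, total" CSV lines
# on stdin and prints VRAM usage as a percentage.
vram_report() {
    awk -F', ' '{ printf "%s: %d/%d MiB (%.1f%%)\n", $1, $2, $3, 100 * $2 / $3 }'
}

# Only query the card if nvidia-smi is actually installed.
if command -v nvidia-smi >/dev/null 2>&1; then
    # With csv,noheader,nounits a line should look like: Quadro P620, 188, 2048
    nvidia-smi --query-gpu=name,memory.used,memory.total \
               --format=csv,noheader,nounits | vram_report
fi
```

With the values from the table above (188 MiB of 2048 MiB) this would report
9.2% in use.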

You can also check OpenGL if you're using an actual display (it may work over
xrdp, too):

```
sudo apt update
sudo apt install mesa-utils
glxinfo | grep OpenGL
```
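The line to look at is the renderer string. If it names a software renderer
such as `llvmpipe`, OpenGL is not running on the card. A rough sketch of that
check, assuming `glxinfo` prints an `OpenGL renderer string` line;
`classify_renderer` is a hypothetical helper name:

```shell
#!/bin/sh
# classify_renderer (hypothetical helper): guess whether a renderer string
# points at a software rasterizer or at real hardware.
classify_renderer() {
    case "$1" in
        "")                    echo "unknown"  ;;  # glxinfo gave us nothing
        *llvmpipe*|*softpipe*) echo "software" ;;  # Mesa CPU renderers
        *)                     echo "hardware" ;;
    esac
}

if command -v glxinfo >/dev/null 2>&1; then
    renderer=$(glxinfo 2>/dev/null | grep 'OpenGL renderer string')
    echo "$renderer -> $(classify_renderer "$renderer")"
fi
```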

#### Caveats

##### xrdp

After I installed the GPU I noticed xrdp stopped working!

IDK why, but after a reboot or so it started working again, so just try
connecting a few times, restarting, etc.

### Testing `ollama`

I checked these pages to make sure my GPU is supported:

* https://developer.nvidia.com/cuda-gpus
* https://github.com/ollama/ollama/blob/main/docs/gpu.md

The P620 is listed on both pages, so I'm pretty sure it's supported!

I installed the `nvidia-cuda-toolkit`:

```
sudo apt install nvidia-cuda-toolkit
```

Verify CUDA installation:

```
nvcc --version
```

The install did prompt me about `gcc-11` having a serious bug and that it should
not be shipped in `trixie`. I just accepted the risk.
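One thing worth knowing: the "CUDA Version" in the `nvidia-smi` header is the
newest CUDA the driver supports, while `nvcc --version` reports the toolkit
that is actually installed, so the two can differ. Here's a small sketch for
pulling out just the toolkit release. `nvcc_release` is a made-up helper, and
the `11.8` in the comment is only an illustrative value, not my output.

```shell
#!/bin/sh
# nvcc_release (hypothetical helper): extract the release number from
# `nvcc --version` output read on stdin. The relevant line looks like:
#   Cuda compilation tools, release 11.8, V11.8.89   (illustrative values)
nvcc_release() {
    sed -n 's/.*release \([0-9][0-9.]*\),.*/\1/p'
}

if command -v nvcc >/dev/null 2>&1; then
    echo "CUDA toolkit release: $(nvcc --version | nvcc_release)"
fi
```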

I installed `nvtop` to see whether `ollama` was using my GPU. With `nvtop`
running, I ran `ollama run llama3` and asked it to "write a story that's
mathematically complex", and I saw my GPU usage shoot up. I could also see it
by running `nvidia-smi`; you could do something like `watch -n 0.5 nvidia-smi`
as well.
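If eyeballing a monitor isn't enough, you can sample the utilization for a
while and print the peak. This is a sketch under a few assumptions: the
30-second window is arbitrary, `peak_util` is a hypothetical helper, and you
start it in one terminal and send the prompt to `ollama` from another.

```shell
#!/bin/sh
# peak_util (hypothetical helper): read one utilization percentage per line
# on stdin and print the highest value seen.
peak_util() {
    awk '$1 + 0 > max { max = $1 + 0 } END { print max + 0 }'
}

if command -v nvidia-smi >/dev/null 2>&1; then
    # -l 1 resamples every second; timeout ends the sampling after 30 s.
    timeout 30 nvidia-smi --query-gpu=utilization.gpu \
                          --format=csv,noheader,nounits -l 1 | peak_util
fi
```

A peak well above the idle reading while the model is generating suggests the
GPU is doing at least part of the work.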

All looks pretty good to me!

## Final notes

Darn, that fan sure is annoying on that little low-profile P620.