Kimsufi/Soyoustart/OVH Rise New Price


Comments

  • Xedon Member

    @JerryHou said:

    @Xedon said:
    Does anyone happen to have a normal KS Mystery server which is not needed?
    I'm looking for one with an E5 CPU, 64 or 128GB RAM and 2x 450GB NVMe. Preferred location is LIM, but SBG or GRA would work too.

    https://lowendtalk.com/discussion/204301/multiple-ovh-servers-for-transfer-trade#latest

    Another post selling as well.

    I just have an EU Account

  • @Xedon said:

    @JerryHou said:

    @Xedon said:
    Does anyone happen to have a normal KS Mystery server which is not needed?
    I'm looking for one with an E5 CPU, 64 or 128GB RAM and 2x 450GB NVMe. Preferred location is LIM, but SBG or GRA would work too.

    https://lowendtalk.com/discussion/204301/multiple-ovh-servers-for-transfer-trade#latest

    Another post selling as well.

    I just have an EU Account

    ie is EU

    Thanked by 2Xedon vr10
  • wdmg Member, LIR
    edited April 2025

    Looking for a KS-MYSTERY with 2x 4TB HDD or similar in SBG or GRA. The underlying specs don’t matter much to me, only the storage does. It must be transferable to a CA account.

  • Xedon Member

    @surfcu said:

    @Xedon said:

    @JerryHou said:

    @Xedon said:
    Does anyone happen to have a normal KS Mystery server which is not needed?
    I'm looking for one with an E5 CPU, 64 or 128GB RAM and 2x 450GB NVMe. Preferred location is LIM, but SBG or GRA would work too.

    https://lowendtalk.com/discussion/204301/multiple-ovh-servers-for-transfer-trade#latest

    Another post selling as well.

    I just have an EU Account

    ie is EU

    Oh, I didn’t know that. I thought ie was like ca
    Thanks!

  • Also looking for a KS-MYSTERY with 2x 450GB (or larger) SSD and 2x 4TB in Europe that can be transferred to a CA account.

  • loay Member
    edited April 2025

    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    debian@ns:~/llama.cpp$ llama-bench -m models/gemma-3-12b-it-q4_0_s.gguf -n 0 -n 16 -p 64 -t 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32
    | model                          |       size |     params | backend    | threads |          test |                  t/s |
    | ------------------------------ | ---------: | ---------: | ---------- | ------: | ------------: | -------------------: |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       1 |          pp64 |          1.30 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       1 |          tg16 |          0.86 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       2 |          pp64 |          2.58 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       2 |          tg16 |          1.72 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       3 |          pp64 |          3.81 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       3 |          tg16 |          2.49 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       4 |          pp64 |          5.02 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       4 |          tg16 |          3.15 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       5 |          pp64 |          6.24 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       5 |          tg16 |          3.94 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       6 |          pp64 |          7.44 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       6 |          tg16 |          4.53 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       7 |          pp64 |          8.49 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       7 |          tg16 |          4.92 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       8 |          pp64 |          8.89 ± 0.56 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       8 |          tg16 |          5.43 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       9 |          pp64 |          9.14 ± 0.25 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |       9 |          tg16 |          5.63 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      10 |          pp64 |          8.64 ± 0.30 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      10 |          tg16 |          6.18 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      11 |          pp64 |          9.47 ± 0.57 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      11 |          tg16 |          6.33 ± 0.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      12 |          pp64 |          9.90 ± 1.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      12 |          tg16 |          6.70 ± 0.11 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      13 |          pp64 |         10.59 ± 0.89 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      13 |          tg16 |          7.00 ± 0.12 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      14 |          pp64 |         10.50 ± 0.80 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      14 |          tg16 |          7.20 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      15 |          pp64 |         10.07 ± 1.27 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      15 |          tg16 |          7.44 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      16 |          pp64 |          9.87 ± 0.32 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      16 |          tg16 |          6.06 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      17 |          pp64 |          8.97 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      17 |          tg16 |          6.15 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      18 |          pp64 |          8.77 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      18 |          tg16 |          6.23 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      19 |          pp64 |          8.78 ± 0.11 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      19 |          tg16 |          6.24 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      20 |          pp64 |          8.66 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      20 |          tg16 |          6.74 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      21 |          pp64 |          8.36 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      21 |          tg16 |          6.80 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      22 |          pp64 |          8.09 ± 0.21 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      22 |          tg16 |          6.90 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      23 |          pp64 |          8.04 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      23 |          tg16 |          6.80 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      24 |          pp64 |          8.11 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      24 |          tg16 |          6.92 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      25 |          pp64 |          7.86 ± 0.13 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      25 |          tg16 |          7.16 ± 0.11 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      26 |          pp64 |          7.79 ± 0.20 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      26 |          tg16 |          7.12 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      27 |          pp64 |          7.82 ± 0.19 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      27 |          tg16 |          7.22 ± 0.22 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      28 |          pp64 |          7.60 ± 0.25 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      28 |          tg16 |          7.41 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      29 |          pp64 |          7.53 ± 0.14 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      29 |          tg16 |          6.53 ± 0.84 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      30 |          pp64 |          7.04 ± 0.18 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      30 |          tg16 |          7.37 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      31 |          pp64 |          6.95 ± 0.20 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      31 |          tg16 |          6.27 ± 0.82 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      32 |          pp64 |          5.97 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | BLAS       |      32 |          tg16 |          0.56 ± 0.00 |
    
    build: 916c83bf (5061)
    
    Thanked by 3ariq01 JerryHou sumo
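
For anyone who wants to pull the sweet spot out of a table like the one above without eyeballing 64 rows, here is a minimal Python sketch. It assumes the llama-bench output was pasted as text in the same pipe-separated layout; the names `ROW`, `best_threads` and `SAMPLE` are made up for this example and are not part of llama.cpp. The sample rows are copied from the EPYC 7351P results above.

```python
import re

# One result row: "... | <threads> | <test> | <mean t/s> ± <stddev> |".
# The pattern keys on the threads/test/t-s columns only, so it also copes with
# tables that carry an extra "ngl" column before "threads".
ROW = re.compile(r"\|\s*(\d+)\s*\|\s*((?:pp|tg)\d+)\s*\|\s*([\d.]+)\s*±")


def best_threads(table_text):
    """Return {test: (threads, tokens_per_second)} for the fastest row of each test."""
    best = {}
    for line in table_text.splitlines():
        match = ROW.search(line)
        if not match:
            continue  # header, separator, prompt or "build:" line
        threads = int(match.group(1))
        test = match.group(2)
        tps = float(match.group(3))
        if test not in best or tps > best[test][1]:
            best[test] = (threads, tps)
    return best


# A handful of rows copied from the EPYC 7351P table in this post.
SAMPLE = """\
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | BLAS | 13 | pp64 | 10.59 ± 0.89 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | BLAS | 13 | tg16 |  7.00 ± 0.12 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | BLAS | 15 | pp64 | 10.07 ± 1.27 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | BLAS | 15 | tg16 |  7.44 ± 0.10 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | BLAS | 32 | pp64 |  5.97 ± 0.10 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | BLAS | 32 | tg16 |  0.56 ± 0.00 |
"""

if __name__ == "__main__":
    for test, (threads, tps) in best_threads(SAMPLE).items():
        print(f"{test}: best at {threads} threads ({tps} t/s)")
```

On the full table the same function picks 13 threads for pp64 (10.59 t/s) and 15 threads for tg16 (7.44 t/s), which matches what the rows show.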
  • Did anyone get a Xeon E-2288G with 128GB in GRA? It seems the best upgrade in GRA was a Xeon with 64GB, or am I wrong?

  • wuck Member
    edited April 2025

    @mrinternational said:
    Did anyone get a Xeon E-2288G with 128GB in GRA? It seems the best upgrade in GRA was a Xeon with 64GB, or am I wrong?

    https://lowendtalk.com/discussion/comment/4382266#Comment_4382266

    Here is one. The best lottery result with an E-2288G that we had was 64GB RAM, 2x 960GB NVMe + 2x 6TB.

    Thanked by 1vr10
  • Neoon Community Contributor, Veteran

    @loay said:
    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    Xeon G

    bench@rbx:~/build/bin$ ./llama-bench -m ../../.cache/llama.cpp/stduhpf_google-gemma-3-12b-it-qat-q4_0-gguf-small_gemma-3-12b-it-q4_0_s.gguf -n 0 -n 16 -p 64 -t 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32
    | model                          |       size |     params | backend    | ngl | threads |          test |                  t/s |
    | ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | ------------: | -------------------: |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          pp64 |          4.35 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          tg16 |          1.77 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          pp64 |          8.56 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          tg16 |          3.36 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          pp64 |         12.53 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          tg16 |          4.07 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          pp64 |         16.21 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          tg16 |          4.34 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          pp64 |         19.09 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          tg16 |          4.31 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          pp64 |         21.73 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          tg16 |          4.45 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          pp64 |         23.41 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          tg16 |          4.45 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          pp64 |         24.85 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          tg16 |          4.48 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          pp64 |         17.53 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          tg16 |          4.32 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          pp64 |         19.03 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          tg16 |          4.47 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          pp64 |         20.63 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          tg16 |          4.53 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          pp64 |         22.04 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          tg16 |          4.54 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          pp64 |         23.65 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          tg16 |          4.53 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          pp64 |         25.06 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          tg16 |          4.52 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          pp64 |         26.42 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          tg16 |          4.52 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          pp64 |         28.15 ± 0.16 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          tg16 |          4.49 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          pp64 |         20.48 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          tg16 |          3.47 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          pp64 |         20.81 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          tg16 |          3.52 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          pp64 |         21.38 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          tg16 |          3.55 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          pp64 |         21.60 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          tg16 |          3.60 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          pp64 |         21.96 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          tg16 |          3.68 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          pp64 |         22.37 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          tg16 |          3.70 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          pp64 |         22.55 ± 0.11 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          tg16 |          3.66 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          pp64 |         22.65 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          tg16 |          3.71 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          pp64 |         22.53 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          tg16 |          3.72 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          pp64 |         22.82 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          tg16 |          3.72 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          pp64 |         23.10 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          tg16 |          3.71 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          pp64 |         23.29 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          tg16 |          3.69 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          pp64 |         23.56 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          tg16 |          3.65 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          pp64 |         23.64 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          tg16 |          3.61 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          pp64 |         23.69 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          tg16 |          3.55 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          pp64 |         23.74 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          tg16 |          3.51 ± 0.01 |
    
    build: d0d5b223 (5062)
    
    Thanked by 1loay
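
A rough way to read the thread sweep above is scaling efficiency: throughput at N threads divided by N times the single-thread throughput, so 1.0 would mean perfectly linear scaling. The sketch below is only an illustration of that arithmetic; `scaling_efficiency` is a name invented here, and the inputs are pp64 figures copied from the Xeon G table.

```python
def scaling_efficiency(tps_one_thread, tps_n_threads, n_threads):
    """Fraction of ideal linear speedup reached at n_threads (1.0 = perfect scaling)."""
    return tps_n_threads / (n_threads * tps_one_thread)


# pp64 rows from the Xeon G table: 1 thread = 4.35 t/s, 8 = 24.85 t/s, 16 = 28.15 t/s.
eff_8 = scaling_efficiency(4.35, 24.85, 8)
eff_16 = scaling_efficiency(4.35, 28.15, 16)

if __name__ == "__main__":
    print(f"8 threads: {eff_8:.2f}, 16 threads: {eff_16:.2f}")
    # prints "8 threads: 0.71, 16 threads: 0.40"
```

So doubling from 8 to 16 threads only lifts pp64 from 24.85 to 28.15 t/s on that box, and tg16 stays flat at roughly 4.5 t/s across the same range.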
  • Neoon Community Contributor, Veteran
    edited April 2025

    @loay said:
    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    KS-LE-B

    bench@gra:~/build/bin$ ./llama-bench -m ../../.cache/llama.cpp/stduhpf_google-gemma-3-12b-it-qat-q4_0-gguf-small_gemma-3-12b-it-q4_0_s.gguf -n 0 -n 16 -p 64 -t 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32
    | model                          |       size |     params | backend    | ngl | threads |          test |                  t/s |
    | ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | ------------: | -------------------: |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          pp64 |          3.54 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          tg16 |          1.52 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          pp64 |          7.06 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          tg16 |          2.89 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          pp64 |         10.12 ± 0.29 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          tg16 |          3.57 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          pp64 |         12.91 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          tg16 |          3.81 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          pp64 |         10.07 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          tg16 |          3.63 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          pp64 |         11.99 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          tg16 |          3.85 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          pp64 |         13.85 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          tg16 |          3.89 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          pp64 |         13.56 ± 0.41 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          tg16 |          3.25 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          pp64 |         12.48 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          tg16 |          3.02 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          pp64 |         12.75 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          tg16 |          3.17 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          pp64 |         13.35 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          tg16 |          3.23 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          pp64 |         13.88 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          tg16 |          3.26 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          pp64 |         13.62 ± 0.19 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          tg16 |          3.26 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          pp64 |         14.06 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          tg16 |          3.28 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          pp64 |         14.10 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          tg16 |          3.21 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          pp64 |         14.40 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          tg16 |          3.15 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          pp64 |         13.60 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          tg16 |          2.98 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          pp64 |         13.76 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          tg16 |          3.00 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          pp64 |         13.92 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          tg16 |          3.00 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          pp64 |         14.03 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          tg16 |          2.99 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          pp64 |         13.90 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          tg16 |          2.95 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          pp64 |         13.92 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          tg16 |          2.91 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          pp64 |         14.02 ± 0.13 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          tg16 |          2.87 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          pp64 |         14.03 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          tg16 |          2.81 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          pp64 |         13.68 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          tg16 |          2.74 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          pp64 |         13.78 ± 0.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          tg16 |          2.73 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          pp64 |         13.76 ± 0.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          tg16 |          2.71 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          pp64 |         13.68 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          tg16 |          2.67 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          pp64 |         13.70 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          tg16 |          2.61 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          pp64 |         13.82 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          tg16 |          2.64 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          pp64 |         13.92 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          tg16 |          2.62 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          pp64 |         13.99 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          tg16 |          2.56 ± 0.01 |
    
    build: d0d5b223 (5062)
    
    Thanked by 1loay
  • loay Member
    edited April 2025

    @Neoon said:

    @loay said:
    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    Xeon G

    bench@rbx:~/build/bin$ ./llama-bench -m ../../.cache/llama.cpp/stduhpf_google-gemma-3-12b-it-qat-q4_0-gguf-small_gemma-3-12b-it-q4_0_s.gguf -n 0 -n 16 -p 64 -t 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32
    | model                          |       size |     params | backend    | ngl | threads |          test |                  t/s |
    | ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | ------------: | -------------------: |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          pp64 |          4.35 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          tg16 |          1.77 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          pp64 |          8.56 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          tg16 |          3.36 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          pp64 |         12.53 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          tg16 |          4.07 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          pp64 |         16.21 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          tg16 |          4.34 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          pp64 |         19.09 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          tg16 |          4.31 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          pp64 |         21.73 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          tg16 |          4.45 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          pp64 |         23.41 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          tg16 |          4.45 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          pp64 |         24.85 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          tg16 |          4.48 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          pp64 |         17.53 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          tg16 |          4.32 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          pp64 |         19.03 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          tg16 |          4.47 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          pp64 |         20.63 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          tg16 |          4.53 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          pp64 |         22.04 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          tg16 |          4.54 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          pp64 |         23.65 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          tg16 |          4.53 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          pp64 |         25.06 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          tg16 |          4.52 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          pp64 |         26.42 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          tg16 |          4.52 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          pp64 |         28.15 ± 0.16 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          tg16 |          4.49 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          pp64 |         20.48 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          tg16 |          3.47 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          pp64 |         20.81 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          tg16 |          3.52 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          pp64 |         21.38 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          tg16 |          3.55 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          pp64 |         21.60 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          tg16 |          3.60 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          pp64 |         21.96 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          tg16 |          3.68 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          pp64 |         22.37 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          tg16 |          3.70 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          pp64 |         22.55 ± 0.11 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          tg16 |          3.66 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          pp64 |         22.65 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          tg16 |          3.71 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          pp64 |         22.53 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          tg16 |          3.72 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          pp64 |         22.82 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          tg16 |          3.72 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          pp64 |         23.10 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          tg16 |          3.71 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          pp64 |         23.29 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          tg16 |          3.69 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          pp64 |         23.56 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          tg16 |          3.65 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          pp64 |         23.64 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          tg16 |          3.61 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          pp64 |         23.69 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          tg16 |          3.55 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          pp64 |         23.74 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          tg16 |          3.51 ± 0.01 |
    
    build: d0d5b223 (5062)
    

    Thanks for running this! I’ve been curious how they would perform on a bench like this, especially given the differences in single-core performance.
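To put a number on how far the multi-thread results sit from the single-thread baseline, here is a minimal sketch. The pp64 figures are copied from the Xeon G table quoted above; the names `PP64` and `scaling` are made up for illustration and are not part of llama-bench.

```python
# pp64 tokens/s by thread count, copied from the Xeon G table quoted above.
PP64 = {1: 4.35, 2: 8.56, 4: 16.21, 8: 24.85, 16: 28.15, 32: 23.74}


def scaling(tps_by_threads):
    """Return {threads: (speedup, efficiency)} relative to the 1-thread result.

    speedup    = tokens/s at N threads divided by tokens/s at 1 thread
    efficiency = speedup divided by N (1.0 would be perfect linear scaling)
    """
    base = tps_by_threads[1]
    return {
        n: (tps / base, tps / base / n)
        for n, tps in sorted(tps_by_threads.items())
    }


if __name__ == "__main__":
    for n, (speedup, eff) in scaling(PP64).items():
        print(f"{n:>2} threads: {speedup:.2f}x speedup, {eff:.0%} efficiency")
```

Swapping in the tg16 column, or the figures from another box, gives the same comparison for token generation or for a different CPU.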

  • Neoon Community Contributor, Veteran
    edited April 2025

    @loay said:
    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    MYSTERY E5: up to about 3 tokens/s better than the KS-LE-B, but mostly on par.
    So I'd rather keep the KS-LE-B for LLMs.

    debian@machina:~/build/bin$ ./llama-bench -m ../../.cache/llama.cpp/stduhpf_google-gemma-3-12b-it-qat-q4_0-gguf-small_gemma-3-12b-it-q4_0_s.gguf -n 0 -n 16 -p 64 -t 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32
    | model                          |       size |     params | backend    | ngl | threads |          test |                  t/s |
    | ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | ------------: | -------------------: |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          pp64 |          2.85 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          tg16 |          1.30 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          pp64 |          5.63 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          tg16 |          2.31 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          pp64 |          8.40 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          tg16 |          3.46 ± 0.13 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          pp64 |         11.12 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          tg16 |          4.66 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          pp64 |         13.87 ± 0.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          tg16 |          5.41 ± 0.28 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          pp64 |         16.60 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          tg16 |          6.33 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          pp64 |         12.26 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          tg16 |          5.27 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          pp64 |         13.05 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          tg16 |          5.78 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          pp64 |         14.12 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          tg16 |          6.16 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          pp64 |         15.60 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          tg16 |          6.36 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          pp64 |         16.41 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          tg16 |          6.45 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          pp64 |         18.64 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          tg16 |          6.37 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          pp64 |         13.52 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          tg16 |          4.23 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          pp64 |         13.98 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          tg16 |          4.43 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          pp64 |         14.39 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          tg16 |          4.55 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          pp64 |         14.93 ± 0.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          tg16 |          4.59 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          pp64 |         15.30 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          tg16 |          4.58 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          pp64 |         15.25 ± 0.16 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          tg16 |          4.56 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          pp64 |         14.82 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          tg16 |          4.58 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          pp64 |         15.24 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          tg16 |          4.59 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          pp64 |         15.67 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          tg16 |          4.58 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          pp64 |         16.27 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          tg16 |          4.56 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          pp64 |         16.74 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          tg16 |          4.50 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          pp64 |         16.95 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          tg16 |          4.41 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          pp64 |         14.97 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          tg16 |          4.34 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          pp64 |         15.24 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          tg16 |          4.33 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          pp64 |         15.59 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          tg16 |          4.33 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          pp64 |         15.93 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          tg16 |          4.31 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          pp64 |         16.03 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          tg16 |          4.29 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          pp64 |         16.19 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          tg16 |          4.25 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          pp64 |         15.68 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          tg16 |          4.26 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          pp64 |         15.89 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          tg16 |          4.21 ± 0.01 |
    
    build: d0d5b223 (5062)
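
For anyone who wants to pull the best thread count out of a table like this without eyeballing 64 rows, here is a rough sketch. It assumes the llama-bench markdown output has been pasted into a string; `ROW`, `parse_rows`, `best_per_test` and `SAMPLE` are hypothetical names, and the sample only holds a few rows copied from the MYSTERY E5 table above.

```python
import re

# Matches the last three columns of a llama-bench markdown row:
# | threads | test (pp64 / tg16) | t/s ± stddev |
ROW = re.compile(
    r"\|\s*(?P<threads>\d+)\s*\|\s*(?P<test>(?:pp|tg)\d+)\s*"
    r"\|\s*(?P<tps>[\d.]+)\s*±\s*[\d.]+\s*\|\s*$"
)


def parse_rows(text):
    """Return (threads, test, tokens_per_second) for every data row in text."""
    rows = []
    for line in text.splitlines():
        m = ROW.search(line)
        if m:  # header, separator and "build:" lines simply do not match
            rows.append((int(m["threads"]), m["test"], float(m["tps"])))
    return rows


def best_per_test(rows):
    """Return {test: (threads, tokens_per_second)} for the fastest row of each test."""
    best = {}
    for threads, test, tps in rows:
        if test not in best or tps > best[test][1]:
            best[test] = (threads, tps)
    return best


# A few rows copied from the MYSTERY E5 table above (not the full run).
SAMPLE = """
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | RPC | 99 |  6 | pp64 | 16.60 ± 0.03 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | RPC | 99 |  6 | tg16 |  6.33 ± 0.04 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | RPC | 99 | 11 | pp64 | 16.41 ± 0.09 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | RPC | 99 | 11 | tg16 |  6.45 ± 0.02 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | RPC | 99 | 12 | pp64 | 18.64 ± 0.05 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | RPC | 99 | 12 | tg16 |  6.37 ± 0.02 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | RPC | 99 | 13 | pp64 | 13.52 ± 0.03 |
| gemma3 12B Q4_0 | 6.41 GiB | 11.77 B | RPC | 99 | 13 | tg16 |  4.23 ± 0.01 |
"""

if __name__ == "__main__":
    print(best_per_test(parse_rows(SAMPLE)))
```

Feeding each machine's full output through `parse_rows` would make it easy to compare peak pp64 and tg16 side by side.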
    
  • Neoon Community Contributor, Veteran

    @loay said:

    @Neoon said:

    @loay said:
    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    Xeon G


    Thanks for running this! I’ve been curious how they would perform on a bench like this, especially given the differences in single-core performance.

    No problem.

  • @Neoon said:

    @loay said:
    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    KS-LE-B

    bench@gra:~/build/bin$ ./llama-bench -m ../../.cache/llama.cpp/stduhpf_google-gemma-3-12b-it-qat-q4_0-gguf-small_gemma-3-12b-it-q4_0_s.gguf -n 0 -n 16 -p 64 -t 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32
    | model                          |       size |     params | backend    | ngl | threads |          test |                  t/s |
    | ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | ------------: | -------------------: |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          pp64 |          3.54 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          tg16 |          1.52 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          pp64 |          7.06 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          tg16 |          2.89 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          pp64 |         10.12 ± 0.29 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          tg16 |          3.57 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          pp64 |         12.91 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          tg16 |          3.81 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          pp64 |         10.07 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          tg16 |          3.63 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          pp64 |         11.99 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          tg16 |          3.85 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          pp64 |         13.85 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          tg16 |          3.89 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          pp64 |         13.56 ± 0.41 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          tg16 |          3.25 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          pp64 |         12.48 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          tg16 |          3.02 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          pp64 |         12.75 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          tg16 |          3.17 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          pp64 |         13.35 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          tg16 |          3.23 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          pp64 |         13.88 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          tg16 |          3.26 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          pp64 |         13.62 ± 0.19 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          tg16 |          3.26 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          pp64 |         14.06 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          tg16 |          3.28 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          pp64 |         14.10 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          tg16 |          3.21 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          pp64 |         14.40 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          tg16 |          3.15 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          pp64 |         13.60 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          tg16 |          2.98 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          pp64 |         13.76 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          tg16 |          3.00 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          pp64 |         13.92 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          tg16 |          3.00 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          pp64 |         14.03 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          tg16 |          2.99 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          pp64 |         13.90 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          tg16 |          2.95 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          pp64 |         13.92 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          tg16 |          2.91 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          pp64 |         14.02 ± 0.13 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          tg16 |          2.87 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          pp64 |         14.03 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          tg16 |          2.81 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          pp64 |         13.68 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          tg16 |          2.74 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          pp64 |         13.78 ± 0.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          tg16 |          2.73 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          pp64 |         13.76 ± 0.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          tg16 |          2.71 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          pp64 |         13.68 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          tg16 |          2.67 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          pp64 |         13.70 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          tg16 |          2.61 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          pp64 |         13.82 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          tg16 |          2.64 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          pp64 |         13.92 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          tg16 |          2.62 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          pp64 |         13.99 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          tg16 |          2.56 ± 0.01 |
    
    build: d0d5b223 (5062)
    

    What config do you have on your KS-LE-B?

  • Neoon Community Contributor, Veteran

    @strictlyparmesan said:

    @Neoon said:

    @loay said:
    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    KS-LE-B

    bench@gra:~/build/bin$ ./llama-bench -m ../../.cache/llama.cpp/stduhpf_google-gemma-3-12b-it-qat-q4_0-gguf-small_gemma-3-12b-it-q4_0_s.gguf -n 0 -n 16 -p 64 -t 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32
    | model                          |       size |     params | backend    | ngl | threads |          test |                  t/s |
    | ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | ------------: | -------------------: |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          pp64 |          3.54 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       1 |          tg16 |          1.52 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          pp64 |          7.06 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       2 |          tg16 |          2.89 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          pp64 |         10.12 ± 0.29 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       3 |          tg16 |          3.57 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          pp64 |         12.91 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          tg16 |          3.81 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          pp64 |         10.07 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       5 |          tg16 |          3.63 ± 0.00 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          pp64 |         11.99 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       6 |          tg16 |          3.85 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          pp64 |         13.85 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          tg16 |          3.89 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          pp64 |         13.56 ± 0.41 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          tg16 |          3.25 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          pp64 |         12.48 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       9 |          tg16 |          3.02 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          pp64 |         12.75 ± 0.10 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      10 |          tg16 |          3.17 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          pp64 |         13.35 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      11 |          tg16 |          3.23 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          pp64 |         13.88 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      12 |          tg16 |          3.26 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          pp64 |         13.62 ± 0.19 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      13 |          tg16 |          3.26 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          pp64 |         14.06 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      14 |          tg16 |          3.28 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          pp64 |         14.10 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      15 |          tg16 |          3.21 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          pp64 |         14.40 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          tg16 |          3.15 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          pp64 |         13.60 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      17 |          tg16 |          2.98 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          pp64 |         13.76 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      18 |          tg16 |          3.00 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          pp64 |         13.92 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      19 |          tg16 |          3.00 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          pp64 |         14.03 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      20 |          tg16 |          2.99 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          pp64 |         13.90 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      21 |          tg16 |          2.95 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          pp64 |         13.92 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      22 |          tg16 |          2.91 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          pp64 |         14.02 ± 0.13 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      23 |          tg16 |          2.87 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          pp64 |         14.03 ± 0.09 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      24 |          tg16 |          2.81 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          pp64 |         13.68 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      25 |          tg16 |          2.74 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          pp64 |         13.78 ± 0.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      26 |          tg16 |          2.73 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          pp64 |         13.76 ± 0.08 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      27 |          tg16 |          2.71 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          pp64 |         13.68 ± 0.06 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      28 |          tg16 |          2.67 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          pp64 |         13.70 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      29 |          tg16 |          2.61 ± 0.03 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          pp64 |         13.82 ± 0.04 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      30 |          tg16 |          2.64 ± 0.01 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          pp64 |         13.92 ± 0.05 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      31 |          tg16 |          2.62 ± 0.02 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          pp64 |         13.99 ± 0.07 |
    | gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          tg16 |          2.56 ± 0.01 |
    
    build: d0d5b223 (5062)
    

    What config do you have on your KS-LE-B?

    I benched on the E3-1270 v6 @ 3.80 GHz with 64 GB of RAM.
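
    For anyone who wants to pull the sweet spot out of a thread sweep like the one quoted above, here is a minimal sketch. It assumes the table keeps the column order llama-bench printed here (threads in the sixth column, test in the seventh, t/s in the eighth); `best_threads` and `SAMPLE` are names made up for this example, and the sample rows are a subset copied from the table.

```python
# Rows copied from the KS-LE-B table quoted above (a subset, to keep this short).
SAMPLE = """\
| model                          |       size |     params | backend    | ngl | threads |          test |                  t/s |
| ------------------------------ | ---------: | ---------: | ---------- | --: | ------: | ------------: | -------------------: |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          pp64 |         12.91 ± 0.07 |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       4 |          tg16 |          3.81 ± 0.01 |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          pp64 |         13.85 ± 0.06 |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       7 |          tg16 |          3.89 ± 0.01 |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          pp64 |         13.56 ± 0.41 |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |       8 |          tg16 |          3.25 ± 0.06 |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          pp64 |         14.40 ± 0.02 |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      16 |          tg16 |          3.15 ± 0.03 |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          pp64 |         13.99 ± 0.07 |
| gemma3 12B Q4_0                |   6.41 GiB |    11.77 B | RPC        |  99 |      32 |          tg16 |          2.56 ± 0.01 |
"""


def best_threads(table: str) -> dict:
    """Return {test: (threads, tokens_per_second)} for the fastest row of each test."""
    best = {}
    for line in table.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Skip the header, the separator row, and anything that is not a data row.
        if len(cells) != 8 or not cells[5].isdigit():
            continue
        threads = int(cells[5])
        test = cells[6]
        rate = float(cells[7].split("±")[0])  # drop the "± stddev" part
        if test not in best or rate > best[test][1]:
            best[test] = (threads, rate)
    return best


for test, (threads, rate) in sorted(best_threads(SAMPLE).items()):
    print(f"{test}: best at {threads} threads ({rate:.2f} t/s)")
```

    On the full table the answer is the same as on this subset: prompt processing (pp64) peaks at 16 threads with 14.40 t/s, while token generation (tg16) peaks at 7 threads with 3.89 t/s and drops off from there.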

  • @Neoon said:

    @strictlyparmesan said:

    @Neoon said:

    @loay said:
    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    KS-LE-B

    What config do you have on your KS-LE-B?

    I benched on the E3-1270 v6 @ 3.80 GHz with 64 GB of RAM.

    This config will struggle with LLMs right?

    Intel Xeon E3-1245v5 - 4c/8t - 3.5 GHz/3.9 GHz - 32GB

  • Neoon Community Contributor, Veteran

    @strictlyparmesan said:

    @Neoon said:

    @strictlyparmesan said:

    @Neoon said:

    @loay said:
    llama-bench on AMD EPYC 7351P for Gemma3 12b qat version here

    KS-LE-B

    What config do you have on your KS-LE-B?

    I benched on the E3-1270 v6 @ 3.80 GHz with 64 GB of RAM.

    This config will struggle with LLMs right?

    Intel Xeon E3-1245v5 - 4c/8t - 3.5 GHz/3.9 GHz - 32GB

    No idea, I guess it depends on the model, its size, and the quantization, if any.
    I tried Qwen 2.5 32b on the Xeon G with 4-bit quantization and it does struggle quite a bit.

    Thanked by 1strictlyparmesan
  • This config will struggle with LLMs right?

    Intel Xeon E3-1245v5 - 4c/8t - 3.5 GHz/3.9 GHz - 32GB

    I ran a few models on this setup with Ollama including Deepseek V3 and R1, Llama 3.3, phi 3, Gemma 3. They worked but were SLOOOOOOWWWW.

    Thanked by 1strictlyparmesan
  • jndjnd Member

    @barbarza said:

    This config will struggle with LLMs right?

    Intel Xeon E3-1245v5 - 4c/8t - 3.5 GHz/3.9 GHz - 32GB

    I ran a few models on this setup with Ollama including Deepseek V3 and R1, Llama 3.3, phi 3, Gemma 3. They worked but were SLOOOOOOWWWW.

    Any CPU-only LLM will be really slow. You need fast processing and even faster memory. Only the tiniest models output fast enough, but at that point it's a waste of a dedicated machine and you're better off renting a DDR5 Ryzen or Epyc VPS with just a couple of gigs of RAM.
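
To put a rough number on the "even faster memory" part, here is a back-of-the-envelope sketch. It assumes token generation is purely memory-bound, meaning every generated token streams the full set of weights from RAM once. The 6.41 GiB size is the gemma3 12B Q4_0 figure from the llama-bench table posted earlier; the bandwidth values are assumed theoretical dual-channel peaks, not measurements from any of the servers in this thread, and `max_tokens_per_second` is just an illustrative helper name.

```python
def max_tokens_per_second(model_size_gib: float, mem_bandwidth_gb_s: float) -> float:
    """Rough ceiling on generation speed if each token reads all weights from RAM once."""
    model_size_gb = model_size_gib * 1024**3 / 1e9  # GiB -> GB (decimal)
    return mem_bandwidth_gb_s / model_size_gb


# Assumed theoretical peaks (2 channels * 8 bytes * transfer rate), not measured values:
#   dual-channel DDR4-2400: 2 * 8 * 2400 MT/s = 38.4 GB/s
#   dual-channel DDR5-5600: 2 * 8 * 5600 MT/s = 89.6 GB/s
ASSUMED_BANDWIDTH_GB_S = {
    "DDR4-2400 dual channel": 38.4,
    "DDR5-5600 dual channel": 89.6,
}

MODEL_SIZE_GIB = 6.41  # gemma3 12B Q4_0, from the benchmark table in the thread

for label, bandwidth in ASSUMED_BANDWIDTH_GB_S.items():
    ceiling = max_tokens_per_second(MODEL_SIZE_GIB, bandwidth)
    print(f"{label}: at most ~{ceiling:.1f} tokens/s")
```

Real numbers will land below these ceilings, since the estimate ignores compute, cache effects, and any network or RPC overhead. It only shows why memory bandwidth, rather than core count, tends to cap CPU-only generation speed.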

    Thanked by 1barbarza
  • @jnd said:

    @barbarza said:

    This config will struggle with LLMs right?

    Intel Xeon E3-1245v5 - 4c/8t - 3.5 GHz/3.9 GHz - 32GB

    I ran a few models on this setup with Ollama including Deepseek V3 and R1, Llama 3.3, phi 3, Gemma 3. They worked but were SLOOOOOOWWWW.

    Any CPU-only LLM will be really slow. You need fast processing and even faster memory. Only the tiniest models output fast enough, but at that point it's a waste of a dedicated machine and you're better off renting a DDR5 Ryzen or Epyc VPS with just a couple of gigs of RAM.

    Absolutely. Was just messing around with a spare machine I had to figure stuff out.

    Thanked by 1jnd
  • adnsadns Member

    I asked OVH for a refund and support asked me to cancel the service by hand and reply to their message.

    Is this a new policy? As far as I knew, OVH also cancels the server itself when a refund is requested.

  • @adns said:
    I asked OVH for a refund and support asked me to cancel the service by hand and reply to their message.

    Is this a new policy? As far as I knew, OVH also cancels the server itself when a refund is requested.

    I asked for a refund because one of my KS-MYSTERYs got delivered with 32GB RAM. They offered to transfer the time remaining on the problem server to one of my other KS-MYSTERYs. I had to cancel it manually myself.

  • @adns said:
    I asked OVH for a refund and support asked me to cancel the service by hand and reply to their message.

    Is this a new policy? As far as I knew, OVH also cancels the server itself when a refund is requested.

    I've had both happen, either I have to cancel it myself or they cancel it for me. Seems a bit inconsistent but it's not a big deal either way imo

    Thanked by 1adns
  • @mrinternational said:
    Did anyone get a Xeon E-2288G with 128GB in GRA? It seems the best upgrade in GRA was a Xeon with 64GB, or am I wrong?

    Not sure if it's in GRA, but some people got E-2288G + 128G + 2x1.92T NVMe

  • # ## ## ## ## ## ## ## ## ## ## ## ## ## ## ## ## ## #
    #              Yet-Another-Bench-Script              #
    #                     v2025-01-01                    #
    # https://github.com/masonr/yet-another-bench-script #
    # ## ## ## ## ## ## ## ## ## ## ## ## ## ## ## ## ## #
    
    Mon Apr  7 18:43:52 UTC 2025
    
    Basic System Information:
    ---------------------------------
    Uptime     : 0 days, 0 hours, 10 minutes
    Processor  : Intel(R) Xeon(R) E-2288G CPU @ 3.70GHz
    CPU cores  : 16 @ 800.000 MHz
    AES-NI     : ✔ Enabled
    VM-x/AMD-V : ✔ Enabled
    RAM        : 62.5 GiB
    Swap       : 1024.0 MiB
    Disk       : 3.4 TiB
    Distro     : Debian GNU/Linux 12 (bookworm)
    Kernel     : 6.1.0-32-amd64
    VM Type    : NONE
    IPv4/IPv6  : ✔ Online / ✔ Online
    
    IPv6 Network Information:
    ---------------------------------
    ISP        : OVH SAS
    ASN        : AS16276 OVH SAS
    Host       : OVH
    Location   : Roubaix, Hauts-de-France (HDF)
    Country    : France
    
    fio Disk Speed Tests (Mixed R/W 50/50) (Partition /dev/md3):
    ---------------------------------
    Block Size | 4k            (IOPS) | 64k           (IOPS)
      ------   | ---            ----  | ----           ----
    Read       | 1.17 GB/s   (293.6k) | 1.78 GB/s    (27.8k)
    Write      | 1.17 GB/s   (294.4k) | 1.78 GB/s    (27.9k)
    Total      | 2.35 GB/s   (588.0k) | 3.56 GB/s    (55.7k)
               |                      |
    Block Size | 512k          (IOPS) | 1m            (IOPS)
      ------   | ---            ----  | ----           ----
    Read       | 1.80 GB/s     (3.5k) | 1.81 GB/s     (1.7k)
    Write      | 1.90 GB/s     (3.7k) | 1.93 GB/s     (1.8k)
    Total      | 3.71 GB/s     (7.2k) | 3.74 GB/s     (3.6k)
    
    iperf3 Network Speed Tests (IPv4):
    ---------------------------------
    Provider        | Location (Link)           | Send Speed      | Recv Speed      | Ping
    -----           | -----                     | ----            | ----            | ----
    Clouvider       | London, UK (10G)          | 971 Mbits/sec   | 8.54 Gbits/sec  | 4.99 ms
    Eranium         | Amsterdam, NL (100G)      | 970 Mbits/sec   | 8.95 Gbits/sec  | 6.25 ms
    Uztelecom       | Tashkent, UZ (10G)        | 900 Mbits/sec   | 1.85 Gbits/sec  | 104 ms
    Leaseweb        | Singapore, SG (10G)       | 796 Mbits/sec   | 1.09 Gbits/sec  | --
    Clouvider       | Los Angeles, CA, US (10G) | 839 Mbits/sec   | 1.24 Gbits/sec  | 141 ms
    Leaseweb        | NYC, NY, US (10G)         | 904 Mbits/sec   | 2.42 Gbits/sec  | 76.4 ms
    Edgoo           | Sao Paulo, BR (1G)        | 735 Mbits/sec   | 924 Mbits/sec   | 188 ms
    
    iperf3 Network Speed Tests (IPv6):
    ---------------------------------
    Provider        | Location (Link)           | Send Speed      | Recv Speed      | Ping
    -----           | -----                     | ----            | ----            | ----
    Clouvider       | London, UK (10G)          | 957 Mbits/sec   | 8.22 Gbits/sec  | 4.94 ms
    Eranium         | Amsterdam, NL (100G)      | 955 Mbits/sec   | 8.76 Gbits/sec  | 5.95 ms
    Uztelecom       | Tashkent, UZ (10G)        | 891 Mbits/sec   | 1.86 Gbits/sec  | 104 ms
    Leaseweb        | Singapore, SG (10G)       | 816 Mbits/sec   | 1.08 Gbits/sec  | 162 ms
    Clouvider       | Los Angeles, CA, US (10G) | 828 Mbits/sec   | 1.24 Gbits/sec  | 141 ms
    Leaseweb        | NYC, NY, US (10G)         | 900 Mbits/sec   | 2.47 Gbits/sec  | 76.6 ms
    Edgoo           | Sao Paulo, BR (1G)        | 789 Mbits/sec   | 940 Mbits/sec   | 187 ms
    
    Geekbench 6 Benchmark Test:
    ---------------------------------
    Test            | Value
                    |
    Single Core     | 1827
    Multi Core      | 8358
    Full Test       | https://browser.geekbench.com/v6/cpu/11414055
    
    Thanked by 1maverick
  • I'm giving away my KS-C server in GRA2. It's paid until May 1st, but has a commitment ending on July 1st, 2025. Feel free to DM me if you're interested. EU account.

  • CeeCee Member

    @wuck said:

    Sun Apr  6 10:48:29 UTC 2025
    
    Basic System Information:
    ---------------------------------
    Uptime     : 3 days, 2 hours, 24 minutes
    Processor  : AMD EPYC 7351P 16-Core Processor
    CPU cores  : 32 @ 1192.365 MHz
    AES-NI     : ✔ Enabled
    VM-x/AMD-V : ✔ Enabled
    RAM        : 125.8 GiB
    Swap       : 1024.0 MiB
    Disk       : 937.2 GiB
    Distro     : Debian GNU/Linux 11 (bullseye)
    Kernel     : 5.10.0-34-amd64
    VM Type    : NONE
    IPv4/IPv6  : ✔ Online / ✔ Online
    
    IPv6 Network Information:
    ---------------------------------
    ISP        : OVH SAS
    ASN        : AS16276 OVH SAS
    Host       : OVH
    Location   : Gravelines, Hauts-de-France (HDF)
    Country    : France
    
    fio Disk Speed Tests (Mixed R/W 50/50) (Partition /dev/md3):
    ---------------------------------
    Block Size | 4k            (IOPS) | 64k           (IOPS)
      ------   | ---            ----  | ----           ---- 
    Read       | 274.49 MB/s  (68.6k) | 1.52 GB/s    (23.8k)
    Write      | 275.21 MB/s  (68.8k) | 1.53 GB/s    (23.9k)
    Total      | 549.71 MB/s (137.4k) | 3.05 GB/s    (47.7k)
               |                      |                     
    Block Size | 512k          (IOPS) | 1m            (IOPS)
      ------   | ---            ----  | ----           ---- 
    Read       | 1.66 GB/s     (3.2k) | 1.77 GB/s     (1.7k)
    Write      | 1.75 GB/s     (3.4k) | 1.89 GB/s     (1.8k)
    Total      | 3.41 GB/s     (6.6k) | 3.66 GB/s     (3.5k)
    
    Geekbench 4 Benchmark Test:
    ---------------------------------
    Test            | Value                         
                    |                               
    Single Core     | 3837                          
    Multi Core      | 45403                         
    Full Test       | https://browser.geekbench.com/v4/cpu/18646644
    
    Geekbench 5 Benchmark Test:
    ---------------------------------
    Test            | Value                         
                    |                               
    Single Core     | 829                           
    Multi Core      | 12267                         
    Full Test       | https://browser.geekbench.com/v5/cpu/23454268
    
    YABS completed in 6 min 54 sec
    

    Living its best idling life atm
    Rise-S in comparison

    Basic System Information:
    ---------------------------------
    Uptime     : 52 days, 12 hours, 0 minutes
    Processor  : AMD Ryzen 7 9700X 8-Core Processor
    CPU cores  : 16 @ 5526.574 MHz
    AES-NI     : ✔ Enabled
    VM-x/AMD-V : ✔ Enabled
    RAM        : 62.4 GiB
    Swap       : 0.0 KiB
    Disk       : 468.2 GiB
    Distro     : Ubuntu 24.04.2 LTS
    Kernel     : 6.8.0-53-generic
    VM Type    : NONE
    IPv4/IPv6  : ✔ Online / ✔ Online
    
    IPv6 Network Information:
    ---------------------------------
    ISP        : OVH SAS
    ASN        : AS16276 OVH SAS
    Host       : OVH
    Location   : Roubaix, Hauts-de-France (HDF)
    Country    : France
    
    fio Disk Speed Tests (Mixed R/W 50/50) (Partition /dev/md3):
    ---------------------------------
    Block Size | 4k            (IOPS) | 64k           (IOPS)
      ------   | ---            ----  | ----           ---- 
    Read       | 1.44 GB/s   (360.8k) | 2.07 GB/s    (32.4k)
    Write      | 1.44 GB/s   (361.8k) | 2.09 GB/s    (32.6k)
    Total      | 2.89 GB/s   (722.6k) | 4.16 GB/s    (65.1k)
               |                      |                     
    Block Size | 512k          (IOPS) | 1m            (IOPS)
      ------   | ---            ----  | ----           ---- 
    Read       | 2.65 GB/s     (5.1k) | 2.80 GB/s     (2.7k)
    Write      | 2.79 GB/s     (5.4k) | 2.98 GB/s     (2.9k)
    Total      | 5.44 GB/s    (10.6k) | 5.79 GB/s     (5.6k)
    
    Geekbench 4 Benchmark Test:
    ---------------------------------
    Test            | Value                         
                    |                               
    Single Core     | 10564                         
    Multi Core      | 65813                         
    Full Test       | https://browser.geekbench.com/v4/cpu/18646648
    
    Geekbench 5 Benchmark Test:
    ---------------------------------
    Test            | Value                         
                    |                               
    Single Core     | 2631                          
    Multi Core      | 14817                         
    Full Test       | https://browser.geekbench.com/v5/cpu/23454291
    
    YABS completed in 4 min 23 sec
    

    Considering transferring it?

  • wuckwuck Member

    @Cee said:

    @wuck said:

    Sun Apr  6 10:48:29 UTC 2025
    
    Basic System Information:
    ---------------------------------
    Uptime     : 3 days, 2 hours, 24 minutes
    Processor  : AMD EPYC 7351P 16-Core Processor
    CPU cores  : 32 @ 1192.365 MHz
    AES-NI     : ✔ Enabled
    VM-x/AMD-V : ✔ Enabled
    RAM        : 125.8 GiB
    Swap       : 1024.0 MiB
    Disk       : 937.2 GiB
    Distro     : Debian GNU/Linux 11 (bullseye)
    Kernel     : 5.10.0-34-amd64
    VM Type    : NONE
    IPv4/IPv6  : ✔ Online / ✔ Online
    
    IPv6 Network Information:
    ---------------------------------
    ISP        : OVH SAS
    ASN        : AS16276 OVH SAS
    Host       : OVH
    Location   : Gravelines, Hauts-de-France (HDF)
    Country    : France
    
    fio Disk Speed Tests (Mixed R/W 50/50) (Partition /dev/md3):
    ---------------------------------
    Block Size | 4k            (IOPS) | 64k           (IOPS)
      ------   | ---            ----  | ----           ---- 
    Read       | 274.49 MB/s  (68.6k) | 1.52 GB/s    (23.8k)
    Write      | 275.21 MB/s  (68.8k) | 1.53 GB/s    (23.9k)
    Total      | 549.71 MB/s (137.4k) | 3.05 GB/s    (47.7k)
               |                      |                     
    Block Size | 512k          (IOPS) | 1m            (IOPS)
      ------   | ---            ----  | ----           ---- 
    Read       | 1.66 GB/s     (3.2k) | 1.77 GB/s     (1.7k)
    Write      | 1.75 GB/s     (3.4k) | 1.89 GB/s     (1.8k)
    Total      | 3.41 GB/s     (6.6k) | 3.66 GB/s     (3.5k)
    
    Geekbench 4 Benchmark Test:
    ---------------------------------
    Test            | Value                         
                    |                               
    Single Core     | 3837                          
    Multi Core      | 45403                         
    Full Test       | https://browser.geekbench.com/v4/cpu/18646644
    
    Geekbench 5 Benchmark Test:
    ---------------------------------
    Test            | Value                         
                    |                               
    Single Core     | 829                           
    Multi Core      | 12267                         
    Full Test       | https://browser.geekbench.com/v5/cpu/23454268
    
    YABS completed in 6 min 54 sec
    

    Living its best idling life atm
    Rise-S in comparison

    Basic System Information:
    ---------------------------------
    Uptime     : 52 days, 12 hours, 0 minutes
    Processor  : AMD Ryzen 7 9700X 8-Core Processor
    CPU cores  : 16 @ 5526.574 MHz
    AES-NI     : ✔ Enabled
    VM-x/AMD-V : ✔ Enabled
    RAM        : 62.4 GiB
    Swap       : 0.0 KiB
    Disk       : 468.2 GiB
    Distro     : Ubuntu 24.04.2 LTS
    Kernel     : 6.8.0-53-generic
    VM Type    : NONE
    IPv4/IPv6  : ✔ Online / ✔ Online
    
    IPv6 Network Information:
    ---------------------------------
    ISP        : OVH SAS
    ASN        : AS16276 OVH SAS
    Host       : OVH
    Location   : Roubaix, Hauts-de-France (HDF)
    Country    : France
    
    fio Disk Speed Tests (Mixed R/W 50/50) (Partition /dev/md3):
    ---------------------------------
    Block Size | 4k            (IOPS) | 64k           (IOPS)
      ------   | ---            ----  | ----           ---- 
    Read       | 1.44 GB/s   (360.8k) | 2.07 GB/s    (32.4k)
    Write      | 1.44 GB/s   (361.8k) | 2.09 GB/s    (32.6k)
    Total      | 2.89 GB/s   (722.6k) | 4.16 GB/s    (65.1k)
               |                      |                     
    Block Size | 512k          (IOPS) | 1m            (IOPS)
      ------   | ---            ----  | ----           ---- 
    Read       | 2.65 GB/s     (5.1k) | 2.80 GB/s     (2.7k)
    Write      | 2.79 GB/s     (5.4k) | 2.98 GB/s     (2.9k)
    Total      | 5.44 GB/s    (10.6k) | 5.79 GB/s     (5.6k)
    
    Geekbench 4 Benchmark Test:
    ---------------------------------
    Test            | Value                         
                    |                               
    Single Core     | 10564                         
    Multi Core      | 65813                         
    Full Test       | https://browser.geekbench.com/v4/cpu/18646648
    
    Geekbench 5 Benchmark Test:
    ---------------------------------
    Test            | Value                         
                    |                               
    Single Core     | 2631                          
    Multi Core      | 14817                         
    Full Test       | https://browser.geekbench.com/v5/cpu/23454291
    
    YABS completed in 4 min 23 sec
    

    Considering transferring it?

    Already done, traded it for a 2288G

  • Looking for some servers $$$

    • E-2288G 64GB RAM 2x1.92 TB NVMe
    • Epyc 7351p 128GB RAM 2x1TB NVMe
  • NeoonNeoon Community Contributor, Veteran
    edited April 2025

    @jnd said:

    @barbarza said:

    This config will struggle with LLMs right?

    Intel Xeon E3-1245v5 - 4c/8t - 3.5 GHz/3.9 GHz - 32GB

    I ran a few models on this setup with Ollama including Deepseek V3 and R1, Llama 3.3, phi 3, Gemma 3. They worked but were SLOOOOOOWWWW.

    Any CPU-only LLM will be really slow. You need fast processing and even faster memory. Only the tiniest models output fast enough, but at that point it's a waste of a dedicated machine and you're better off renting a DDR5 Ryzen or Epyc VPS with just a couple of gigs of RAM.

    I mean the model he used for the benchmark:
    https://huggingface.co/google/gemma-3-12b-it-qat-q4_0-gguf

    It's actually usable, up to 25t/s, which is not bad for a 12b on a 5-year-old CPU.
    A couple of gigs won't cut it; you likely need at least 16 or 32 gigs to test any model that would be too slow to run on these dedis.

    Not sure at this point if a VDS Ryzen with 32 gigs would be cheaper than this dedi, and then the question would be how fast it would be.

    I should try the unsloth models on the MYSTERY boxes.

    Thanked by 2admax loay