Great results @spyd3rweb! Do you have logs from any worker? I'm curious about the transfer speed.
Provider: homelab (k8s v1.24.14)
Pod/VolcanoJob resource limits: cpu: [7, 8], memory: [8Gi, 16Gi]
Average Single Token Generation Time
Llama 7B, Q40 weights, Q80 buffer
4x Pod (1x root: 8t/16GB, 3x worker 7t/16GB) k8s-pod-network (40Gb/s)
pods
logs
top nodes
8x Pod (1x root: 8t/8GB, 7x worker 7t/8GB) k8s-pod-network (40Gb/s)
pods
logs
top nodes
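For comparing the two layouts, here is a minimal sketch that totals the per-pod figures quoted above. It assumes "8t" and "7t" are thread counts and that the GB value applies to each pod; `cluster_totals` is a hypothetical helper, not part of any tool used in the benchmark.

```python
# Hypothetical helper: totals the per-pod limits quoted in this post.
# Assumptions: "8t"/"7t" are thread counts, and the GB figure is per pod.

def cluster_totals(root_threads, worker_threads, workers, mem_gb_per_pod):
    """Return (pod count, total threads, total memory in GB) for one layout."""
    pods = 1 + workers  # one root pod plus the workers
    threads = root_threads + worker_threads * workers
    memory_gb = pods * mem_gb_per_pod
    return pods, threads, memory_gb

# 4x Pod: 1x root 8t/16GB, 3x worker 7t/16GB
four_pod = cluster_totals(8, 7, 3, 16)
# 8x Pod: 1x root 8t/8GB, 7x worker 7t/8GB
eight_pod = cluster_totals(8, 7, 7, 8)

print(four_pod)   # (4, 29, 64)
print(eight_pod)  # (8, 57, 64)
```

Under those assumptions, both layouts get the same 64 GB in total, while the 8x layout has roughly twice the threads.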
CPU Info
CPU: Intel Xeon E3-1246 v3