can not reproduce $6000 #12103
taishan1994
started this conversation in
General
Replies: 3 comments 2 replies
-
Are you running llama.cpp on a virtual machine by any chance? I noticed you were running llama.cpp as root, and the image shows some tabs at the top. Virtualization could be the reason the TPS is lower.
-
Also, is this 6000? https://openi.pcl.ac.cn/6000/aiforge
-
I have matched the reported result now. I needed to set NUMA to NPS0 and set the memory to interleaved.
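One way to check that the NPS0 setting took effect is to count the NUMA nodes Linux exposes: with NPS0 on a dual-socket EPYC system, the OS should typically see a single node. The following is a minimal sketch, assuming a Linux host with sysfs mounted; `count_numa_nodes` is a hypothetical helper written for illustration and is not part of llama.cpp.

```python
import re
from pathlib import Path


def count_numa_nodes(sysfs_node_dir="/sys/devices/system/node"):
    """Count the NUMA nodes the Linux kernel exposes under sysfs_node_dir."""
    root = Path(sysfs_node_dir)
    if not root.is_dir():
        # Not Linux, or NUMA information is not available through sysfs.
        return 0
    # Node directories are named node0, node1, ...; ignore other entries.
    return sum(
        1
        for entry in root.iterdir()
        if entry.is_dir() and re.fullmatch(r"node\d+", entry.name)
    )


if __name__ == "__main__":
    print(f"NUMA nodes visible to the OS: {count_numa_nodes()}")
```

If this prints more than 1 after the BIOS change, the setting may not have been applied or saved.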
-
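Here's my machine configuration:
AMD dual-socket server:
CPU: 2 x AMD EPYC 9115 / 2.60 GHz / 64 MB cache / 16 cores / 32 threads / 165 W;
Memory: 24 x 32 GB / DDR5 / 5600 MHz / REG (consumer grade), 768 GB total;
SSD: 1 x SSD / 3.84 TB / SATA 6 Gb/s / 2.5-inch / read-intensive;
HDD: 1 x 4 TB / SATA / 7200 RPM / 3.5-inch / enterprise grade;
RAID controller: 1 x LSI 9560-8I 4G / supports RAID 0, 1, 5, 6, 10, 50, 60;
NIC: 1 x dual-port / gigabit copper / RJ45 / I350-T2;
2 onboard 10 Gb copper ports, Broadcom® BCM57416;
Power supply: 1200 W redundant (1+1);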
When I tested with a sample used by $6000, the TPS was only 3.0, while $6000 reported 5.4.