{"id":2396,"date":"2026-03-02T16:12:44","date_gmt":"2026-03-02T08:12:44","guid":{"rendered":"https:\/\/www.starverse-ai.com\/guide\/archives\/2396"},"modified":"2026-03-02T16:12:44","modified_gmt":"2026-03-02T08:12:44","slug":"%e5%86%85%e5%ad%98%e5%a2%99%e9%9d%a9%e5%91%bd%e6%9d%a5%e8%a2%ad%ef%bc%81%e6%98%9f%e5%ae%87%e6%99%ba%e7%ae%97%e5%b9%b3%e5%8f%b0%e8%b0%88%e3%80%8c%e6%b7%b7%e5%90%88%e7%b2%be%e5%ba%a6%e5%a4%a7%e6%98%be","status":"publish","type":"post","link":"https:\/\/www.starverse-ai.com\/guide\/archives\/2396","title":{"rendered":"The memory wall revolution is here! The \u661f\u5b87\u667a\u7b97 platform on how \u201cmixed precision + large GPU memory\u201d breaks the large-model bottleneck"},"content":{"rendered":"<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/www.starverse-ai.com\/guide\/wp-content\/uploads\/2026\/03\/1772439163_b3713f.png\" alt=\"The memory wall revolution is here! The \u661f\u5b87\u667a\u7b97 platform on how \u201cmixed precision + large GPU memory\u201d breaks the large-model bottleneck\" style=\"display:block; margin:10px auto; max-width:100%; height:auto;\" \/><\/figure>\n<h2>The memory wall revolution is here! The \u661f\u5b87\u667a\u7b97 platform on how \u201cmixed precision + large GPU memory\u201d breaks the large-model bottleneck<\/h2>\n<blockquote>\n<p>\u201cWhen the PCIe 6.0 interconnect chips from Montage Technology (\u6f9c\u8d77\u79d1\u6280) hit the daily price limit three sessions in a row within one week, people suddenly realized that the compute race is no longer decided by transistor counts but by how data \u2018flows\u2019.\u201d<\/p>\n<\/blockquote>\n<h3>\u2460 Technical background: PCIe 6.0 takes off, and the memory wall becomes the new \u201cwall of sighs\u201d<\/h3>\n<p>Over the past year, large-model parameter counts have jumped from the tens of billions to the trillions, while peak GPU compute has kept compounding at 2.2\u00d7 per year. That looks impressive, but practitioners watch a different curve: memory bandwidth grows at only 1.4\u00d7. Even as the theoretical one-way bandwidth of PCIe 6.0 \u00d716 approaches 128 GB\/s, in practice 80% of the power budget is spent on \u201cmoving weights\u201d: every attention computation has to move tens of GB of parameters from GPU memory into cache and then back again, and the movement takes far longer than the computation itself. The memory wall has become the \u201cwall of sighs\u201d for large-model deployment.<\/p>\n<h3>\u2461 Pain point: 80% of power goes to \u201cmoving\u201d, not \u201cthinking\u201d<\/h3>\n<p>In tests at the \u661f\u5b87\u667a\u7b97 lab, an H100 with 80 GB of memory running a 176 B-parameter open-source model filled its memory at batch size 1 in FP16, with bandwidth utilization of only 37%. Put differently, out of every second the GPU cores spend 0.63 s waiting for data and only 0.37 s \u201cthinking\u201d. In electricity terms, that is an extra 12,000 yuan per card per year in \u201cmoving fees\u201d. When a company packs 8 cards into 1 node to cut latency, cost climbs steeply, yet throughput does not grow linearly in return.<\/p>\n<h3>\u2462 Platform approach: H100 at 4.8 TB\/s bandwidth + FP8 mixed precision, throughput doubled<\/h3>\n<p>In its latest GPU cloud host clusters, \u661f\u5b87\u667a\u7b97 raises the effective memory bandwidth of a single H100 from 3.35 TB\/s to 4.8 TB\/s:<br \/>\n1. In-house CUDA kernels dynamically compress FP16 weights to FP8, with accuracy loss &lt;0.3%;<br \/>\n2. Hybrid scheduling of tensor parallelism and pipeline parallelism cuts communication volume by 42%;<br \/>\n3. The NVLink 4.0 topology saturates inter-card bandwidth and cuts latency to 1\/10.  <\/p>\n<p>Measured on the same 176 B model at batch size 8 on the \u661f\u5b87\u667a\u7b97 platform, throughput rose from 820 tokens\/s to 1680 tokens\/s, <strong>a full 2\u00d7<\/strong>, while the cost per 1k tokens fell by 45%. For companies that rely on <a href=\"https:\/\/www.starverse-ai.com\">GPU server rental<\/a> to ship real-time chat and AI applications, this means \u201cstacking cards\u201d is no longer the only answer.<\/p>\n<h3>\u2463 Case study: GLM-130B inference latency 220 ms\u219298 ms<\/h3>\n<p>A leading SaaS vendor needed to embed GLM-130B in its online customer-service system and was extremely sensitive to latency. Its original 4\u00d7A100 setup had a first-token latency of 220 ms and P99 jitter as high as 30%. After migrating to \u661f\u5b87\u667a\u7b97, it reached the same concurrency with only 2\u00d7H100:<br \/>\n&#8211; FP8 mixed precision cut memory usage by 47%, so the 130 B parameters fit on a single card;<br \/>\n&#8211; 4.8 TB\/s of bandwidth shortened weight-loading time from 87 ms to 21 ms;<br \/>\n&#8211; The platform's built-in <a href=\"https:\/\/www.starverse-ai.com\/node\/019b88ac-286a-70a3-bafa-cfa47c851b4d\">models and datasets<\/a> library spared engineers the 6 GB model download and format conversion, shrinking the rollout cycle from 3 days to 2 hours.  <\/p>\n<p>In the end, <strong>first-token latency fell to 98 ms and P99 jitter to &lt;5 ms<\/strong>, customer satisfaction rose 18%, and monthly GPU rental costs fell 38%.<\/p>\n<h3>\u2464 Outlook: CXL 3.0 memory pooling, TB-scale memory for a single card<\/h3>\n<p>\u661f\u5b87\u667a\u7b97 is working with Montage Technology and a leading domestic memory maker to validate a CXL 3.0 memory-pooling design: CPU DDR5 and GPU memory share a unified address space, so a single card can address 2 TB-class \u201cremote GPU memory\u201d. Even if model parameter counts grow another 10\u00d7, \u201cbrute-force card stacking\u201d will no longer be necessary. Test data shows that over a 512 GB\/s CXL link, access latency is only 250 ns and bandwidth utilization improves 3.2\u00d7. In the first quarter of 2025, \u661f\u5b87\u667a\u7b97 expects to be the first to offer a CXL memory-pool beta on the public cloud; users will be able to tick \u201cextended GPU memory\u201d in the console with one click and pay on demand, <strong>making large-model inference as simple as turning on a tap<\/strong>.<\/p>\n<h3>\u2465 Conclusion: compute \u2260 stacking cards; data-flow efficiency is the core<\/h3>\n<p>Competition in large models has entered the \u201cpost-Moore era\u201d, and no amount of FLOPS can overrule the memory wall. With mixed precision, large GPU memory, high bandwidth, and the coming CXL memory pool, \u661f\u5b87\u667a\u7b97 is turning \u201cdata hauling\u201d into \u201cdata flow\u201d. For university research groups, startups, and large technology companies alike, <strong>compute is no longer a heavy one-off CAPEX asset but a GPU cloud host service available on demand like water and electricity<\/strong>.<\/p>\n<p>Register with \u661f\u5b87\u667a\u7b97 now and <strong>new users receive a 10-yuan trial credit<\/strong> to try an H100 high-speed instance at no cost, with 50+ popular large models and 300 TB of public datasets built in, so that you can truly \u201cupload your code and run\u201d.<br \/>\nDon't let the memory wall block your ideas: <a href=\"https:\/\/www.starverse-ai.com\">log in now<\/a> and open the next door for AI applications together.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The memory wall revolution is here! The \u661f\u5b87\u667a\u7b97 platform on how \u201cmixed precision + large GPU memory\u201d breaks the large&hellip;<\/p>\n","protected":false},"author":2,"featured_media":2395,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-2396","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-zixun"],"views":45,"_links":{"self":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts\/2396","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/comments?post=2396"}],"version-history":[{"count":0,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts\/2396\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/media\/2395"}],"wp:attachment":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/media?parent=2396"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/categories?post=2396"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/tags?post=2396"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}