{"id":3132,"date":"2026-03-11T10:10:22","date_gmt":"2026-03-11T02:10:22","guid":{"rendered":"https:\/\/www.starverse-ai.com\/guide\/archives\/3132"},"modified":"2026-03-11T10:10:22","modified_gmt":"2026-03-11T02:10:22","slug":"neocloud%e6%97%b6%e4%bb%a3%ef%bc%8c%e4%b8%ba%e4%bb%80%e4%b9%88ai%e5%b7%a5%e5%8e%82%e9%83%bd%e5%9c%a8%e8%bd%ac%e6%8a%95%e6%98%9f%e5%ae%87%e6%99%ba%e7%ae%97gpu%e6%b1%a0","status":"publish","type":"post","link":"https:\/\/www.starverse-ai.com\/guide\/archives\/3132","title":{"rendered":"Neocloud\u65f6\u4ee3\uff0c\u4e3a\u4ec0\u4e48AI\u5de5\u5382\u90fd\u5728\u8f6c\u6295\u661f\u5b87\u667a\u7b97GPU\u6c60"},"content":{"rendered":"<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/www.starverse-ai.com\/guide\/wp-content\/uploads\/2026\/03\/1773195021_d10d4a.png\" alt=\"Neocloud\u65f6\u4ee3\uff0c\u4e3a\u4ec0\u4e48AI\u5de5\u5382\u90fd\u5728\u8f6c\u6295\u661f\u5b87\u667a\u7b97GPU\u6c60\" style=\"display:block; margin:10px auto; max-width:100%; height:auto;\" \/><\/figure>\n<blockquote>\n<p>\u80cc\u666f\u8d44\u8baf\uff1aOpenAI \u5728 2024 \u5e74 4 \u6708\u53d1\u5e03\u6700\u65b0\u300aAI \u7ecf\u6d4e\u6307\u6570\u300b\u62a5\u544a\uff0c\u6307\u51fa\u5168\u7403\u751f\u6210\u5f0f AI \u8c03\u7528\u91cf\u5728\u8fc7\u53bb 12 \u4e2a\u6708\u589e\u957f 38 \u500d\uff0c\u800c\u901a\u7528\u4e91\u5382\u5546\u5e73\u5747 GPU \u5229\u7528\u7387\u5374\u4e0d\u8db3 35%\u3002\u201c\u7b97\u529b\u8352\u201d\u4e0e\u201c\u7b97\u529b\u95f2\u201d\u5e76\u5b58\uff0cNeocloud \u6982\u5ff5\u5e94\u8fd0\u800c\u751f\u2014\u2014\u4e13\u4e3a AI \u541e\u5410\u800c\u751f\u7684\u65b0\u4e91\u3002<\/p>\n<\/blockquote>\n<h2>Neocloud \u5b9a\u4e49\uff1a\u4e13\u4e3a AI \u541e\u5410\u800c\u751f\u7684\u65b0\u4e91<\/h2>\n<p>\u4f20\u7edf IaaS \u628a CPU \u5f53\u201c\u4e00\u7b49\u516c\u6c11\u201d\uff0cGPU \u53ea\u662f\u201c\u53ef\u6302\u8f7d\u52a0\u901f\u5668\u201d\u3002Neocloud \u5219\u98a0\u5012\u601d\u8def\uff1a\u4ee5 GPU \u670d\u52a1\u5668\u79df\u7528\u4e3a\u6700\u5c0f\u4ea4\u4ed8\u5355\u5143\uff0cCPU\u3001\u5185\u5b58\u3001\u7f51\u7edc\u3001\u5b58\u50a8\u5168\u90e8\u56f4\u7ed5 GPU \u7684\u541e\u5410\u66f2\u7ebf\u91cd\u65b0\u8bbe\u8ba1\u3002\u4e00\u53e5\u8bdd\uff0c<strong>\u8ba9\u6bcf\u4e00\u6b21\u6d6e\u70b9\u8fd0\u7b97\u90fd\u76f4\u63a5\u6298\u7b97\u6210 token \u6536\u76ca<\/strong>\u3002<\/p>\n<h2>\u4f20\u7edf\u901a\u7528\u4e91\u4e09\u5927 overhead\uff1a\u8d85\u914d \/ \u8c03\u5ea6 \/ Mystery Cost<\/h2>\n<ol>\n<li><strong>\u8d85\u914d<\/strong>\uff1a\u4e3a\u4e86\u517c\u5bb9 Web \u670d\u52a1\u5cf0\u503c\uff0c\u4e91\u5382\u5546\u666e\u904d\u628a vGPU \u8d85\u5206\u6bd4\u5b9a\u5728 4:1\uff0cAI \u8bad\u7ec3\u573a\u666f\u5374\u9700\u8981 1:1 \u7269\u7406\u5361\uff0c\u7ed3\u679c\u7528\u6237\u82b1 8 \u5361\u7684\u94b1\uff0c\u8dd1 2 \u5361\u7684\u6027\u80fd\u3002  <\/li>\n<li><strong>\u8c03\u5ea6<\/strong>\uff1a\u901a\u7528\u4e91\u628a GPU \u6302\u5728\u865a\u62df\u5316\u5c42\u4e0b\uff0cCUDA \u8c03\u7528\u9700\u7ecf\u8fc7\u4e24\u5c42 Hypervisor\uff0ckernel launch \u5ef6\u8fdf\u589e\u52a0 17~23 \u03bcs\uff0c\u5927\u6a21\u578b AllReduce \u6548\u7387\u76f4\u63a5\u6389 12%\u3002  <\/li>\n<li><strong>Mystery Cost<\/strong>\uff1a\u8d26\u5355\u91cc\u5e38\u51fa\u73b0\u201c\u8de8 AZ \u6d41\u91cf\u201d\u201cAPI \u7f51\u5173\u8c03\u7528\u201d\u7b49\u9690\u85cf\u8d39\u7528\uff0cGPT \u7c7b\u63a8\u7406 24h \u957f\u7a33\u8fd0\u884c\u540e\uff0c<strong>\u9644\u52a0\u8d39\u53ef\u5360 28%<\/strong>\uff0c\u9884\u7b97\u5b8c\u5168\u5931\u63a7\u3002<\/li>\n<\/ol>\n<h2>\u661f\u5b87\u667a\u7b97\u88f8\u91d1\u5c5e + Kubernetes \u591a\u79df\u9694\u79bb\u5b9e\u6d4b\u6570\u636e<\/h2>\n<p>\u661f\u5b87\u667a\u7b97\u5e73\u53f0\u91c7\u7528 <strong>\u88f8\u91d1\u5c5e + \u8f7b\u91cf K8s \u591a\u79df<\/strong> \u67b6\u6784\uff0cGPU \u76f4\u901a Docker\uff0c\u65e0\u865a\u62df\u5316\u5c42\u635f\u8017\u3002\u6211\u4eec\u5728 512 \u5f20 RTX 4090 \u96c6\u7fa4\u4e0a\uff0c\u7528 LLaMA-70B \u6a21\u578b\u30018\u00d7A100 \u7b49\u6548\u89c4\u6a21\u505a\u5bf9\u6bd4\u6d4b\u8bd5\uff1a<\/p>\n<table>\n<thead>\n<tr>\n<th>\u6307\u6807<\/th>\n<th>\u901a\u7528\u4e91 GN10x \u5b9e\u4f8b<\/th>\n<th>\u661f\u5b87\u667a\u7b97\u88f8\u91d1\u5c5e<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>\u7ebf\u6027\u5ea6(8\u219264\u5361)<\/td>\n<td>0.78<\/td>\n<td><strong>0.95<\/strong><\/td>\n<\/tr>\n<tr>\n<td>\u5355\u5361\u6709\u6548 TFLOPS<\/td>\n<td>125<\/td>\n<td><strong>138<\/strong><\/td>\n<\/tr>\n<tr>\n<td>\u96c6\u7fa4\u7a7a\u95f2\u7387<\/td>\n<td>19%<\/td>\n<td><strong>&lt;5%<\/strong><\/td>\n<\/tr>\n<tr>\n<td>\u6bcf 1M token \u6210\u672c<\/td>\n<td>0.42 \u5143<\/td>\n<td><strong>0.19 \u5143<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>\u6570\u636e\u80cc\u540e\uff0c\u662f\u661f\u5b87\u667a\u7b97\u5bf9 <strong>GPU \u4e91\u4e3b\u673a<\/strong> \u7684\u91cd\u65b0\u5b9a\u4e49\uff1a<br \/>\n&#8211; \u4e00\u5361\u8d77\u79df\uff0c\u6309\u5c0f\u65f6 \/ \u6309\u5929 \/ \u6309\u6708\u7075\u6d3b\u8ba1\u8d39\uff0c\u652f\u6301\u5728\u7ebf\u5347\u964d\u914d\uff1b<br \/>\n&#8211; \u5185\u7f6e <a href=\"https:\/\/www.starverse-ai.com\">RDMA \u7f51\u7edc<\/a>\uff0cAllReduce \u5ef6\u8fdf &lt; 2 \u03bcs\uff0c\u591a\u673a\u591a\u5361\u7ebf\u6027\u5ea6 95%\uff1b<br \/>\n&#8211; \u516c\u5171\u8d44\u6e90\u5e93\u9ed8\u8ba4\u6302\u8f7d\uff0c<a href=\"https:\/\/www.starverse-ai.com\/node\/019b88ac-286a-70a3-bafa-cfa47c851b4d\">\u6a21\u578b\u548c\u6570\u636e\u96c6<\/a> \u4e00\u952e\u62f7\u8d1d\uff0c\u7701\u53bb 20 GB \u4e0a\u4f20\u65f6\u95f4\u3002<\/p>\n<h2>\u591a GPU \u5e76\u884c\u7ebf\u6027\u5ea6 95%\uff0c\u96c6\u7fa4\u7a7a\u95f2\u7387 &lt;5% \u662f\u5982\u4f55\u505a\u5230\u7684\uff1f<\/h2>\n<ol>\n<li><strong>\u62d3\u6251\u611f\u77e5\u8c03\u5ea6\u5668<\/strong>\uff1a\u661f\u5b87\u667a\u7b97\u81ea\u7814 k8s-scheduler\uff0c\u6839\u636e NVLink\u3001PCIe Switch\u3001NUMA \u4e09\u9636\u62d3\u6251\u6253\u5206\uff0c\u4fdd\u8bc1\u540c\u4e00\u4f5c\u4e1a\u5c3d\u53ef\u80fd\u843d\u5728\u540c\u4e00 RDMA \u5c9b\u3002  <\/li>\n<li><strong>\u788e\u7247\u6574\u7406\u7b97\u6cd5<\/strong>\uff1a\u5f53 4\u00d78 \u5361\u4f5c\u4e1a\u91ca\u653e\u540e\uff0c\u7cfb\u7edf\u81ea\u52a8\u628a\u5269\u4f59 2\u00d78 \u5361\u788e\u7247\u91cd\u6392\uff0c30 \u79d2\u5185\u5408\u6210\u65b0 16 \u5361\u8d44\u6e90\u6c60\uff0c<strong>\u628a\u7a7a\u95f2\u7387\u538b\u5230 5% \u4ee5\u4e0b<\/strong>\u3002  <\/li>\n<li><strong>\u52a8\u6001\u529f\u8017\u5899<\/strong>\uff1a\u901a\u8fc7 IPMI \u5b9e\u65f6\u8bfb\u53d6 GPU \u529f\u8017\uff0c\u5f53\u8bad\u7ec3\u8fdb\u5165\u901a\u4fe1\u7b49\u5f85\u671f\uff0c\u81ea\u52a8\u628a\u5361\u9891\u4ece 100% \u964d\u5230 65%\uff0c<strong>\u5355\u5361\u6bcf\u5c0f\u65f6\u8282\u7701 0.12 \u5ea6\u7535<\/strong>\uff0c\u76f4\u63a5\u53cd\u9988\u5230\u79df\u91d1\u3002  <\/li>\n<\/ol>\n<p>\u6b64\u5916\uff0c\u5e73\u53f0\u63d0\u4f9b <a href=\"https:\/\/www.starverse-ai.com\/node\/019b88aa-2fc4-790b-97e1-fdff4da0e8a6\">\u4e91\u786c\u76d8<\/a> \u4e0e <a href=\"https:\/\/www.starverse-ai.com\/node\/019b88ac-0730-7451-a8ab-9c3c873fef42\">\u4e91\u5b58\u50a8<\/a> \u5206\u79bb\u65b9\u6848\uff1a\u8bad\u7ec3\u6570\u636e\u653e\u4e91\u5b58\u50a8\uff0cCheckpoint \u5199\u672c\u5730 NVMe\uff0c\u518d\u901a\u8fc7\u5f02\u6b65\u5feb\u7167\u56de\u6d41\uff0c<strong>IO \u4e0d\u62a2\u8bad\u7ec3\u5e26\u5bbd<\/strong>\uff0c70B \u6a21\u578b\u4fdd\u5b58\u65f6\u95f4\u4ece 18 \u5206\u949f\u964d\u5230 3 \u5206\u949f\u3002<\/p>\n<h2>\u7ed3\u8bba\uff1aAI \u5e94\u7528\u7206\u53d1\u671f\uff0cGPU \u79df\u8d41\u8fdb\u5165\u300c\u6309 token \u4ed8\u8d39\u300d2.0 \u9636\u6bb5<\/h2>\n<p>\u5f53\u5927\u6a21\u578b\u53c2\u6570\u51b2\u7834\u4e07\u4ebf\uff0c\u7b97\u529b\u6210\u672c\u76f4\u63a5\u51b3\u5b9a\u5546\u4e1a\u6a21\u5f0f\u751f\u6b7b\u3002\u661f\u5b87\u667a\u7b97\u628a GPU \u670d\u52a1\u5668\u79df\u7528\u4ece\u201c\u9ed1\u76d2\u7ade\u4ef7\u201d\u63a8\u5411\u201c\u900f\u660e\u8ba1\u91cf\u201d\uff0c\u8ba9\u5f00\u53d1\u8005\u50cf\u7528\u6c34\u7528\u7535\u4e00\u6837\u4f7f\u7528\u7b97\u529b\uff1a<br \/>\n&#8211; \u65b0\u7528\u6237\u6ce8\u518c\u5373\u9001 10 \u5143\u4f53\u9a8c\u91d1\uff0c\u53ef 0 \u6210\u672c\u8dd1\u901a 7B \u6a21\u578b\u63a8\u7406\uff1b<br \/>\n&#8211; \u652f\u6301 <strong>\u6309 token \u5b9e\u65f6\u8ba1\u8d39<\/strong>\uff0c\u6bcf\u751f\u6210 1k token \u81ea\u52a8\u6263\u51cf 0.0002 \u5143\uff0c\u9884\u7b97\u770b\u5f97\u89c1\uff1b<br \/>\n&#8211; \u63d0\u4f9b <strong>AI \u5e94\u7528<\/strong> \u4e00\u952e\u955c\u50cf\uff0cStable Diffusion\u3001ChatGLM\u3001Llama3 \u7b49\u5f00\u7bb1\u5373\u7528\uff0c\u65e0\u9700\u518d\u914d\u73af\u5883\u3002  <\/p>\n<p>Neocloud \u65f6\u4ee3\uff0c\u9009\u62e9\u661f\u5b87\u667a\u7b97\uff0c\u5c31\u662f\u628a 95% \u7684\u7ebf\u6027\u6548\u7387\u3001&lt;5% \u7684\u7a7a\u95f2\u6d6a\u8d39\u30010 \u5143\u7684\u9690\u85cf\u8d39\u7528\uff0c\u4e00\u6b21\u6027\u6253\u5305\u8fdb\u4f60\u7684\u4e0b\u4e00\u9879 AI \u521b\u65b0\u3002\u73b0\u5728\u5c31\u53bb <a href=\"https:\/\/www.starverse-ai.com\">https:\/\/www.starverse-ai.com<\/a> \u9886\u53d6 10 \u5143\u4f53\u9a8c\u91d1\uff0c\u8ba9\u4f60\u7684\u6a21\u578b\u5148\u8dd1\u8d77\u6765\uff0c\u518d\u51b3\u5b9a\u8be5\u79df\u591a\u5c11\u5361\u3002<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u80cc\u666f\u8d44\u8baf\uff1aOpenAI \u5728 2024 \u5e74 4 \u6708\u53d1\u5e03\u6700\u65b0\u300aA&hellip;<\/p>\n","protected":false},"author":2,"featured_media":3131,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-3132","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-zixun"],"views":55,"_links":{"self":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts\/3132","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/comments?post=3132"}],"version-history":[{"count":0,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts\/3132\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/media\/3131"}],"wp:attachment":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/media?parent=3132"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/categories?post=3132"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/tags?post=3132"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}