{"id":2305,"date":"2026-03-01T16:13:34","date_gmt":"2026-03-01T08:13:34","guid":{"rendered":"https:\/\/www.starverse-ai.com\/guide\/archives\/2305"},"modified":"2026-03-01T16:13:34","modified_gmt":"2026-03-01T08:13:34","slug":"%e6%8e%a8%e7%90%86%e6%88%90%e6%9c%ac%e5%90%83%e6%8e%8970%e9%a2%84%e7%ae%97%ef%bc%9f%e6%98%9f%e5%ae%87%e6%99%ba%e7%ae%97%e5%b9%b3%e5%8f%b0%e5%bc%b9%e6%80%a7auto-scalingspot%e5%ae%9e%e4%be%8b","status":"publish","type":"post","link":"https:\/\/www.starverse-ai.com\/guide\/archives\/2305","title":{"rendered":"Inference Eating 70% of Your Budget? \u2018Elastic Auto-Scaling + Spot Instances\u2019 on the \u661f\u5b87\u667a\u7b97 Platform Cut AI Application Costs by a Further 55%"},"content":{"rendered":"<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/www.starverse-ai.com\/guide\/wp-content\/uploads\/2026\/03\/1772352814_4d28fd.png\" alt=\"Inference Eating 70% of Your Budget? \u2018Elastic Auto-Scaling + Spot Instances\u2019 on the \u661f\u5b87\u667a\u7b97 Platform Cut AI Application Costs by a Further 55%\" style=\"display:block; margin:10px auto; max-width:100%; height:auto;\" \/><\/figure>\n<blockquote>\n<p>\u201cOver the past three years, the cost of training large models has fallen by 70%, while the cost of inference has tripled.\u201d<br \/>\nGartner, \u201c2024 China AI Infrastructure Report\u201d<\/p>\n<\/blockquote>\n<p>While the industry is still competing over who has the highest parameter count, CFOs have already noticed that an A100 may run for 30 days in training but then work around the clock for 365 days in inference; once an Agent goes live, its call volume climbs exponentially and a budget black hole opens with it. One leading 
SaaS vendor disclosed that its customer-service Agent hit a peak of 3,000 QPS, and GPU rental alone consumed 72% of its annual AI budget. \u201cInference eats 70% of the budget\u201d is no longer alarmism; it is the reality facing every team that wants to put a model into production.<\/p>\n<h2>Elastic Auto-Scaling + Spot Instances: Shaving \u201cPeaks\u201d Down to \u201cValleys\u201d<\/h2>\n<p>After supporting more than 200 AI projects on the front line, the \u661f\u5b87\u667a\u7b97 team produced a finer-grained breakdown of the bill:<br \/>\n&#8211; For a typical 7B chat model, a single A100 can sustain roughly 120 concurrent requests;<br \/>\n&#8211; In a customer-service scenario, the daytime peak lasts 4 hours, and overnight traffic falls to just 1\/10 of that;<br \/>\n&#8211; If GPUs are stockpiled on monthly or annual subscriptions, 80% of the compute sits idle during off-peak hours, which wastes 55% of the spend outright.<\/p>\n<p>The platform therefore couples \u201celastic Auto-Scaling\u201d natively with \u201cSpot instances\u201d:<br \/>\n1. A Serverless framework built on Knative\/KServe, with user-defined QPS thresholds and Pods spun up within seconds;<br \/>\n2. Spot instances priced as low as 4% of list price, scheduled alongside steady-state instances to top up compute at peak and release it immediately off-peak;<br \/>\n3. 
Zero data loss on the data plane: snapshots are written automatically to object storage, and live migration completes within the 30 seconds before an instance is reclaimed.<\/p>\n<p>In short, <strong>get the agility of Serverless while locking in low-cost GPU server rental<\/strong>.<\/p>\n<h2>Real-World Case: How a Customer-Service Agent Saved 55% on Costs<\/h2>\n<p>Customer background: a China-based B2B e-commerce platform running a self-developed 13B customer-service model, with a daily peak of 3,000 QPS and a trough of 200 QPS.<br \/>\nOriginal setup: 40 A100s on monthly subscription, with a monthly bill of 280,000 yuan.<\/p>\n<p>The \u661f\u5b87\u667a\u7b97 hybrid setup:<br \/>\n&#8211; A steady-state baseline of 8 A100s on monthly subscription, used as the hot-data cache;<br \/>\n&#8211; During peak hours, Auto-Scaling brings up 32 Spot instances at an average unit price of 5% of list price;<br \/>\n&#8211; During off-peak hours, capacity scales back to 8 GPUs, and nightly snapshots are automatically transferred to <a href=\"https:\/\/www.starverse-ai.com\/node\/019b88aa-2fc4-790b-97e1-fdff4da0e8a6\">cloud disks<\/a>.<\/p>\n<p>Results after 30 days in production:<br \/>\n&#8211; Total GPU spend of 142,000 yuan, a saving of 55%;<br \/>\n&#8211; P99 latency held steady at 380 ms, with no data loss;<br \/>\n&#8211; Operations staffing fell from 3 people to 0.5, with everything handled self-service through the console.<\/p>\n<p>The customer\u2019s CFO put it bluntly: \u201cSame model quality, <strong>GPU cloud host<\/strong> costs cut in half, and the board immediately granted the AI 
team an extra 2 million yuan in budget for new features.\u201d<\/p>\n<h2>One-Click Deployment: It Saves More Than Money, It Saves Your Sanity<\/h2>\n<p>Many developers worry that Serverless has a steep learning curve, so \u661f\u5b87\u667a\u7b97 has packaged Knative\/KServe into a \u201cone-click template\u201d:<br \/>\n&#8211; Choose a model \u2192 set a QPS threshold \u2192 click deploy, and a reachable HTTPS endpoint is generated in 3 minutes;<br \/>\n&#8211; Mainstream images are built in (PyTorch 2.2, TensorRT-LLM, vLLM), and <a href=\"https:\/\/www.starverse-ai.com\/node\/019b88ac-286a-70a3-bafa-cfa47c851b4d\">models and datasets<\/a> are drag-and-drop ready;<br \/>\n&#8211; Canary releases and A\/B testing are supported, and rollbacks likewise complete within seconds.<\/p>\n<p>This means algorithm engineers no longer have to get up in the middle of the night to add GPUs by hand and can <strong>put their energy back into AI application innovation itself<\/strong>.<\/p>\n<h2>Only When Data Is Safe Can You Use Spot with Confidence<\/h2>\n<p>The biggest worry with Spot instances is that they can be \u201creclaimed at any time\u201d. The fallback strategy from \u661f\u5b87\u667a\u7b97 is:<br \/>\n1. Automatic snapshots every 30 seconds, written incrementally to <a href=\"https:\/\/www.starverse-ai.com\/node\/019b88ac-0730-7451-a8ab-9c3c873fef42\">cloud storage<\/a>, with the final block of data synced before reclamation;<br \/>\n2. Multi-AZ redundancy with cross-zone snapshot replication, RPO &lt; 30 seconds;<br \/>\n3. 
On rescheduling, the new instance restores directly from the snapshot and resumes where it left off.<\/p>\n<p>In a test of 500 random reclamations, the application layer saw no impact, and the average cold-start time was 18 seconds.<\/p>\n<h2>Bonus: A Cost Calculator Template, So You Can Calculate Before You Buy<\/h2>\n<p>Want a quick estimate of what your model will actually cost per month? \u661f\u5b87\u667a\u7b97 has published an Excel cost calculator:<br \/>\n&#8211; Enter QPS, model size, and context length to get the steady-state + Spot hybrid cost automatically;<br \/>\n&#8211; Compare three modes: monthly subscription, on-demand, and pure Spot;<br \/>\n&#8211; Generate a PDF quote in one click for finance approval.<\/p>\n<p>Follow the \u201c\u661f\u5b87\u667a\u7b97\u201d WeChat official account and reply \u201c\u6210\u672c\u201d (\u201ccost\u201d) to download it. <strong>New users also receive a 10 yuan trial credit<\/strong>, which can be applied directly to GPU server rental fees.<\/p>\n<h2>Final Thoughts<\/h2>\n<p>When \u201ctraining\u201d is only the opening act and \u201cinference\u201d is the daily routine, whoever makes the fullest use of elastic compute gets to spend their budget on real innovation. With a Spot bill at 4% of list price, \u661f\u5b87\u667a\u7b97 is telling the industry:<br \/>\n<strong>The model is not too expensive; the compute was just chosen wrong.<\/strong><\/p>\n<p>Visit <a 
href=\"https:\/\/www.starverse-ai.com\">https:\/\/www.starverse-ai.com<\/a> now to experience the 55% cost reduction delivered by elastic Auto-Scaling and Spot instances, and make your AI application run faster, cheaper, and more reliably.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u201cOver the past three years, the cost of training large models has fallen by 70%, while the cost of inference has tripled.\u201d &hellip;<\/p>\n","protected":false},"author":2,"featured_media":2304,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-2305","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-zixun"],"views":35,"_links":{"self":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts\/2305","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/comments?post=2305"}],"version-history":[{"count":0,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts\/2305\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/media\/2304"}],"wp:attachment":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/media?parent=2305"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/categories?post=2305"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/tags?post=2305"}],"curies":[{"name":"
wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}