{"id":2853,"date":"2026-03-08T10:07:33","date_gmt":"2026-03-08T02:07:33","guid":{"rendered":"https:\/\/www.starverse-ai.com\/guide\/archives\/2853"},"modified":"2026-03-08T10:07:33","modified_gmt":"2026-03-08T02:07:33","slug":"gpt-5-3-garlic-40%e4%b8%87token%e9%95%bf%e6%96%87%e6%a1%a3%e6%80%bb%e7%bb%93%e5%ae%9e%e6%b5%8b%ef%bc%9a%e6%98%9f%e5%ae%87%e6%99%ba%e7%ae%972xa100-80g%e4%b8%80%e5%b0%8f%e6%97%b6%e8%b7%91%e5%ae%8c","status":"publish","type":"post","link":"https:\/\/www.starverse-ai.com\/guide\/archives\/2853","title":{"rendered":"GPT-5.3 Garlic 40\u4e07Token\u957f\u6587\u6863\u603b\u7ed3\u5b9e\u6d4b\uff1a\u661f\u5b87\u667a\u7b972\u00d7A100 80G\u4e00\u5c0f\u65f6\u8dd1\u5b8c"},"content":{"rendered":"<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/www.starverse-ai.com\/guide\/wp-content\/uploads\/2026\/03\/1772935652_cac211.png\" alt=\"GPT-5.3 Garlic 40\u4e07Token\u957f\u6587\u6863\u603b\u7ed3\u5b9e\u6d4b\uff1a\u661f\u5b87\u667a\u7b972\u00d7A100 80G\u4e00\u5c0f\u65f6\u8dd1\u5b8c\" style=\"display:block; margin:10px auto; max-width:100%; height:auto;\" \/><\/figure>\n<blockquote>\n<p>\u201c\u5f53 GPT-5.3 Garlic \u628a\u4e0a\u4e0b\u6587\u7a97\u53e3\u4e00\u53e3\u6c14\u62c9\u5230 40 \u4e07 Token\uff0cOpenAI \u540c\u65f6\u5ba3\u5e03 3 \u6708\u8d77\u5411\u5168\u91cf API \u7528\u6237\u5f00\u653e\uff0c\u957f\u6587\u6863 RAG \u5e94\u7528\u7ec8\u4e8e\u4ece Demo \u8d70\u5411\u751f\u4ea7\u3002\u201d<br \/>\n\u2014\u2014TechCrunch 3 \u6708 7 \u65e5\u5934\u6761<\/p>\n<\/blockquote>\n<h2>\u70ed\u70b9\uff1a\u957f\u4e0a\u4e0b\u6587\u201c\u6740\u624b\u7ea7\u201d\u5e94\u7528\u6765\u4e86\uff0c\u672c\u5730\u5361\u5374\u5148\u8dea\u4e86<\/h2>\n<p>3 \u6708 14 \u65e5\uff0cGPT-5.3 Garlic \u6b63\u5f0f\u7248\u4e0a\u7ebf\uff0c\u5b98\u65b9\u5ba3\u79f0\u5728 128K \u4ee5\u4e0a\u957f\u5ea6\u573a\u666f\uff0c\u5e7b\u89c9\u7387\u4e0b\u964d 37%\uff0c\u6307\u4ee4\u9075\u5faa\u5ea6\u63d0\u5347 22%\u3002\u91d1\u878d\u3001\u6cd5\u5f8b\u3001\u533b\u7597\u3001\u653f\u5e9c\u516c\u6587\u7b49\u5782\u76f4\u8d5b\u9053\u77ac\u95f4\u6cb8\u817e\u2014\u2014\u53ea\u8981\u628a\u4e00\u6574\u5e74\u8d22\u62a5\u3001\u4e00\u6574\u672c\u76d1\u7ba1\u6761\u4f8b\u3001\u4e00\u6574\u4efd\u4e34\u5e8a\u8bd5\u9a8c\u62a5\u544a\u4e00\u6b21\u6027\u585e\u8fdb\u63d0\u793a\u8bcd\uff0c\u6a21\u578b\u5c31\u80fd\u7ed9\u51fa\u5e26\u5f15\u7528\u9875\u7801\u7684\u6458\u8981\u3002<br \/>\n\u7136\u800c\u5174\u594b\u4e0d\u8fc7\u4e09\u79d2\uff0c\u5f00\u53d1\u8005\u4eec\u5c31\u53d1\u73b0\uff1a\u672c\u5730 24G \u663e\u5b58\u7684 4090 \u8fde\u52a0\u8f7d\u5168\u7cbe\u5ea6 40 \u4e07 Token \u90fd\u62a5\u9519 OOM\uff0c\u66f4\u522b\u8bf4\u8fd8\u8981\u7559\u663e\u5b58\u7ed9 KV-Cache \u548c\u63a8\u7406\u7f13\u51b2\u533a\u3002\u60f3\u8dd1\u901a\u751f\u4ea7\u7ea7\u957f\u6587\u672c\u7ba1\u7ebf\uff0c\u53ea\u80fd\u4e0a\u591a\u5361\u5e76\u884c\uff0c\u53ef\u5355\u5361 80G \u7684 A100 \u73b0\u8d27\u5e02\u4ef7 11 \u4e07\uff0c\u81ea\u5efa\u673a\u623f\u5149\u662f\u7535\u6e90\u6539\u9020\u6210\u672c\u5c31\u8ba9\u4eba\u671b\u800c\u5374\u6b65\u3002<\/p>\n<h2>\u75db\u70b9\uff1a\u663e\u5b58\u3001\u5e26\u5bbd\u3001\u6210\u672c\u4e09\u91cd\u9501\u6b7b<\/h2>\n<ul>\n<li>\u663e\u5b58\u9501\uff1a40 \u4e07 Token \u5168\u7cbe\u5ea6 \u2248 80GB\uff0c\u5355\u5361 48GB \u90fd\u88c5\u4e0d\u4e0b  <\/li>\n<li>\u5e26\u5bbd\u9501\uff1aPCIe \u70b9\u5bf9\u70b9 32GB\/s\uff0c\u8de8\u5361\u540c\u6b65\u68af\u5ea6\u62d6\u6162 47%  <\/li>\n<li>\u6210\u672c\u9501\uff1a\u4f20\u7edf\u4e91\u5382\u5546 8\u00d7A100 80G \u6309\u91cf 5.9 \u5143\/\u5206\u949f\uff0c\u8dd1 10 \u5c0f\u65f6\u5c31\u662f 3 \u4e07+\uff0c\u9879\u76ee\u8fd8\u6ca1\u4e0a\u7ebf\u5148\u70e7\u6389\u4e00\u53f0 Model Y<\/li>\n<\/ul>\n<h2>\u65b9\u6848\uff1a\u661f\u5b87\u667a\u7b97 GPU\u670d\u52a1\u5668\u79df\u7528\uff0cNVLink 2\u00d7A100 80G \u4e00\u5c0f\u65f6\u4e0a\u7ebf<\/h2>\n<p><a href=\"https:\/\/www.starverse-ai.com\">\u661f\u5b87\u667a\u7b97<\/a> \u628a\u4e0a\u8ff0\u4e09\u5ea7\u5927\u5c71\u4e00\u6b21\u6027\u63a8\u5e73\uff1a<br \/>\n&#8211; GPU\u4e91\u4e3b\u673a \u9884\u88c5 CUDA 12.3\u3001PyTorch 2.2\u3001OpenAI \u5b98\u65b9\u63a8\u7406\u955c\u50cf\uff0c\u5f00\u673a\u5373\u5f97 160GB \u663e\u5b58\u6c60<br \/>\n&#8211; 600GB\/s NVLink \u5e26\u5bbd\uff0c\u8ba9\u4e24\u5f20 A100 \u903b\u8f91\u4e0a\u66f4\u50cf\u201c\u4e00\u5f20 160G \u8d85\u5927\u5361\u201d\uff0cAll-Reduce \u5ef6\u8fdf &lt; 2\u03bcs<br \/>\n&#8211; \u6309\u91cf\u8ba1\u8d39 28 \u5143\/\u5c0f\u65f6\uff0c\u6bd4\u5934\u90e8\u4e91\u540c\u89c4\u683c\u4f4e 42%\uff0c\u6ce8\u518c\u5c31\u9001 10 \u5143\u4f53\u9a8c\u91d1\uff0c\u53ef\u8dd1 20 \u5206\u949f\u5b8c\u6574\u6d4b\u8bd5<br \/>\n&#8211; \u63a7\u5236\u53f0\u4e00\u952e\u4e0a\u4f20 PDF\uff0c\u5185\u7f6e <a href=\"https:\/\/www.starverse-ai.com\">\u957f\u6587\u672c RAG \u5957\u4ef6<\/a> \uff1a\u89e3\u6790\u3001\u5207\u7247\u3001Embedding\u3001\u91cd\u6392\u5e8f\u3001\u6458\u8981\u3001\u601d\u7ef4\u5bfc\u56fe\u5168\u81ea\u52a8<\/p>\n<h2>\u5b9e\u6d4b\uff1a20 \u4efd\u6e2f\u80a1\u8d22\u62a5\uff0c60 \u5206\u949f\u751f\u6210\u53ef\u6295\u51b3\u7ea7\u6458\u8981<\/h2>\n<p>\u6d4b\u8bd5\u914d\u7f6e\uff1a<br \/>\n&#8211; \u5b9e\u4f8b\uff1a\u661f\u5b87\u667a\u7b97 GPU\u670d\u52a1\u5668\u79df\u7528 2\u00d7A100 80G<br \/>\n&#8211; \u6570\u636e\u96c6\uff1a20 \u4efd 2023 \u5e74\u5ea6\u6e2f\u80a1\u4e3b\u677f\u516c\u53f8 PDF\uff0c\u5171 38.7 \u4e07 Token<br \/>\n&#8211; \u4efb\u52a1\u94fe\uff1aPDF \u89e3\u6790 \u2192 \u7ed3\u6784\u5316 \u2192 \u5411\u91cf\u7d22\u5f15 \u2192 GPT-5.3 Garlic 40k \u7a97\u53e3\u6ed1\u52a8\u6458\u8981 \u2192 \u601d\u7ef4\u5bfc\u56fe \u2192 \u98ce\u9669\u6807\u7b7e<br \/>\n&#8211; \u8017\u65f6\uff1a Wall time 57 \u5206\u949f\uff0c\u663e\u5b58\u5cf0\u503c 147GB\uff0cNVLink \u5229\u7528\u7387 93%\uff0c\u603b\u82b1\u8d39 28 \u5143  <\/p>\n<p>\u8f93\u51fa\u793a\u4f8b\uff1a<br \/>\n\u201c\u2026\u2026\u817e\u8baf\u97f3\u4e50\u5a31\u4e50 2023 \u5e74\u7248\u6743\u6210\u672c\u540c\u6bd4\u4e0b\u964d 11.4%\uff0c\u5e26\u52a8\u5728\u7ebf\u97f3\u4e50\u670d\u52a1\u6bdb\u5229\u7387\u63d0\u5347 3.2 \u4e2a\u767e\u5206\u70b9\uff1b\u4f46\u793e\u4ea4\u5a31\u4e50 ARPPU \u8fde\u7eed\u4e09\u5b63\u5ea6\u4e0b\u6ed1\uff0c\u9700\u8b66\u60d5\u76f4\u64ad\u6253\u8d4f\u76d1\u7ba1\u98ce\u9669\u3002\u201d<br \/>\n\u540c\u65f6\u751f\u6210\u53ef\u4ea4\u4e92\u601d\u7ef4\u5bfc\u56fe\uff0c\u8282\u70b9\u76f4\u8fbe\u539f\u6587\u9875\u7801\uff0c\u6295\u7814\u540c\u4e8b\u76f4\u63a5\u590d\u5236\u8fdb PPT \u5c31\u80fd\u6c47\u62a5\u3002<\/p>\n<h2>\u6210\u672c\uff1a\u6309\u9700\u4ed8\u8d39\uff0c\u5f39\u6027\u6269\u5bb9\u5230 8 \u5361\u4e5f\u4e0d\u5fc3\u75bc<\/h2>\n<table>\n<thead>\n<tr>\n<th>\u89c4\u683c<\/th>\n<th>\u4f20\u7edf\u4e91<\/th>\n<th>\u661f\u5b87\u667a\u7b97<\/th>\n<th>\u8282\u7701<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>2\u00d7A100 80G<\/td>\n<td>48 \u5143\/\u65f6<\/td>\n<td>28 \u5143\/\u65f6<\/td>\n<td>42 %<\/td>\n<\/tr>\n<tr>\n<td>8\u00d7A100 80G<\/td>\n<td>192 \u5143\/\u65f6<\/td>\n<td>110 \u5143\/\u65f6<\/td>\n<td>43 %<\/td>\n<\/tr>\n<tr>\n<td>\u82e5\u9879\u76ee\u8fdb\u5165\u91cf\u4ea7\uff0c\u53ea\u9700\u5728\u63a7\u5236\u53f0\u70b9\u51fb\u201c\u7eb5\u5411\u6269\u5bb9\u201d\uff0c3 \u5206\u949f\u5b8c\u6210 2 \u5361\u5230 8 \u5361\u70ed\u5347\u7ea7\uff0c\u65e0\u9700\u8fc1\u79fb\u6570\u636e\uff0c\u65e0\u9700\u91cd\u542f\u8bad\u7ec3\u3002<\/td>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td>\u6b64\u5916\uff0c<a href=\"https:\/\/www.starverse-ai.com\">\u661f\u5b87\u667a\u7b97<\/a> \u63d0\u4f9b\u8de8\u5b9e\u4f8b\u5171\u4eab\u7684\u6301\u4e45\u5316\u4e91\u5b58\u50a8\uff0cTB \u7ea7\u5411\u91cf\u5e93\u4e00\u6b21\u4e0a\u4f20\uff0c\u591a\u5361\u591a\u8282\u70b9\u540c\u65f6\u6302\u8f7d\uff0c\u907f\u514d\u91cd\u590d\u4e0b\u8f7d\u6d6a\u8d39\u5e26\u5bbd\u3002<\/td>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>\u5f00\u53d1\u8005\u751f\u6001\uff1a\u6a21\u578b\u3001\u6570\u636e\u3001\u5de5\u5177\u4e00\u7ad9\u5f0f<\/h2>\n<p>\u767b\u5f55\u63a7\u5236\u53f0\u5373\u53ef\u8c03\u7528\uff1a<br \/>\n&#8211; \u516c\u5171\u6a21\u578b\u6c60\uff1aLlama-3-70B\u3001Qwen-72B\u3001ChatGLM3-6B \u5df2\u9884\u88c5\u6743\u91cd<br \/>\n&#8211; \u5f00\u653e\u6570\u636e\u96c6\uff1aCommonCrawl-2024\u3001FinGLUE-zh\u3001\u6cd5\u5f8b\u6761\u6587 230 \u4e07\u6761<br \/>\n&#8211; \u4e00\u952e\u955c\u50cf\uff1aLangChain\u3001LlamaIndex\u3001Dify\u3001FastChat \u5f00\u7bb1\u5373\u7528<br \/>\n&#8211; \u6559\u7a0b\u4e0e\u793e\u533a\uff1a\u5b98\u65b9\u7ef4\u62a4\u201c\u957f\u6587\u672c RAG \u6700\u4f73\u5b9e\u8df5\u201d\u4ee3\u7801\u5e93\uff0cStar \u6570 3.2k\uff0cIssue \u5e73\u5747\u54cd\u5e94 2 \u5c0f\u65f6<\/p>\n<h2>\u7ed3\u8bba\uff1a\u957f\u6587\u672c RAG \u5e94\u7528\u9996\u9009 GPU\u4e91\u4e3b\u673a<\/h2>\n<p>GPT-5.3 Garlic \u628a\u201c\u4e0a\u4e0b\u6587\u5373\u6570\u636e\u201d\u5e26\u5230 40 \u4e07 Token \u7ea7\u522b\uff0c\u53ef\u771f\u6b63\u7684\u74f6\u9888\u4ece\u6765\u4e0d\u662f\u6a21\u578b\uff0c\u800c\u662f\u7b97\u529b\u4e0e\u6210\u672c\u3002\u501f\u52a9 <a href=\"https:\/\/www.starverse-ai.com\">\u661f\u5b87\u667a\u7b97 GPU\u670d\u52a1\u5668\u79df\u7528<\/a> \uff0c\u5f00\u53d1\u8005\u65e0\u9700\u6295\u5165\u767e\u4e07\u7ea7\u786c\u4ef6\uff0c\u5c31\u80fd\u5728\u4e00\u5c0f\u65f6\u5185\u5b8c\u6210\u8fc7\u53bb\u9700\u8981 4 \u5361 48G \u8dd1 6 \u5c0f\u65f6\u7684\u4efb\u52a1\uff1b\u9879\u76ee\u9a8c\u8bc1\u9636\u6bb5\u6309\u91cf\u4ed8\u8d39\uff0c\u4e0a\u7ebf\u540e\u5f39\u6027\u6269\u5bb9\uff0c\u8ba9\u6bcf\u4e00\u5206\u94b1\u90fd\u82b1\u5728\u5200\u5203\u4e0a\u3002<br \/>\n\u73b0\u5728\u6ce8\u518c\u661f\u5b87\u667a\u7b97\uff0c\u65b0\u7528\u6237\u7acb\u5f97 10 \u5143\u4f53\u9a8c\u91d1\uff0c28 \u5143\u5373\u53ef\u8dd1\u6ee1 1 \u5c0f\u65f6 2\u00d7A100 80G \u5b9e\u4f8b\uff0c\u628a 40 \u4e07 Token \u7684\u957f\u6587\u6863 RAG \u5e94\u7528\u771f\u6b63\u642c\u8fdb\u751f\u4ea7\u73af\u5883\u3002<br \/>\n<strong>\u957f\u6587\u672c\u65f6\u4ee3\uff0c\u8c01\u5148\u62a2\u5230\u7b97\u529b\uff0c\u8c01\u5c31\u62a2\u5230\u65f6\u95f4\u7a97\u53e3\u3002<\/strong><\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u201c\u5f53 GPT-5.3 Garlic \u628a\u4e0a\u4e0b\u6587\u7a97\u53e3\u4e00\u53e3\u6c14\u62c9\u5230 &hellip;<\/p>\n","protected":false},"author":2,"featured_media":2852,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-2853","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-zixun"],"views":48,"_links":{"self":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts\/2853","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/comments?post=2853"}],"version-history":[{"count":0,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/posts\/2853\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/media\/2852"}],"wp:attachment":[{"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/media?parent=2853"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/categories?post=2853"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.starverse-ai.com\/guide\/wp-json\/wp\/v2\/tags?post=2853"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}