{"id":11973,"date":"2025-08-06T18:16:29","date_gmt":"2025-08-06T09:16:29","guid":{"rendered":"https:\/\/aireviewirush.com\/?p=11973"},"modified":"2025-08-06T18:16:29","modified_gmt":"2025-08-06T09:16:29","slug":"openai-open-weight-fashions-now-out-there-on-aws","status":"publish","type":"post","link":"https:\/\/aireviewirush.com\/?p=11973","title":{"rendered":"OpenAI open weight fashions now out there on AWS"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div id=\"\">\n<table id=\"amazon-polly-audio-table\">\n<tbody>\n<tr>\n<td id=\"amazon-polly-audio-tab\">\n<div id=\"amazon-polly-by-tab\">\n            <a href=\"https:\/\/aws.amazon.com\/polly\/\" target=\"_blank\" rel=\"noopener noreferrer\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/a0.awsstatic.com\/aws-blog\/images\/Voiced_by_Amazon_Polly_EN.png\" alt=\"Voiced by Polly\" width=\"554\" height=\"56\"\/><\/a>\n           <\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>AWS is dedicated to bringing you essentially the most superior <a href=\"https:\/\/aws.amazon.com\/what-is\/foundation-models\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">basis fashions (FMs)<\/a> within the business, repeatedly increasing our choice to incorporate groundbreaking fashions from main AI innovators so that you just all the time have entry to the newest developments to drive what you are promoting ahead.<\/p>\n<p>Right now, I&#8217;m blissful to announce the provision of two new <a href=\"https:\/\/aws.amazon.com\/bedrock\/openai\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">OpenAI fashions with open weights<\/a> in <a href=\"https:\/\/aws.amazon.com\/bedrock\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon Bedrock<\/a> and <a href=\"https:\/\/aws.amazon.com\/sagemaker-ai\/jumpstart\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon SageMaker JumpStart<\/a>. OpenAI <a href=\"https:\/\/openai.com\/index\/introducing-gpt-oss\" target=\"_blank\" rel=\"noopener\"><strong>gpt-oss-120b<\/strong> and <strong>gpt-oss-20b<\/strong><\/a>\u00a0fashions are designed for textual content technology and reasoning duties, providing builders and organizations new choices to construct AI purposes with full management over their infrastructure and information.<\/p>\n<p>These open weight fashions excel at coding, scientific evaluation, and mathematical reasoning, with efficiency similar to main alternate options. Each fashions assist a 128K context window and supply adjustable reasoning ranges (low\/medium\/excessive) to match your particular use case necessities. The fashions assist exterior instruments to boost their capabilities and can be utilized in an agentic workflow, for instance, utilizing a framework like <a href=\"https:\/\/strandsagents.com\/\" target=\"_blank\" rel=\"noopener\">Strands Brokers<\/a>.<\/p>\n<p>With Amazon Bedrock and Amazon SageMaker JumpStart, AWS provides you the liberty to innovate with entry to tons of of FMs from main AI firms, together with OpenAI open weight fashions. With our complete collection of fashions, you&#8217;ll be able to match your AI workloads to the proper mannequin each time.<\/p>\n<p>By way of Amazon Bedrock, you&#8217;ll be able to seamlessly experiment with totally different fashions, combine and match capabilities, and swap between suppliers with out rewriting code\u2014turning <a href=\"https:\/\/aws.amazon.com\/bedrock\/model-choice\/\" target=\"_blank\" rel=\"noopener\">mannequin alternative<\/a> right into a strategic benefit that helps you repeatedly evolve your AI technique as new improvements emerge. These new fashions can be found in Bedrock by way of an OpenAI-compatible endpoint. You&#8217;ll be able to level the OpenAI SDK to this endpoint or use the Bedrock <a href=\"https:\/\/docs.aws.amazon.com\/bedrock\/latest\/APIReference\/API_runtime_InvokeModel.html\" target=\"_blank\" rel=\"noopener\">InvokeModel<\/a> and <a href=\"https:\/\/docs.aws.amazon.com\/bedrock\/latest\/APIReference\/API_runtime_Converse.html\" target=\"_blank\" rel=\"noopener\">Converse API<\/a>.<\/p>\n<p>With SageMaker JumpStart, you&#8217;ll be able to shortly consider, examine, and customise fashions to your use case. You&#8217;ll be able to then deploy the unique or the custom-made mannequin in manufacturing with the SageMaker AI console or utilizing the <a href=\"https:\/\/docs.aws.amazon.com\/sagemaker\/latest\/dg\/jumpstart-foundation-models-use-python-sdk.html\" target=\"_blank\" rel=\"noopener\">SageMaker Python SDK<\/a>.<\/p>\n<p>Let\u2019s see how these work in follow.<\/p>\n<p><span style=\"text-decoration: underline\"><strong>Getting began with OpenAI open weight fashions in Amazon Bedrock<br \/><\/strong><\/span>Within the <a href=\"https:\/\/console.aws.amazon.com\/bedrock\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon Bedrock console<\/a>, I select <strong>Mannequin entry<\/strong> from the <strong>Configure and study<\/strong> part of the navigation pane. Then, I navigate to the 2 listed OpenAI fashions on this web page and request entry.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-98753\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/da4b9237bacccdf19c0760cab7aec4a8359010b0\/2025\/08\/05\/openai-gpt-model-access.png\" alt=\"Console screenshot\" width=\"1064\" height=\"366\"\/><\/p>\n<p>Now that I&#8217;ve entry, I exploit the <strong>Chat\/Check<\/strong> playground to check and consider the fashions. I choose <strong>OpenAI<\/strong> because the class after which the <strong>gpt-oss-120b<\/strong> mannequin.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-98754\" style=\"width: 90%\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/da4b9237bacccdf19c0760cab7aec4a8359010b0\/2025\/08\/05\/openai-gpt-model-selection.png\" alt=\"Console screenshot\" width=\"845\" height=\"707\"\/><\/p>\n<p>Utilizing this mannequin, I run the next pattern immediate:<\/p>\n<p><em>A household has $5,000 to avoid wasting for his or her trip subsequent yr. They&#8217;ll place the cash in a financial savings account incomes 2% curiosity yearly or in a certificates of deposit incomes 4% curiosity yearly however with no entry to the funds till the holiday. In the event that they want $1,000 for emergency bills throughout the yr, how ought to they divide their cash between the 2 choices to maximise their trip fund?<\/em><\/p>\n<p>This immediate generates an output that features the chain of thought used to provide the end result.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter wp-image-98776 size-full\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/da4b9237bacccdf19c0760cab7aec4a8359010b0\/2025\/08\/06\/2025-openai-models-bedrock-chat-playground-1.jpg\" alt=\"\" width=\"1064\" height=\"536\"><\/p>\n<p>I can use these fashions with the OpenAI SDK by configuring the API endpoint (base URL) and utilizing an <a href=\"https:\/\/docs.aws.amazon.com\/bedrock\/latest\/userguide\/api-keys.html?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon Bedrock API key<\/a> for authentication. For instance, I set this atmosphere variables to make use of the US West (Oregon) AWS Area endpoint (<code>us-west-2<\/code>) and my Amazon Bedrock API key:<\/p>\n<pre><code class=\"lang-bash\">export OPENAI_API_KEY=\"&lt;my-bedrock-api-key&gt;\"\nexport OPENAI_BASE_URL=\"https:\/\/bedrock-runtime.us-west-2.amazonaws.com\/openai\/v1\"<\/code><\/pre>\n<p>Now I invoke the mannequin utilizing the OpenAI Python SDK.<\/p>\n<pre><code class=\"lang-python\">from openai import OpenAI\n\nshopper = OpenAI()\n\nresponse = shopper.chat.completions.create( \n    messages=[{ \"role\": \"user\", \"content\": \"Tell me the square root of 42 ^ 3\" }],\n    mannequin=\"openai.gpt-oss-120b-1:0\",\n    stream=False\n)\n\nfor merchandise in response:\n    print(merchandise)<\/code><\/pre>\n<p>I save the code (<code>test-openai.py<\/code> file), set up the dependencies, and run the agent regionally:<\/p>\n<pre><code class=\"lang-bash\">pip set up openai\npython test-openai.py<\/code><\/pre>\n<p>To construct an AI agent, I can select any framework that helps the Amazon Bedrock API or the OpenAI API. For instance, right here\u2019s the beginning code for Strands Brokers utilizing the Amazon Bedrock API:<\/p>\n<pre><code class=\"lang-python\">from strands import Agent\nfrom strands.fashions import BedrockModel\n\nbedrock_model = BedrockModel(\n    model_id=\"openai.gpt-oss-120b-1:0\",\n    region_name=\"us-west-2\",\n    streaming=False\n)\n\nagent = Agent(\n    mannequin=bedrock_model\n)\n\nagent(\"Inform me the sq. root of 42 ^ 3\")<\/code><\/pre>\n<p>I save the code (<code>test-strands.py<\/code> file), set up the dependencies, and run the agent regionally:<\/p>\n<pre><code class=\"lang-bash\">pip set up strands-agents\npython test-strands.py<\/code><\/pre>\n<p>When I&#8217;m happy with the agent, I can deploy in manufacturing utilizing the <a href=\"https:\/\/aws.amazon.com\/blogs\/aws\/introducing-amazon-bedrock-agentcore-securely-deploy-and-operate-ai-agents-at-any-scale\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">capabilities supplied by Amazon Bedrock AgentCore<\/a>, together with a completely managed serverless runtime and reminiscence and identification administration.<\/p>\n<p><span style=\"text-decoration: underline\"><strong>Getting began with OpenAI open weight fashions in Amazon SageMaker JumpStart<br \/><\/strong><\/span>Within the <a href=\"https:\/\/console.aws.amazon.com\/sagemaker\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon SageMaker AI console<\/a>, you need to use OpenAI open weight fashions within the <a href=\"https:\/\/aws.amazon.com\/sagemaker-ai\/studio\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">SageMaker Studio<\/a>. The primary time I do that, I have to arrange a SageMaker area. There are alternatives to set it up for a single person (less complicated) or a corporation. For these checks, I exploit a single person setup.<\/p>\n<p>Within the <strong>SageMaker JumpStart<\/strong> mannequin view, I&#8217;ve entry to an in depth description of the <strong>gpt-oss-120b<\/strong> or<strong> gpt-oss-20b<\/strong> mannequin.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-98774\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/da4b9237bacccdf19c0760cab7aec4a8359010b0\/2025\/08\/06\/2025-openai-models-sagemaker-js-model.jpg\" alt=\"\" width=\"1248\" height=\"560\"><\/p>\n<p>I select the <strong>gpt-oss-20b mannequin<\/strong> after which deploy the mannequin. Within the subsequent steps, I choose the occasion kind and the preliminary occasion rely. After a couple of minutes, the deployment creates an endpoint that I can then invoke in SageMaker Studio and utilizing any <a href=\"https:\/\/aws.amazon.com\/tools\/\" target=\"_blank\" rel=\"noopener\">AWS SDKs<\/a>.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-98771\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/da4b9237bacccdf19c0760cab7aec4a8359010b0\/2025\/08\/06\/2025-openai-models-sagemaker-js.jpg\" alt=\"\" width=\"1440\" height=\"799\"><\/p>\n<p>To study extra, go to <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/gpt-oss-models-from-openai-are-now-available-on-sagemaker-jumpstart\/\" target=\"_blank\" rel=\"noopener\">GPT OSS fashions from OpenAI are actually out there on SageMaker JumpStart<\/a> within the AWS Synthetic Intelligence Weblog.<\/p>\n<p><span style=\"text-decoration: underline\"><strong>Issues to know<br \/><\/strong><\/span>The brand new\u00a0<a href=\"https:\/\/aws.amazon.com\/bedrock\/openai\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">OpenAI open weight fashions<\/a> are actually out there in <a href=\"https:\/\/aws.amazon.com\/bedrock\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon Bedrock<\/a> within the US West (Oregon) <a href=\"https:\/\/aws.amazon.com\/about-aws\/global-infrastructure\/regions_az\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">AWS Area<\/a>, whereas <a href=\"https:\/\/aws.amazon.com\/sagemaker-ai\/jumpstart\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon SageMaker JumpStart<\/a> helps these fashions in US East (Ohio, N. Virginia) and Asia Pacific (Mumbai, Tokyo).<\/p>\n<p>Every mannequin comes outfitted with full chain-of-thought output capabilities, offering you with detailed visibility into the mannequin\u2019s reasoning course of. This transparency is especially beneficial for purposes requiring excessive ranges of interpretability and validation.\u00a0These fashions provide the freedom to switch, adapt, and customise them to your particular wants. This flexibility lets you fine-tune the fashions to your distinctive use circumstances, combine them into your present workflows, and even construct upon them to create new, specialised fashions tailor-made to your business or software.<\/p>\n<p>Safety and security are constructed into the core of those fashions, with complete analysis processes and security measures in place. The fashions preserve compatibility with the usual GPT-4 tokenizer.<\/p>\n<p>Each fashions can be utilized in your most well-liked atmosphere, whether or not that\u2019s by way of the serverless expertise of Amazon Bedrock or the in depth <a href=\"https:\/\/aws.amazon.com\/ai\/machine-learning\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">machine studying (ML)<\/a> improvement capabilities of SageMaker JumpStart. For details about the prices related to utilizing these fashions and providers, go to the <a href=\"https:\/\/aws.amazon.com\/bedrock\/pricing\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon Bedrock pricing<\/a> and <a href=\"https:\/\/aws.amazon.com\/sagemaker-ai\/pricing\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon SageMaker AI pricing<\/a> pages.<\/p>\n<p>To study extra, see the <a href=\"https:\/\/docs.aws.amazon.com\/bedrock\/latest\/userguide\/model-parameters-openai.html\" target=\"_blank\" rel=\"noopener\">parameters for the fashions<\/a> and the <a href=\"https:\/\/docs.aws.amazon.com\/bedrock\/latest\/userguide\/inference-chat-completions.html\" target=\"_blank\" rel=\"noopener\">chat completions API<\/a> within the Amazon Bedrock documentation.<\/p>\n<p>Get began immediately with OpenAI open weight fashions on AWS within the <a href=\"https:\/\/console.aws.amazon.com\/bedrock\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon Bedrock console<\/a> or in <a href=\"https:\/\/console.aws.amazon.com\/sagemaker\/?trk=e61dee65-4ce8-4738-84db-75305c9cd4fe&amp;sc_channel=el\" target=\"_blank\" rel=\"noopener\">Amazon SageMaker AI console<\/a>.<\/p>\n<p>\u2013 <a href=\"https:\/\/x.com\/danilop\">Danilo<\/a><\/p>\n<p>       <!-- '\"` -->\n      <\/div>\n\n","protected":false},"excerpt":{"rendered":"<p>AWS is dedicated to bringing you essentially the most superior basis fashions (FMs) within the business, repeatedly increasing our choice to incorporate groundbreaking fashions from main AI innovators so that you just all the time have entry to the newest developments to drive what you are promoting ahead. Right now, I&#8217;m blissful to announce the [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":11975,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[],"class_list":["post-11973","post","type-post","status-publish","format-standard","has-post-thumbnail","category-cloud-computing"],"_links":{"self":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts\/11973","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=11973"}],"version-history":[{"count":1,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts\/11973\/revisions"}],"predecessor-version":[{"id":11974,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts\/11973\/revisions\/11974"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/media\/11975"}],"wp:attachment":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=11973"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=11973"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=11973"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}