{"id":4850,"date":"2025-03-27T22:16:59","date_gmt":"2025-03-27T13:16:59","guid":{"rendered":"https:\/\/aireviewirush.com\/?p=4850"},"modified":"2025-03-27T22:16:59","modified_gmt":"2025-03-27T13:16:59","slug":"accelerating-agentic-workflows-with-azure-ai-foundry-nvidia-nim-and-nvidia-agentiq","status":"publish","type":"post","link":"https:\/\/aireviewirush.com\/?p=4850","title":{"rendered":"Accelerating agentic workflows with Azure AI Foundry, NVIDIA NIM, and NVIDIA AgentIQ"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p>\n\t\t\tIn collaboration with Microsoft and NVIDIA, we have built-in NVIDIA NIM microservices and NVIDIA AgentIQ toolkit into Azure AI Foundry\u2014unlocking unprecedented effectivity, efficiency, and value optimization on your AI initiatives.\u00a0\t\t<\/p>\n<p class=\"wp-block-paragraph\">I\u2019m excited to share a significant leap ahead in how we develop and deploy AI. In collaboration with NVIDIA, we\u2019ve built-in NVIDIA NIM microservices and NVIDIA AgentIQ toolkit into <a href=\"https:\/\/azure.microsoft.com\/en-us\/products\/ai-foundry\" target=\"_blank\" rel=\"noreferrer noopener\">Azure AI Foundry<\/a>\u2014unlocking unprecedented effectivity, efficiency, and value optimization on your AI initiatives.\u00a0<\/p>\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\">\n<p>\n<iframe loading=\"lazy\" title=\"NVIDIA NIM on Azure AI Foundry\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube-nocookie.com\/embed\/JAj5f-s4-x4?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen=\"\"><\/iframe>\n<\/p>\n<\/figure>\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_53 counter-hierarchy ez-toc-counter ez-toc-grey ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title \" >Table of Contents<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"Toggle Table of Content\" role=\"button\"><label for=\"item-6a27d90d39381\" ><span class=\"\"><span style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input aria-label=\"Toggle\" aria-label=\"item-6a27d90d39381\"  type=\"checkbox\" id=\"item-6a27d90d39381\"><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/aireviewirush.com\/?p=4850\/#A_brand_new_period_of_AI_effectivity\" title=\"A brand new period of AI effectivity\u00a0\">A brand new period of AI effectivity\u00a0<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/aireviewirush.com\/?p=4850\/#NVIDIA_NIM_on_Azure_AI_Foundry\" title=\"NVIDIA NIM on Azure AI Foundry\u00a0\">NVIDIA NIM on Azure AI Foundry\u00a0<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/aireviewirush.com\/?p=4850\/#Optimizing_efficiency_with_NVIDIA_AgentIQ\" title=\"Optimizing efficiency with NVIDIA AgentIQ\u00a0\">Optimizing efficiency with NVIDIA AgentIQ\u00a0<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/aireviewirush.com\/?p=4850\/#Actual-world_impression\" title=\"Actual-world impression\u00a0\">Actual-world impression\u00a0<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/aireviewirush.com\/?p=4850\/#Unlock_AI-powered_innovation\" title=\"Unlock AI-powered innovation\u00a0\">Unlock AI-powered innovation\u00a0<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/aireviewirush.com\/?p=4850\/#Able_to_speed_up_your_AI_journey\" title=\"Able to speed up your AI journey?\u00a0\">Able to speed up your AI journey?\u00a0<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/aireviewirush.com\/?p=4850\/#Azure_AI_Foundry\" title=\"Azure AI Foundry\">Azure AI Foundry<\/a><\/li><\/ul><\/nav><\/div>\n<h2 class=\"wp-block-heading\" id=\"a-new-era-of-ai-efficiency\"><span class=\"ez-toc-section\" id=\"A_brand_new_period_of_AI_effectivity\"><\/span>A brand new period of AI effectivity\u00a0<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">In right now\u2019s fast-paced digital panorama, scaling AI purposes calls for extra than simply innovation\u2014it requires streamlined processes that ship speedy time-to-market with out compromising on efficiency. With enterprise AI initiatives typically taking 9 to 12 months to maneuver from conception to manufacturing, each effectivity achieve counts. Our integration is designed to vary that by simplifying each step of the AI growth lifecycle.\u00a0<\/p>\n<h2 class=\"wp-block-heading\" id=\"nvidia-nim-on-azure-ai-foundry\"><span class=\"ez-toc-section\" id=\"NVIDIA_NIM_on_Azure_AI_Foundry\"><\/span>NVIDIA NIM on Azure AI Foundry\u00a0<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">NVIDIA NIM\u2122, a part of the NVIDIA AI Enterprise software program suite, is a collection of easy-to-use microservices engineered for safe, dependable, and high-performance AI inferencing. Leveraging strong applied sciences akin to NVIDIA Triton Inference Server\u2122, TensorRT\u2122, TensorRT-LLM, and PyTorch, NIM microservices are constructed to scale seamlessly on managed Azure compute.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">They supply:\u00a0<\/p>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Zero-configuration deployment:<\/strong> Rise up and working rapidly with out-of-the-box optimization.\u00a0<\/li>\n<\/ul>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Seamless Azure integration:<\/strong> Works effortlessly with Azure AI Agent Service and Semantic Kernel.\u00a0<\/li>\n<\/ul>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Enterprise-grade reliability:<\/strong> Profit from NVIDIA AI Enterprise help for steady efficiency and safety.\u00a0<\/li>\n<\/ul>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Scalable inference:<\/strong> Faucet into Azure\u2019s NVIDIA accelerated infrastructure for demanding workloads.\u00a0<\/li>\n<\/ul>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Optimized workflows:<\/strong> Speed up purposes starting from giant language fashions to superior analytics.\u00a0<\/li>\n<\/ul>\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1280\" height=\"720\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/03\/NIM1.gif\" alt=\"A screenshot of a computer\" class=\"wp-image-39322\"\/><\/figure>\n<p class=\"wp-block-paragraph\">Deploying these providers is straightforward. With just some clicks\u2014whether or not deciding on fashions just like the Llama-3.3-70B-NIM or others from the mannequin catalog in Azure AI Foundry\u2014you may combine them instantly into your AI workflows and begin constructing generative AI purposes that work flawlessly throughout the Azure ecosystem.\u00a0<\/p>\n<h2 class=\"wp-block-heading\" id=\"optimizing-performance-with-nvidia-agentiq\"><span class=\"ez-toc-section\" id=\"Optimizing_efficiency_with_NVIDIA_AgentIQ\"><\/span>Optimizing efficiency with NVIDIA AgentIQ\u00a0<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">As soon as your NVIDIA NIM microservices are deployed, NVIDIA AgentIQ takes middle stage. This <a href=\"https:\/\/github.com\/microsoft\/semantic-kernel\/tree\/main\/python\/semantic_kernel\/connectors\/ai\/nvidia\" target=\"_blank\" rel=\"noreferrer noopener\">open-source toolkit<\/a> is designed to seamlessly join, profile, and optimize groups of AI brokers, allows your programs to run at peak efficiency. AgentIQ delivers:\u00a0<\/p>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Profiling and optimization:<\/strong> Leverage real-time telemetry to fine-tune AI agent placement, lowering latency and compute overhead.\u00a0<\/li>\n<\/ul>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Dynamic inference enhancements:<\/strong> Constantly acquire and analyze metadata\u2014akin to predicted output tokens per name, estimated time to subsequent inference, and anticipated token lengths\u2014to dynamically enhance agent efficiency.\u00a0<\/li>\n<\/ul>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Integration with Semantic Kernel:<\/strong> Direct integration with Azure AI Foundry Agent Service additional empowers your brokers with enhanced semantic reasoning and process execution capabilities.\u00a0<\/li>\n<\/ul>\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" alt=\"Image showing how NVIDIA NIM Models and Azure AI Agent Service can be used together to enable agentic apps. \" class=\"wp-image-39347 webp-format\" srcset=\"\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2025\/03\/Blog-Diagram.webp\"\/><\/figure>\n<p class=\"wp-block-paragraph\">This clever profiling not solely reduces compute prices but in addition boosts accuracy and responsiveness, so that each a part of your agentic AI workflow is optimized for achievement.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">As well as, we are going to quickly be integrating the NVIDIA Llama Nemotron Cause open reasoning mannequin. NVIDIA Llama Nemotron Cause is a strong AI mannequin household designed for superior reasoning. In accordance\u00a0to NVIDIA, Nemotron excels at coding, advanced math, and scientific reasoning whereas understanding person intent and seamlessly calling instruments like search and translations to perform duties.<\/p>\n<h2 class=\"wp-block-heading\" id=\"real-world-impact\"><span class=\"ez-toc-section\" id=\"Actual-world_impression\"><\/span>Actual-world impression\u00a0<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">Trade leaders are already witnessing the advantages of those improvements. <\/p>\n<p class=\"wp-block-paragraph\">Drew McCombs, Vice President, Cloud and Analytics at Epic, famous:\u00a0<\/p>\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-large-font-size wp-block-paragraph\">The launch of NVIDIA NIM microservices in Azure AI Foundry affords a safe and environment friendly manner for Epic to deploy open-source generative AI fashions that enhance affected person care, increase clinician and operational effectivity, and uncover new insights to drive medical innovation. In collaboration with UW Well being and UC San Diego Well being, we\u2019re additionally researching strategies to guage scientific summaries with these superior fashions. Collectively, we\u2019re utilizing the newest AI expertise in ways in which really enhance the lives of clinicians and sufferers.<\/p>\n<\/blockquote>\n<p class=\"wp-block-paragraph\">Epic\u2019s expertise underscores how our built-in answer can drive transformational change\u2014not simply in healthcare however throughout each {industry} the place high-performance AI is a sport changer.\u00a0As famous by Jon Sigler, EVP, Platform and AI at ServiceNow:<\/p>\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"has-large-font-size wp-block-paragraph\">This mix of ServiceNow\u2019s AI platform with NVIDIA NIM and Microsoft Azure AI Foundry and Azure AI Agent Service helps us deliver to market industry-specific, out-of-the-box AI brokers, delivering full-stack agentic AI options to assist resolve issues sooner, ship nice buyer experiences, and speed up enhancements in organizations\u2019 productiveness and effectivity.<\/p>\n<\/blockquote>\n<h2 class=\"wp-block-heading\" id=\"unlock-ai-powered-innovation\"><span class=\"ez-toc-section\" id=\"Unlock_AI-powered_innovation\"><\/span>Unlock AI-powered innovation\u00a0<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">By combining the strong deployment capabilities of NVIDIA NIM with the dynamic optimization of NVIDIA AgentIQ, Azure AI Foundry offers a turnkey answer for constructing, deploying, and scaling enterprise-grade agentic purposes. This integration can speed up AI deployments, improve agentic workflows, and cut back infrastructure prices\u2014enabling you to deal with what really issues: driving innovation.\u00a0<\/p>\n<h2 class=\"wp-block-heading\" id=\"ready-to-accelerate-your-ai-journey\"><span class=\"ez-toc-section\" id=\"Able_to_speed_up_your_AI_journey\"><\/span>Able to speed up your AI journey?\u00a0<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">Deploy NVIDIA NIM microservices and <a href=\"https:\/\/github.com\/microsoft\/semantic-kernel\/tree\/main\/python\/semantic_kernel\/connectors\/ai\/nvidia\" target=\"_blank\" rel=\"noreferrer noopener\">optimize your AI brokers<\/a> with NVIDIA AgentIQ toolkit on <a href=\"https:\/\/azure.microsoft.com\/en-us\/products\/ai-foundry\" target=\"_blank\" rel=\"noreferrer noopener\">Azure AI Foundry<\/a>. Discover extra concerning the Azure AI Foundry <a href=\"https:\/\/ai.azure.com\/explore\/models?tid=72f988bf-86f1-41af-91ab-2d7cd011db47\" target=\"_blank\" rel=\"noreferrer noopener\">mannequin catalog<\/a>.<\/p>\n<p class=\"wp-block-paragraph\">Let\u2019s construct a wiser, sooner, and extra environment friendly future collectively.\u00a0<\/p>\n<aside class=\"cta-block cta-block--align-left cta-block--has-image wp-block-msx-cta\" data-bi-an=\"CTA Block\">\n<div class=\"cta-block__content\">\n<div class=\"cta-block__image-container\">\n\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"683\" src=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/12\/Picture1-1024x683.jpg\" class=\"cta-block__image\" alt=\"Two people sitting with a laptop\" srcset=\"https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/12\/Picture1-1024x683.jpg 1024w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/12\/Picture1-300x200.jpg 300w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/12\/Picture1-768x512.jpg 768w, https:\/\/azure.microsoft.com\/en-us\/blog\/wp-content\/uploads\/2024\/12\/Picture1.jpg 1430w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\"\/>\t\t\t<\/div>\n<div class=\"cta-block__body\">\n<h2 class=\"cta-block__headline\"><span class=\"ez-toc-section\" id=\"Azure_AI_Foundry\"><\/span>Azure AI Foundry<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"cta-block__text\">Design, customise, and handle AI apps and brokers at scale.<\/p>\n<\/p><\/div>\n<\/p><\/div>\n<\/aside><\/div>\n<p><script>\n\t\tfunction facebookTracking() {\n\t\t\t!function(f,b,e,v,n,t,s){if(f.fbq)return;n=f.fbq=function(){n.callMethod?\n\t\t\t\tn.callMethod.apply(n,arguments):n.queue.push(arguments)};if(!f._fbq)f._fbq=n;\n\t\t\t\tn.push=n;n.loaded=!0;n.version='2.0';n.queue=[];t=b.createElement(e);t.async=!0;\n\t\t\t\tt.src=v;t.type=\"ms-delay-type\";t.setAttribute('data-ms-type','text\/javascript');\n\t\t\t\ts=b.getElementsByTagName(e)[0];s.parentNode.insertBefore(t,s)}(window,\n\t\t\t\tdocument,'script','https:\/\/connect.facebook.net\/en_US\/fbevents.js');\n\t\t\tfbq('init', '1770559986549030');\n\t\t\t\t\t\tfbq('track', 'PageView');\n\t\t\t\t\t}\n\t<\/script><br \/>\n<br \/><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In collaboration with Microsoft and NVIDIA, we have built-in NVIDIA NIM microservices and NVIDIA AgentIQ toolkit into Azure AI Foundry\u2014unlocking unprecedented effectivity, efficiency, and value optimization on your AI initiatives.\u00a0 I\u2019m excited to share a significant leap ahead in how we develop and deploy AI. In collaboration with NVIDIA, we\u2019ve built-in NVIDIA NIM microservices and [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":4852,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[],"class_list":["post-4850","post","type-post","status-publish","format-standard","has-post-thumbnail","category-cloud-computing"],"_links":{"self":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts\/4850","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=4850"}],"version-history":[{"count":1,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts\/4850\/revisions"}],"predecessor-version":[{"id":4851,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts\/4850\/revisions\/4851"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/media\/4852"}],"wp:attachment":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=4850"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=4850"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=4850"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}