{"id":26904,"date":"2026-05-15T11:16:16","date_gmt":"2026-05-15T02:16:16","guid":{"rendered":"https:\/\/aireviewirush.com\/?p=26904"},"modified":"2026-05-15T11:16:16","modified_gmt":"2026-05-15T02:16:16","slug":"profiling-device-will-get-duckdb-sooner-experiences-and-extra-ze","status":"publish","type":"post","link":"https:\/\/aireviewirush.com\/?p=26904","title":{"rendered":"Profiling Tool Gets DuckDB, Faster Reports and More Ze\u2026"},"content":{"rendered":"<div>\n<p data-start=\"108\" data-end=\"665\">AMD has released uProf 5.3, updating its profiling tool for developers, HPC users, and administrators. 
According to AMD, the new version has been available since May 12, 2026, and continues to target x86 applications on Windows, Linux, and FreeBSD, with a particular focus on Zen-based processors and Instinct accelerators. At first glance, this sounds like a routine maintenance update, but for practical performance work it is far more relevant than the next slide with theoretical peak values.<\/p>\n<figure id=\"attachment_318556\" aria-describedby=\"caption-attachment-318556\" style=\"width: 980px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-large wp-image-318556\" src=\"https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-980x552.jpg\" alt=\"Illustrative image\" width=\"980\" height=\"552\" srcset=\"https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-980x552.jpg 980w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-300x168.jpg 300w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-768x432.jpg 768w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-1536x864.jpg 1536w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-1320x743.jpg 1320w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-470x264.jpg 470w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-640x360.jpg 640w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-215x120.jpg 215w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-414x232.jpg 414w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-130x73.jpg 130w, 
https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-187x105.jpg 187w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f-990x557.jpg 990w, https:\/\/www.igorslab.de\/wp-content\/uploads\/2026\/05\/270a0ffd-6c34-4882-928f-0ed0a711d40f.jpg 1672w\" sizes=\"auto, (max-width: 980px) 100vw, 980px\"\/><figcaption id=\"caption-attachment-318556\" class=\"wp-caption-text\">Illustrative picture<\/figcaption><\/figure>\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_53 counter-hierarchy ez-toc-counter ez-toc-grey ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title \" >Table of Contents<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"Toggle Table of Content\" role=\"button\"><label for=\"item-6a271a0d9bf2d\" ><span class=\"\"><span style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input aria-label=\"Toggle\" aria-label=\"item-6a271a0d9bf2d\"  type=\"checkbox\" id=\"item-6a271a0d9bf2d\"><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list 
ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/aireviewirush.com\/?p=26904\/#Extra_pace_throughout_profiling_as_an_alternative_of_extra_endurance_whereas_ready\" title=\"More speed during profiling instead of more patience while waiting\">More speed during profiling instead of more patience while waiting<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/aireviewirush.com\/?p=26904\/#Conclusion\" title=\"Conclusion\">Conclusion<\/a><\/li><\/ul><\/nav><\/div>\n<h3 data-section-id=\"1sm840e\" data-start=\"667\" data-end=\"725\"><span class=\"ez-toc-section\" id=\"Extra_pace_throughout_profiling_as_an_alternative_of_extra_endurance_whereas_ready\"><\/span><span style=\"color: #993366;\"><strong>More speed during profiling instead of more patience while waiting<\/strong><\/span><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p data-start=\"727\" data-end=\"1441\">The main focus of uProf 5.3 is on performance and scaling improvements. AMD cites, among other things, faster translation of CPU profiling data for modules with inline functions, significantly lower Python profiling overhead during long runs, and shorter report generation times for large sessions. Particularly noteworthy is the switch of the default backend from SQLite to DuckDB, while SQLite remains available for compatibility reasons. For small individual measurements, this is not a revolution, but for extensive hotspot, threading, OpenMP, or MPI analyses it can be exactly the difference between \u201cquick evaluation\u201d and \u201ccoffee, dinner, doubts about the profession.\u201d Technically, AMD is addressing a problem that is becoming increasingly visible in modern workloads: profiling itself generates significant volumes of data. 
Anyone analyzing many threads, long runtimes, parallel ranks, or mixed CPU\/accelerator workloads is not only measuring compute performance, but also producing a second data pipeline for evaluation. That AMD is focusing on the database, translation, and reporting is therefore consistent. The tool does not become more spectacular, but more usable, and that is usually the better news for developer tools.<\/p>\n<p data-start=\"1991\" data-end=\"2686\">Visualization has also been expanded. According to AMD, uProf 5.3 includes improvements to AMDuProfPCM HTML reports, a Linux timeline visualization for Function Tracing sessions in the GUI, and a per-rank analysis of MPI data. In addition, there are new CLI options, including the ability to assign session names and, under Linux, collect wait-time data for a specific thread. These details may seem minor, but they address typical bottlenecks in server, HPC, and workstation analyses: it is not enough to know that a program is slow; one must also know which thread, which rank distribution, or which waiting phase is causing the problem. On the platform side, AMD adds new metrics for Zen 4 and Zen 5 systems. Mentioned is IBS_[LD,ST]_L1_DTLB_REFILL_LAT, an IBS metric for analyzing TLB-related load and store bottlenecks. This is supplemented by PCIe metrics for Zen 3 server platforms in AMDuProfPCM and a new metric for unused threads. This is particularly relevant for Zen 4 and Zen 5, because high core counts, large caches, and complex memory paths do not automatically make debugging easier. 
More cores do not automatically mean more throughput; sometimes they only mean that more cores are staring at the same bottleneck together.<\/p>\n<p data-start=\"3355\" data-end=\"3969\">Also notable is the focus on virtualized environments. AMD mentions vIBS support for KVM, and under Linux the AMDSystemCheck utility is included, which is intended to collect details about the operating system, BIOS, and platform topology. For cloud, lab, and server environments, this is not a marginal issue, because performance problems there often arise from a combination of hardware, firmware, hypervisor, and operating system. uProf therefore remains not only a desktop tool for local optimization, but is increasingly moving toward production server diagnostics.<\/p>\n<h3 data-section-id=\"x8o1ad\" data-start=\"3971\" data-end=\"3979\"><span class=\"ez-toc-section\" id=\"Conclusion\"><\/span><span style=\"color: #993366;\"><strong>Conclusion<\/strong><\/span><span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p data-start=\"3981\" data-end=\"4573\">uProf 5.3 is not a feature fireworks display for end users, but a sober tool update for people who have to find real bottlenecks. DuckDB as the new default backend, reduced analysis times, improved MPI and OpenMP reports, and new Zen 4 and Zen 5 metrics make the version particularly interesting for larger profiling sessions. The verdict stays grounded, however: a profiler does not make software faster, it merely removes some of the room for excuses. 
For developers on Ryzen, EPYC, and Instinct systems, that is often the most important first step.<\/p>\n<p><span class=\"xfwp-discussion-anchor\" id=\"xfwp-discussion-anchor-318637\" data-xfwp-post-id=\"318637\" aria-hidden=\"true\"\/>        <\/p>\n<aside class=\"xfwp-discussion-box\" id=\"xfwp-discussion-318637\">\n<div class=\"xfwp-discussion-preview\">\n<article class=\"xfwp-forum-post-preview\">\n<p>\n                                    <span>z<\/span>\n                            <\/p>\n<div class=\"xfwp-forum-post-body\">\n<div 
class=\"xfwp-forum-post-message\">\n<p>Note: On Linux, perf(1) (<a href=\"https:\/\/man7.org\/linux\/man-pages\/man1\/perf.1.html\" target=\"_blank\" class=\"link link--external\" rel=\"nofollow ugc noopener\">https:\/\/man7.org\/linux\/man-pages\/man1\/perf.1.html<\/a>) offers similar functionality, ships by default with most distributions, and works the same way across different CPU architectures. Of course, the tool cannot emulate counters that do not exist on a given architecture.<br \/>And without some brainpower of your own, the data can only be interpreted to a limited extent.<\/p>\n<p>For inspiration: <a href=\"https:\/\/chipsandcheese.com\/\" target=\"_blank\" class=\"link link--external\" rel=\"nofollow ugc noopener\">https:\/\/chipsandcheese.com\/<\/a> uses it for very interesting analyses and interpretations.<\/p>\n<\/div><\/div>\n<\/article><\/div>\n<\/aside><\/div>\n\n","protected":false},"excerpt":{"rendered":"<p>AMD has released uProf 5.3, updating its profiling tool for developers, HPC users, and administrators. 
According to AMD, the new version has been available since May 12, 2026, and continues to target x86 applications [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":26906,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[6],"tags":[],"class_list":["post-26904","post","type-post","status-publish","format-standard","has-post-thumbnail","category-computer-components"],"_links":{"self":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts\/26904","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=26904"}],"version-history":[{"count":1,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts\/26904\/revisions"}],"predecessor-version":[{"id":26905,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/posts\/26904\/revisions\/26905"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=\/wp\/v2\/media\/26906"}],"wp:attachment":[{"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=26904"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=26904"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/aireviewirush.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=26904"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}