{"id":3889,"date":"2026-05-01T05:11:51","date_gmt":"2026-05-01T04:11:51","guid":{"rendered":"https:\/\/blogs.qub.ac.uk\/dipsa\/?p=3889"},"modified":"2026-05-01T05:12:39","modified_gmt":"2026-05-01T04:12:39","slug":"user-behaviour-on-kelvin2-hpc","status":"publish","type":"post","link":"https:\/\/blogs.qub.ac.uk\/dipsa\/user-behaviour-on-kelvin2-hpc\/","title":{"rendered":"User behaviour on Kelvin2 HPC"},"content":{"rendered":"\n<h3 class=\"wp-block-heading\"><strong>1. User Job Distribution: A Heavy-Tailed Workload<\/strong><\/h3>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img fetchpriority=\"high\" decoding=\"async\" width=\"871\" height=\"608\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-07-21.png\" alt=\"\" class=\"wp-image-3890\" style=\"aspect-ratio:1.4325881036224168;width:501px;height:auto\" srcset=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-07-21.png 871w, https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-07-21-300x209.png 300w, https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-07-21-768x536.png 768w\" sizes=\"(max-width: 871px) 100vw, 871px\" \/><\/figure>\n<\/div>\n\n\n<p id=\"p-rc_79b68a4b6eb7077e-86\">The cluster&#8217;s workload is heavily concentrated among a very small fraction of the total user base. The data exhibits an extreme version of the Pareto principle (80\/20 rule)<sup><\/sup>.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>The Top 2%:<\/strong> Just <strong>2.12%<\/strong> of the heaviest users are responsible for <strong>70%<\/strong> of the total workload.<\/li>\n\n\n\n<li><strong>The Top 3%:<\/strong> Only <strong>3.75%<\/strong> of users account for <strong>80%<\/strong> of all jobs.<\/li>\n\n\n\n<li><strong>The 99th Percentile:<\/strong> To reach 99% of the total cluster workload, you only need to account for the top <strong>29.32%<\/strong> of users.<\/li>\n<\/ul>\n\n\n\n<p>This indicates that a vast majority of users submit relatively few jobs, while a core group of &#8220;power users&#8221; drives almost all cluster activity.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>2. Job Type Distribution: The Dominance of Normal and Array Jobs<\/strong><\/h3>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"926\" height=\"613\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-08-24.png\" alt=\"\" class=\"wp-image-3891\" srcset=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-08-24.png 926w, https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-08-24-300x199.png 300w, https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-08-24-768x508.png 768w\" sizes=\"(max-width: 926px) 100vw, 926px\" \/><\/figure>\n\n\n\n<p id=\"p-rc_79b68a4b6eb7077e-90\">The vast majority of the tens of millions of jobs run on the cluster fall into a few primary categories, while complex step-based or interactive jobs are statistically rare<sup><\/sup>.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Top Job Types (Log Scale Order of Magnitude):<\/strong>\n<ul class=\"wp-block-list\">\n<li><strong><code>X (Normal)<\/code>:<\/strong> The most common job type, with <strong>~2.93 million<\/strong> instances.<\/li>\n\n\n\n<li><strong><code>X.batch (Normal Batch)<\/code>:<\/strong> Close behind with <strong>~2.68 million<\/strong> jobs.<\/li>\n\n\n\n<li><strong><code>X_Y (Array Job)<\/code>:<\/strong> Accounting for <strong>~1.98 million<\/strong> jobs.<\/li>\n\n\n\n<li><strong><code>X_Y.batch (Array Batch)<\/code>:<\/strong> Representing <strong>~1.90 million<\/strong> jobs.<\/li>\n\n\n\n<li><strong><code>X.Y (Job Step)<\/code>:<\/strong> Representing <strong>~1.35 million<\/strong> jobs.<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li><strong>Rare Job Types:<\/strong> Conversely, standard interactive jobs (<code>X.interactive<\/code>) barely register, with only 2 counts in the entire analysed dataset. Complex mixed array types like <code>X_[Mixed%S]<\/code> and <code>X_[Mixed]<\/code> only have 3 and 12 counts respectively.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>3. Partition Distribution &amp; Behaviours<\/strong><\/h3>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" width=\"1024\" height=\"384\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-11-04-1024x384.png\" alt=\"\" class=\"wp-image-3892\" srcset=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-11-04-1024x384.png 1024w, https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-11-04-300x113.png 300w, https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-11-04-768x288.png 768w, https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-11-04-1536x576.png 1536w, https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2026\/05\/Screenshot-from-2026-05-01-05-11-04.png 1612w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p id=\"p-rc_79b68a4b6eb7077e-97\">Job volume is radically different depending on the partition, and specific partitions attract entirely different shapes of workloads<sup><\/sup>.<\/p>\n\n\n\n<p><strong>Overall Volume:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong><code>k2-hipri<\/code><\/strong> (High Priority) is the absolute behemoth of the cluster, processing an astonishing <strong>8,384,627 jobs<\/strong>.<\/li>\n\n\n\n<li><strong><code>k2-medpri<\/code><\/strong> (Medium Priority) is a distant second, processing <strong>1,214,761 jobs<\/strong>.<\/li>\n\n\n\n<li>Other notable general partitions include <code>k2-himem<\/code> (259k), <code>k2-living-labs<\/code> (176k), and <code>k2-lowpri<\/code> (142k).<\/li>\n\n\n\n<li>Among GPU partitions, <code>k2-gpu-v100<\/code> sees the highest job count (65k), followed by <code>k2-gpu-a100<\/code> (32k) and <code>k2-gpu-a100mig<\/code> (20k).<\/li>\n<\/ul>\n\n\n\n<p id=\"p-rc_79b68a4b6eb7077e-102\"><strong>Job Types by Partition (The Heatmap Analysis):<\/strong> The way users interact with different partitions varies greatly based on the queue<sup><\/sup>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>General Queues (<code>k2-lowpri<\/code> &amp; <code>k2-living-labs<\/code>):<\/strong> These partitions see a relatively balanced mix of job types. For example, <code>k2-living-labs<\/code> processed ~42k Normal jobs, ~40k Array jobs, and ~31k Normal Batch jobs.<\/li>\n\n\n\n<li><strong>The <code>MIXED<\/code> Partition Anomaly:<\/strong> The <code>MIXED<\/code> partition is almost exclusively used for massive array workloads. It processed virtually zero &#8220;Normal&#8221; jobs, but handled roughly <strong>69k Array jobs (<code>X_Y<\/code>)<\/strong> and <strong>68k Array Batch jobs (<code>X_Y.batch<\/code>)<\/strong>.<\/li>\n\n\n\n<li><strong><code>k2-gpu<\/code>:<\/strong> The standard GPU partition sees a much lower volume overall compared to CPU nodes, with its highest count being standard <code>X (Normal)<\/code> jobs (~4.3k) and <code>X.batch<\/code> (~3.4k), indicating less use of arrays for basic GPU tasks.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>1. User Job Distribution: A Heavy-Tailed Workload The cluster&#8217;s workload is heavily concentrated among a very small fraction of the total user base. The data exhibits an extreme version of the Pareto principle (80\/20 rule). This indicates that a vast majority of users submit relatively few jobs, while a core group of &#8220;power users&#8221; drives [&hellip;]<\/p>\n","protected":false},"author":1149,"featured_media":3550,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[152],"tags":[],"class_list":{"0":"post-3889","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-kelvin-living-lab","8":"czr-hentry"},"jetpack_featured_media_url":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2025\/08\/cf973c88-02be-4493-b13c-3face3e98b73.jpg","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts\/3889","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/users\/1149"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/comments?post=3889"}],"version-history":[{"count":1,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts\/3889\/revisions"}],"predecessor-version":[{"id":3893,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts\/3889\/revisions\/3893"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/media\/3550"}],"wp:attachment":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/media?parent=3889"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/categories?post=3889"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/tags?post=3889"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}