Invited Talk – Efficient Computation through Tuned Approximation by David Keyes

21 February 2024

Abstract

Numerical software is being reinvented to provide opportunities to tune the accuracy of computation dynamically to the requirements of the application, resulting in savings of memory, time, and energy. Floating point computation in science and engineering has a history of “oversolving” relative to expectations for many models. So often are real datatypes defaulted to double precision that GPUs did not gain wide acceptance until they provided in hardware the double-precision operations not required in their original domain of graphics. Computational science is now reverting to lower precision arithmetic where possible. Many matrix operations can be carried out in lower precision at a blockwise level without loss of accuracy, adapting the precision to the magnitude of the norm of the block. Furthermore, many blocks can be approximated by low-rank near equivalents to a prescribed accuracy, adapting to the smoothness of the coefficients of the block. This leads to a smaller memory footprint, which implies higher residency in the memory hierarchy, leading in turn to less time and energy spent on data copying, which may even dwarf the savings from fewer and cheaper flops. We provide examples from several application domains, including Gordon Bell Prize-nominated research in 2022 in environmental statistics and in 2023 in seismic processing.
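To make the two adaptations concrete, here is a minimal NumPy sketch: a block is replaced by a truncated low-rank factorisation to a prescribed accuracy, and its storage precision is chosen from its norm. The tolerance, the demotion threshold, and the smooth test kernel are illustrative assumptions, not the tuned heuristics used in the cited work.

```python
import numpy as np

def compress_block(block, tol=1e-6, demote_norm=1e-3):
    """Illustrative block compression: truncated SVD to a prescribed accuracy,
    with the storage precision chosen from the norm of the block."""
    # Low-rank truncation: keep only singular values above tol * (largest one).
    U, s, Vt = np.linalg.svd(block, full_matrices=False)
    rank = max(1, int(np.sum(s > tol * s[0])))
    U, s, Vt = U[:, :rank], s[:rank], Vt[:rank, :]

    # Precision adaptation: blocks of small norm are demoted to single precision.
    dtype = np.float32 if np.linalg.norm(block) < demote_norm else np.float64
    return U.astype(dtype), s.astype(dtype), Vt.astype(dtype)

def restore_block(U, s, Vt):
    return (U * s) @ Vt

# A smooth off-diagonal kernel block: its singular values decay rapidly.
x = np.linspace(1.0, 2.0, 64)
block = 1.0 / (x[:, None] + x[None, :] + 3.0)
U, s, Vt = compress_block(block)
err = np.linalg.norm(block - restore_block(U, s, Vt)) / np.linalg.norm(block)
print(f"rank kept: {len(s)} of 64, relative error: {err:.1e}")
```

Because the kernel is smooth, only a handful of singular values survive the tolerance, which is exactly the data sparsity the talk exploits; the footprint shrinks before any flop is saved.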

Bio

David Keyes directs the Extreme Computing Research Center at the King Abdullah University of Science and Technology (KAUST), where he was a founding Dean in 2009 and currently serves in the Office of the President. He is a professor in the programs of Applied Mathematics, Computer Science, and Mechanical Engineering. He is also an Adjunct Professor of Applied Mathematics and Applied Physics at Columbia University, where he formerly held the Fu Foundation Chair. He works at the interface between parallel computing and PDEs and statistics, with a focus on scalable algorithms that exploit data sparsity. Before joining KAUST, Keyes led multi-institutional scalable solver software projects in the SciDAC and ASCI programs of the US Department of Energy (DoE), ran university collaboration programs at US DoE and NASA institutes, and taught at Columbia, Old Dominion, and Yale Universities. He is a Fellow of SIAM, the AMS, and the AAAS. He has been awarded the Gordon Bell Prize from the ACM, the Sidney Fernbach Award from the IEEE Computer Society, and the SIAM Prize for Distinguished Service to the Profession. He earned a B.S.E. in Aerospace and Mechanical Sciences from Princeton in 1978 and a Ph.D. in Applied Mathematics from Harvard in 1984.

Invited Talk – Fine-Grained and Phase-Aware Frequency Scaling for Energy-efficient Computing on Heterogeneous Multi-GPU Systems by Lorenzo Carpentieri

9 May 2025

Abstract

As computing power demands continue to grow, achieving energy efficiency in high-performance systems has become a key challenge. One of the most promising software techniques for energy efficiency is Dynamic Voltage and Frequency Scaling (DVFS), which optimizes the energy-performance trade-off by adjusting hardware frequencies.

This presentation introduces two complementary approaches that advance the state-of-the-art in energy-efficient heterogeneous computing through fine-grained and phase-aware frequency tuning.

The first approach, SYnergy, leverages a novel compiler- and runtime-integrated methodology built upon the SYCL programming model to enable fine-grained frequency scaling on heterogeneous hardware. SYnergy allows developers to specify energy goals for each individual kernel, such as minimizing the Energy-Delay Product (EDP) or achieving predefined energy-performance trade-offs. Through compiler integration and a machine learning model, the frequency of each kernel is statically optimized for the specified energy goal. To extend this fine-grained control to large-scale systems, SYnergy includes a custom SLURM plugin that enables execution across all available devices in a cluster, ensuring scalable energy savings.
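SYnergy's compiler and runtime integration is not reproduced here, but the underlying per-kernel decision can be sketched: given a model that predicts runtime and power for a kernel at each supported frequency, pick the frequency that optimises the chosen energy goal. The predictor, frequency table, and power model below are hypothetical stand-ins.

```python
def pick_frequency(frequencies_mhz, predict, goal="min_edp", max_slowdown=None):
    """Choose a per-kernel frequency from predicted (runtime, power) pairs.

    `predict(f)` stands in for a trained model returning (runtime_s, power_w)
    for the kernel at frequency f; it is a hypothetical placeholder here.
    """
    t_fastest, _ = predict(max(frequencies_mhz))
    candidates = []
    for f in frequencies_mhz:
        t, p = predict(f)
        energy = p * t
        if goal == "min_edp":
            score = energy * t                      # energy-delay product
        elif goal == "min_energy":
            if max_slowdown and t > max_slowdown * t_fastest:
                continue                            # respect a performance bound
            score = energy
        else:
            raise ValueError(goal)
        candidates.append((score, f))
    return min(candidates)[1]

# Toy predictor: runtime falls with frequency, power rises super-linearly.
toy = lambda f: (1000.0 / f, 30.0 + (f / 100.0) ** 2)
print(pick_frequency([800, 1000, 1200, 1400, 1600], toy, goal="min_edp"))
```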

While fine-grained frequency scaling at the kernel level can significantly improve energy efficiency, it also introduces overhead due to frequent frequency changes, which can in some cases outweigh the potential benefits. To address this, we propose a novel phase-aware method that detects distinct phases through application profiling and DAG analysis and sets an optimal frequency for each phase. Our methodology also considers MPI programs, where the overhead can be hidden by overlapping frequency changes with communication.
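The overlap idea can be illustrated schematically with mpi4py: the frequency change for the next phase is requested while a non-blocking exchange is in flight, so its latency hides behind communication rather than stalling compute. `set_gpu_frequency` is a placeholder for a vendor interface (for example NVML's locked-clocks calls), not the API of the presented tools.

```python
from mpi4py import MPI
import numpy as np

def set_gpu_frequency(mhz):
    """Placeholder for a vendor call (e.g. locking GPU clocks via NVML)."""
    pass

comm = MPI.COMM_WORLD
rank, size = comm.Get_rank(), comm.Get_size()

send_buf = np.full(1 << 20, rank, dtype=np.float64)
recv_buf = np.empty_like(send_buf)

# Start a non-blocking exchange for the current phase ...
requests = [comm.Isend(send_buf, dest=(rank + 1) % size),
            comm.Irecv(recv_buf, source=(rank - 1) % size)]

# ... and request the next phase's frequency while messages are in flight,
# so the transition latency overlaps with communication instead of compute.
set_gpu_frequency(1200)

MPI.Request.Waitall(requests)
# Kernels of the next phase now run at the already-applied frequency.
```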

Bio

Lorenzo Carpentieri received his master’s degree from the University of Salerno, Italy, in 2022. He is now a PhD student in the Department of Computer Science at the University of Salerno, under the supervision of Prof. Biagio Cosenza. His research interests include high-performance computing, compiler technology, and programming models, with a particular interest in energy-efficient and approximate computing.

Invited Talk – A Constraint Programming Solver You Can Trust (But Don’t Have To) by Ciaran McCreesh

28 August 2025

Abstract

Constraint programming is a declarative way of solving hard combinatorial, scheduling, resource allocation, and logistics problems. We specify a problem in a high-level language, give it to a solver, and the solver thinks for a while and then gives us the optimal answer. Unfortunately, even the best commercial and academic solvers contain bugs, and will occasionally give a wrong answer, potentially with devastating effects. One way of avoiding this situation is through proof logging, where solvers are modified to output a mathematical proof of correctness alongside their solution. This proof can then be independently audited by a very simple (and potentially even formally verified) proof checking tool, giving us complete confidence in the correctness of solutions (although not the solvers themselves). I’ll explain how proof logging works in general, and give an overview of the challenges and fun involved in bringing it to constraint programming. Ultimately, the aim here is to make algorithms something people can trust with their lives and livelihoods, just as engineers have already done with bridges, planes, and lifts.
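To give a flavour of what a proof checker does, the toy sketch below verifies a single reverse-unit-propagation (RUP) step for clausal reasoning: the checker trusts only the original constraints and mechanically confirms that asserting the negation of a claimed derived clause propagates to a conflict. Real constraint programming proofs use richer formats (for example the cutting-planes-style derivations checked by tools such as VeriPB), so treat this purely as an illustration.

```python
def unit_propagate(clauses, assignment):
    """Apply unit propagation to exhaustion; return 'CONFLICT' or the assignment."""
    assignment = dict(assignment)
    changed = True
    while changed:
        changed = False
        for clause in clauses:
            unassigned, satisfied = [], False
            for lit in clause:
                value = assignment.get(abs(lit))
                if value is None:
                    unassigned.append(lit)
                elif (lit > 0) == value:
                    satisfied = True
                    break
            if satisfied:
                continue
            if not unassigned:
                return "CONFLICT"
            if len(unassigned) == 1:              # unit clause: forced assignment
                lit = unassigned[0]
                assignment[abs(lit)] = lit > 0
                changed = True
    return assignment

def check_rup_step(clauses, derived):
    """Accept a derived clause iff asserting its negation propagates to a conflict."""
    negation = {abs(lit): lit < 0 for lit in derived}
    return unit_propagate(clauses, negation) == "CONFLICT"

# Formula: (x1 or x2) and (not x1 or x2); the clause (x2) follows, and the
# checker can confirm this without trusting the solver that claimed it.
formula = [[1, 2], [-1, 2]]
print(check_rup_step(formula, [2]))   # True
```

The checker never searches; it only replays cheap, mechanical steps, which is why it can be kept simple enough to audit or even formally verify.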

Bio

Ciaran McCreesh is a Royal Academy of Engineering Research Fellow working in the Formal Analysis, Theory and Algorithms group in the School of Computing Science at the University of Glasgow. His research looks at practical parallel algorithms, particularly in relation to hard subgraph problems. His publications cover combinatorial search, parallel algorithms, and constraint programming.

FERNS: Holistic integration of eco-Friendly dEsign tools, mateRials, fabrication technologies for the responsible co-creation of future Sustainable integrated electronic systems

The FERNS project is an MSCA Doctoral Network aiming to design eco-friendly electronics and accelerate their uptake. This multi-disciplinary project spans the fields of materials science, engineering, social science, and business to acquire a holistic perspective on sustainable electronics. In DIPSA, we will investigate the system software stack for disposable electronic devices, focussing on sensors.

The assumed context is that sensor devices are powered by minimal batteries, charged through energy-harvesting devices that capture energy from ambient environmental sources such as light, heat, and mechanical motion. For the sensors to complete their tasks of capturing and processing sensor data, temporarily storing data, and transmitting it over radio signals, they need to manage their energy budgets carefully. Several degrees of freedom can be leveraged to maximise the utility of the available energy, among them scheduling tasks based on predicted energy availability and adapting the precision of tasks to improve the trade-off between energy efficiency, timeliness, and quality of service.
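As a toy illustration of those two degrees of freedom, the sketch below admits tasks against a predicted energy budget and drops individual tasks to a cheaper, lower-precision variant when running everything at full precision would not fit. The task list, energy costs, and harvest forecast are made-up numbers.

```python
# Each task offers a full-precision variant and a cheaper approximate one (made-up costs).
TASKS = [
    {"name": "sample_sensor", "energy_mj": {"full": 2.0, "reduced": 2.0}},
    {"name": "filter_signal", "energy_mj": {"full": 6.0, "reduced": 2.5}},
    {"name": "transmit",      "energy_mj": {"full": 9.0, "reduced": 4.0}},
]

def schedule(tasks, stored_mj, forecast_harvest_mj):
    """Pick a precision level per task so the whole plan fits the predicted budget."""
    budget = stored_mj + forecast_harvest_mj
    plan = []
    for index, task in enumerate(tasks):
        full = task["energy_mj"]["full"]
        reduced = task["energy_mj"]["reduced"]
        # Cheapest possible cost of everything still to come after this task.
        rest = sum(t["energy_mj"]["reduced"] for t in tasks[index + 1:])
        if budget - full >= rest:
            choice, cost = "full", full          # full precision still fits
        elif budget >= reduced:
            choice, cost = "reduced", reduced    # degrade gracefully
        else:
            break                                # defer until more energy is harvested
        budget -= cost
        plan.append((task["name"], choice))
    return plan

print(schedule(TASKS, stored_mj=8.0, forecast_harvest_mj=5.0))
# [('sample_sensor', 'full'), ('filter_signal', 'full'), ('transmit', 'reduced')]
```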

This research project will combine insights from the burgeoning fields of intermittent computing and transprecision computing. Intermittent computing investigates the consequences of intermittent power supplies for computing systems design. Transprecision computing studies the trade-off between accuracy, performance, and energy consumption when the precision of the computation is varied in a controlled manner. The focus of the project will be on the design of the system software, which includes runtime systems and operating systems.

One PhD position will be available to investigate these issues. The position is salaried and will include a secondment at one of the partner institutions in the project. The project will provide broad training in transferable skills and sustainable practices. The intended start date is 1 April 2026.

Best Poster Award at SIMULTECH 2025

We are pleased to announce that Zohreh Moradinia has won the Best Poster Award at SIMULTECH 2025 for her work on “Machine Learning-Driven Framework for Identifying Parameter-Driven Anomalies in Multiphysics Simulations”. This work investigates whether errors in scientific simulations can be detected using machine learning. Zohreh considers errors resulting from incorrect configuration of the simulation, such as time steps that are too large, and has trained several models that can identify when and where the simulations have gone wrong. This is useful as a means to check the validity of simulation results, especially when the simulation is configured with liberal parameter settings aimed at high simulation speed.
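Zohreh's actual features and models are her own; purely as an illustration of the workflow, the sketch below trains a classifier on synthetic, made-up per-run features (think time step, mesh spacing, residual norm, energy drift) and uses it to flag a new run as anomalous.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

# Hypothetical training data: per-run features with labels from runs already
# known to be valid (0) or anomalous (1). Real features come from simulations.
X = rng.normal(size=(200, 4))
y = (X[:, 0] > 0.8).astype(int)      # stand-in rule: "time step too large"

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

new_run = np.array([[1.2, 0.1, 0.3, -0.2]])
print("flagged as anomalous:", bool(model.predict(new_run)[0]))
```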

Zohreh performed this work during her PhD in DIPSA. She is currently a Research Fellow at Imperial College London.

DIPSA at IPDPS’25

Two of our papers were accepted at IPDPS’25.

Brian will present his work on improving the scalability of parallel molecular dynamics simulation. He has developed a novel way to reduce the scalability bottleneck in the communication between the processes computing short-range forces and those computing long-range forces. His technique discards data dependences when the long-range processes are “too slow” and interpolates the (slowly varying) long-range forces to progress the computation. Stay tuned for the camera-ready copy of the paper! This work was supported by the EPSRC New Horizons project ASCCED (EP/X01794X/1).
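The camera-ready paper will have the details; as a rough sketch of the idea, the code below serves long-range forces to the short-range integrator and, when the long-range ranks are lagging, extrapolates from the two most recent completed evaluations instead of waiting on the dependence. The linear extrapolation and the class interface are illustrative assumptions, not Brian's actual scheme.

```python
import numpy as np

class LongRangeForceProxy:
    """Serve long-range forces to the short-range integrator, extrapolating
    from past evaluations when the latest long-range result is late."""

    def __init__(self):
        self.history = []                      # [(step, forces), ...], newest last

    def update(self, step, forces):
        self.history = (self.history + [(step, np.asarray(forces, dtype=float))])[-2:]

    def get(self, step):
        if len(self.history) == 2 and self.history[-1][0] < step:
            # Long-range ranks are lagging: extrapolate the slowly varying
            # forces linearly rather than stalling on the data dependence.
            (s0, f0), (s1, f1) = self.history
            return f1 + (f1 - f0) / (s1 - s0) * (step - s1)
        return self.history[-1][1]             # most recent result is current

proxy = LongRangeForceProxy()
proxy.update(0, [0.00, 1.00])
proxy.update(10, [0.02, 0.98])
print(proxy.get(14))                           # extrapolated forces for step 14
```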

Hans will present a parallel algorithm for the maximum clique problem. The key ideas revolve around reducing the amount of work wherever possible: delaying or avoiding the construction of fast representations of neighbour lists, early-exiting set intersection operations, and choosing algorithmically between maximum clique search and the complementary minimum vertex cover problem.
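One of those ideas, early-exiting set intersection, is easy to sketch: in a branch-and-bound clique search we often only need to know whether an intersection can still be large enough to beat the incumbent, so the scan can stop as soon as that becomes impossible. Sorted integer lists stand in for the paper's actual data structures.

```python
def bounded_intersection_size(a, b, needed):
    """Size of the intersection of sorted lists a and b, or -1 as soon as it
    becomes impossible to reach `needed` common elements (early exit)."""
    i = j = count = 0
    while i < len(a) and j < len(b):
        # Even if every remaining element matched, could we still reach `needed`?
        if count + min(len(a) - i, len(b) - j) < needed:
            return -1
        if a[i] == b[j]:
            count += 1
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1
        else:
            j += 1
    return count if count >= needed else -1

candidates = [2, 3, 5, 8, 9, 12]     # vertices still eligible for the clique
neighbours = [1, 3, 4, 8, 10]        # neighbourhood of the vertex being added
print(bounded_intersection_size(candidates, neighbours, needed=4))   # -1: prune
```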

Additionally, Marco will attend IPDPS’25 by virtue of a travel grant from the TCHPC/TCPP HPC student cohort programme.

Sweeping AAAI’25 success

We have been fortunate to have 3 papers accepted at AAAI’25.

Hung and colleagues will present their work on the explainability of time series classification. InteDisUX aims to create explanations that are accessible and meaningful to users (real people) by identifying subsequences of the time series that have a positive or negative influence on a prediction. It uses segment-level integrated gradients to merge successive segments into variable-length segments with high faithfulness and robustness. Follow the paper here: https://pure.qub.ac.uk/en/publications/intedisux-intepretation-guided-discriminative-user-centric-explan or come visit Hung at poster #8580. This work is funded by the MSCA-DN network RELAX.
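For intuition, here is a generic sketch of segment-level attribution in the spirit described above: per-step integrated gradients are aggregated into fixed segments, and neighbouring segments whose influence has the same sign are merged into variable-length segments. The gradient function and the merging rule are made-up stand-ins, not InteDisUX's faithfulness- and robustness-driven criteria.

```python
import numpy as np

def integrated_gradients(x, baseline, grad_fn, steps=50):
    """Standard integrated gradients along a straight path from the baseline.
    `grad_fn` is assumed to return d(prediction)/d(input); it is made up here."""
    alphas = np.linspace(0.0, 1.0, steps)
    avg_grad = np.mean([grad_fn(baseline + a * (x - baseline)) for a in alphas], axis=0)
    return (x - baseline) * avg_grad

def merge_segments(attributions, seg_len=10):
    """Aggregate per-step attributions into fixed segments, then merge
    neighbouring segments whose influence has the same sign."""
    n_seg = len(attributions) // seg_len
    scores = [attributions[i * seg_len:(i + 1) * seg_len].sum() for i in range(n_seg)]
    merged, start = [], 0
    for i in range(1, n_seg + 1):
        if i == n_seg or np.sign(scores[i]) != np.sign(scores[start]):
            merged.append((start * seg_len, i * seg_len, float(sum(scores[start:i]))))
            start = i
    return merged            # (begin, end, influence) variable-length segments

x = np.sin(np.linspace(0, 6, 100))                                  # toy series
grad_fn = lambda z: np.where(np.arange(len(z)) < 30, 1.0, -1.0)     # made-up gradients
print(merge_segments(integrated_gradients(x, np.zeros_like(x), grad_fn)))
```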

Zichi and colleagues will present their work on WaveletMixer, a new time series forecasting method that leverages wavelets to create a latent representation at multiple levels of resolution and phases. It creates a distinct forecasting model for each resolution and exploits the relationships between the different frequency bands to update each of the models. Zichi also introduces a new MLP model for time series forecasting that works well in this setting. Follow the paper here: https://pure.qub.ac.uk/en/publications/waveletmixer-a-multi-resolution-wavelets-based-mlp-mixer-for-mult or come visit Zichi at poster #10198. Zichi is supported by a scholarship from the China Scholarship Council.
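WaveletMixer's MLP-Mixer architecture and its cross-resolution updates are in the paper; the sketch below only illustrates the first ingredient, building a multi-resolution wavelet representation (with the PyWavelets package) and attaching a separate, deliberately simple least-squares forecaster to each resolution level.

```python
import numpy as np
import pywt   # PyWavelets

rng = np.random.default_rng(0)
series = np.sin(np.linspace(0, 20, 256)) + 0.1 * rng.normal(size=256)

# Multi-resolution view: one approximation band plus a detail band per level.
bands = pywt.wavedec(series, "db4", level=3)

def fit_band_forecaster(band, window=8):
    """Stand-in per-resolution model: least squares on lagged windows."""
    X = np.stack([band[i:i + window] for i in range(len(band) - window)])
    y = band[window:]
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    return lambda history: history[-window:] @ w

# A separate forecaster per resolution level, each predicting its own band.
forecasters = [fit_band_forecaster(band) for band in bands]
next_coefficients = [f(band) for f, band in zip(forecasters, bands)]
print([round(float(c), 3) for c in next_coefficients])
```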

Kazi Hasan Ibn Arif is a PhD student at Virginia Tech with whom we collaborate through the US-Ireland project ‘SWEET’ (USI-226). Kazi has developed a new technique to improve the computational efficiency of high-resolution Vision-Language Models (VLMs). A VLM combines two models: one that encodes the image into tokens, followed by a large language model that consumes them. The technique uses attention in the token-generating model to selectively drop tokens according to predefined budgets. The paper is on arXiv: https://arxiv.org/abs/2408.10945. Come visit Kazi at poster #7547.
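The exact scoring and budget allocation follow the paper; generically, attention-guided token dropping looks like the sketch below: image tokens are ranked by the attention they receive and only the top ones within the budget are passed on to the language model. The shapes and data here are hypothetical.

```python
import numpy as np

def prune_visual_tokens(tokens, attention, budget):
    """Keep only the `budget` tokens that receive the most attention.

    tokens:    (n_tokens, dim) image-token embeddings
    attention: (n_heads, n_tokens) attention paid to each token, e.g. taken
               from the final layer of the image encoder (hypothetical here)
    """
    scores = attention.mean(axis=0)                 # average over heads
    keep = np.sort(np.argsort(scores)[-budget:])    # top-k, original order kept
    return tokens[keep], keep

rng = np.random.default_rng(0)
tokens = rng.normal(size=(16, 64))       # 16 image tokens of dimension 64
attention = rng.random(size=(8, 16))     # 8 heads attending to 16 tokens
kept, kept_idx = prune_visual_tokens(tokens, attention, budget=4)
print(kept_idx, kept.shape)              # only 4 tokens go on to the LLM
```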

SIMD Bit Twiddling Hacks

The Bit Twiddling Hacks website collects an array of useful code fragments that implement some very specific computations very efficiently. Here we collect references to some handy code fragments for SIMD-based computation.
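As a taster, here is one classic from the original collection, the branch-free 32-bit population count. Because it uses only shifts, masks, adds, and a multiply, the same expression applies elementwise to whole arrays (shown here with NumPy on uint32 values), which is also why compilers can map it straight onto SIMD registers.

```python
import numpy as np

def popcount32(v):
    """Branch-free 32-bit population count (a classic bit-twiddling hack).
    Applied to a NumPy uint32 array, every step is elementwise, so the whole
    routine is data-parallel with no per-element branching."""
    v = v.astype(np.uint32)
    v = v - ((v >> np.uint32(1)) & np.uint32(0x55555555))
    v = (v & np.uint32(0x33333333)) + ((v >> np.uint32(2)) & np.uint32(0x33333333))
    v = (v + (v >> np.uint32(4))) & np.uint32(0x0F0F0F0F)
    return (v * np.uint32(0x01010101)) >> np.uint32(24)   # byte sums gather in the top byte

x = np.array([0x00000000, 0xFFFFFFFF, 0x0F0F0F0F, 0x12345678], dtype=np.uint32)
print(popcount32(x))    # [ 0 32 16 13]
```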


Open Position for Post-Doctoral Researcher on transprecise scheduling of machine learning tasks in edge and IoT environments

We are currently seeking to appoint an exceptional candidate to the post of Research Fellow.

The post holder will perform research on deployment of machine-learned models for health analytics on distributed IoT/edge/cloud systems using transprecise computing and contribute to the research project “Sustainable Wearable Edge InTelligence (SWEET)”.

The successful candidate must meet, and your application should clearly demonstrate that you meet, the following criteria:

  • Normally have, or be about to obtain, a relevant PhD. Relevant areas include high-performance computing, middleware and computing systems.
  • Recent relevant research experience to include:
    • Undertaking research in the area of high-performance / distributed / parallel computing or middleware
    • A proven track record of using experimental models to carry out analyses, critical evaluations, and interpretations of experimental data as relevant to the research project
    • Working effectively as part of a research team in the development and promotion of the research theme.
    • Strong publication record commensurate with stage of career.

Please note the above is not an exhaustive list. For further information about the role, including the essential and desirable criteria, please check the recruitment web page.

This post is available on a fixed term contract for 33 months.