Mohsen Koohi Esfahani – DIPSA: Data-Intensive Parallel Systems and Algorithms

DOI: 10.1109/BigData66926.2025.11401782 Whereas the literature describes an increasing number of graph algorithms, loading graphs remains a time-consuming component of the end-to-end execution time. Graph frameworks often rely on custom graph storage formats that are not optimized for efficient loading of large-scale graph datasets. Furthermore, graph loading is often not optimized […]

ParaGrapher

ParaGrapher: A Parallel and Distributed Graph Loading Library for Large-Scale …

DOI: 10.48550/arXiv.2501.06872 This paper investigates the shared-memory Graph Transposition (GT) problem, a fundamental graph algorithm that is widely used in graph analytics and scientific computing. Previous GT algorithms have significant memory requirements that are proportional to the number of vertices and threads, which obstructs their use on large graphs. […]

LaganLighter

On Optimizing Locality of Graph Transposition on Modern Architectures

DOI: 10.48550/arXiv.2507.00716 ParaGrapher is a graph loading API and library that enables graph processing frameworks to load large-scale compressed graphs with minimal overhead. This capability accelerates the design and implementation of new high-performance graph algorithms and their evaluation on a wide range of graphs and across different frameworks. However, […]

ParaGrapher

Accelerating Loading WebGraphs in ParaGrapher

To evaluate the impact of locality-optimizing reordering algorithms, a baseline is required. To create the baseline, a random assignment of IDs to vertices may be used to produce a representation of the graph with reduced locality [DOI:10.1109/ISPASS57527.2023.00029, DOI:10.1109/IISWC53511.2021.00020]. To that end, we create the random_ordering() function in relabel.c […]

LaganLighter Technical Posts

Random Vertex Relabelling in LaganLighter
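
The idea in the excerpt above, assigning random new IDs to vertices, can be illustrated with a minimal sketch. This is not the random_ordering() function from relabel.c; the names random_relabel_sketch(), is_permutation() and next_rand() are hypothetical, and the sequential Fisher-Yates shuffle with a xorshift64* generator is an assumption chosen for brevity rather than the method used in LaganLighter.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* xorshift64* pseudo-random generator: deterministic for a given seed.
   (Illustrative choice; the state must never be zero.) */
static uint64_t next_rand(uint64_t *state) {
    uint64_t x = *state;
    x ^= x >> 12;
    x ^= x << 25;
    x ^= x >> 27;
    *state = x;
    return x * 2685821657736338717ULL;
}

/* Hypothetical helper: returns a malloc'ed array new_id[0..n-1] where
   new_id[v] is the random new ID of vertex v. The array is a permutation
   of 0..n-1 built with a Fisher-Yates shuffle. Returns NULL if the
   allocation fails. The caller frees the result. */
uint32_t *random_relabel_sketch(uint32_t n, uint64_t seed) {
    uint32_t *new_id = malloc((size_t)n * sizeof *new_id);
    if (new_id == NULL) return NULL;

    /* Start from the identity mapping. */
    for (uint32_t v = 0; v < n; v++) new_id[v] = v;

    uint64_t state = seed ? seed : 0x9E3779B97F4A7C15ULL;

    /* Swap each position with a random earlier (or same) position.
       The modulo introduces a slight bias, acceptable for a sketch. */
    for (uint32_t i = n; i > 1; i--) {
        uint32_t j = (uint32_t)(next_rand(&state) % i);
        uint32_t tmp = new_id[i - 1];
        new_id[i - 1] = new_id[j];
        new_id[j] = tmp;
    }
    return new_id;
}

/* Hypothetical check: returns 1 if new_id is a permutation of 0..n-1,
   and 0 otherwise (including when its scratch allocation fails). */
int is_permutation(const uint32_t *new_id, uint32_t n) {
    assert(new_id != NULL || n == 0);
    unsigned char *seen = calloc(n ? n : 1, 1);
    if (seen == NULL) return 0;

    int ok = 1;
    for (uint32_t v = 0; v < n && ok; v++) {
        if (new_id[v] >= n || seen[new_id[v]]) ok = 0;
        else seen[new_id[v]] = 1;
    }
    free(seen);
    return ok;
}
```

With such a mapping, an edge (u, v) of the original graph becomes (new_id[u], new_id[v]) in the relabelled graph, so neighbouring vertices no longer tend to have nearby IDs.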

We use MASTIFF to compute the weight of the Minimum Spanning Forest (MSF) of MS-BioGraphs while ignoring self-edges of the graphs.
– MS1: using a machine with 24 cores. MSF weight: 109,915,787,546
– MS50: using a machine with 128 cores. MSF weight: 416,318,200,808

LaganLighter MS-BioGraphs Technical Posts

Minimum Spanning Forest of MS-BioGraphs

In applications such as graph processing, how threads are pinned to CPU cores matters: threads that share resources (such as memory and cache) can improve performance by processing consecutive blocks of the input dataset, especially when the dataset has a high level of locality. In LaganLighter, we […]

LaganLighter Technical Posts

Topology-Based Thread Affinity Setting (Thread Pinning) in OpenMP

Short URL of this post: https://blogs.qub.ac.uk/DIPSA/graphs-list-2024 Real-World Graphs, Smaller Graphs, Synthetic Graph Generators, Technical Posts

Technical Posts

An (Incomplete) List of Publicly Available Graph Datasets/Generators

30th International European Conference on Parallel and Distributed Computing (Euro-Par 2024) DOI: 10.1007/978-3-031-69583-4_7 Abstract: The Maximum Weighted Clique (MWC) problem remains challenging due to its unfavourable time complexity. In this paper, we analyze the execution of exact search-based MWC algorithms and show that high-accuracy weighted cliques can be discovered in the […]

Uncategorised

QClique: Optimizing Performance and Accuracy in Maximum Weighted Clique – …

DOI: 10.48550/arXiv.2404.19735 Comprehensive evaluation is one of the bases of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input formats across different frameworks. However, each framework creates its own specific format, which may not support reading large-scale real-world graph datasets. This […]

ParaGrapher

Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher – …

Short URL of this post: https://blogs.qub.ac.uk/DIPSA/HDD-vs-SSD-vs-LustreFS-2024 We evaluate the read bandwidth of three storage types, for three parallel read methods and two block sizes. The source code is available in the ParaGrapher repository. The OS cache of storage contents was dropped after each evaluation (sudo sh -c 'echo 3 […]

ParaGrapher Technical Posts