{"id":2276,"date":"2023-08-10T09:51:00","date_gmt":"2023-08-10T08:51:00","guid":{"rendered":"https:\/\/blogs.qub.ac.uk\/dipsa\/?p=2276"},"modified":"2024-06-20T16:29:18","modified_gmt":"2024-06-20T15:29:18","slug":"ms-biographs-ms200","status":"publish","type":"post","link":"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-ms200\/","title":{"rendered":"MS-BioGraphs MS200"},"content":{"rendered":"\n<p><\/p>\n\n\n\n\n\t\t<div id='msb'>\n\t<table id='tb1'>\n<tr><td>Name<\/td><td>MS-BioGraphs &#8211; MS200<\/td><\/tr>\n<tr><td>URL<\/td><td style='font-weight:normal'><a target='_blank' href='https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-MS200' rel=\"noopener\">https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-MS200<\/a><\/td><\/tr>\n<tr><td>Download Link<\/td><td><a href='https:\/\/doi.org\/10.21227\/gmd9-1534'>https:\/\/doi.org\/10.21227\/gmd9-1534<\/a><\/td><\/tr><tr><td>Script for Downloading All Files<\/td><td><a href='https:\/\/blogs.qub.ac.uk\/dipsa\/MS-BioGraphs-on-IEEE-DataPort\/'>https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-on-IEEE-DataPort\/<\/a><\/td><\/tr><tr><td>Validating and Sample Code<\/td><td><a href='https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-Validation\/'>https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-Validation\/<\/a><\/td><\/tr><tr><td>Graph Explanation<\/td><td style='font-weight: normal'>Vertices represent proteins and each edge represents the sequence similarity between its two endpoints<\/td><\/tr><tr><td>Edge Weighted<\/td><td>Yes<\/td><\/tr>\n<tr><td>Directed<\/td><td>No<\/td><\/tr>\n<tr><td>Number of Vertices<\/td><td>1,414,493,449<\/td><\/tr>\n<tr><td>Number of Edges<\/td><td>502,930,788,612<\/td><\/tr>\n<tr><td>Maximum Degree<\/td><td>745,735<\/td><\/tr>\n<tr><td>Minimum Weight<\/td><td>460<\/td><\/tr>\n<tr><td>Maximum Weight<\/td><td>634,925<\/td><\/tr>\n<tr><td>Number of Zero-Degree Vertices<\/td><td>0<\/td><\/tr>\n<tr><td>Average Degree<\/td><td>355.6<\/td><\/tr>\n<tr><td>Size of The Largest WCC<\/td><td>485,867,547,569<\/td><\/tr>\n<tr><td>Number of WCC<\/td><td>338,348,495<\/td><\/tr>\n<tr><td>Creation Details<\/td><td style='font-weight: normal'><a href='https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-Sequence-Similarity-Graph-Datasets\/' target='_blank' rel=\"noopener\">MS-BioGraphs: Sequency Similarity Graph Datasets<\/a><\/td><\/tr><tr><td>Format<\/td><td><a href='https:\/\/webgraph.di.unimi.it\/' target='_blank' rel=\"noopener\">WebGraph<\/a><\/td><\/tr><tr><td>License<\/td><td><a target='_blank' href='https:\/\/creativecommons.org\/licenses\/by-nc-sa\/2.0\/' rel=\"noopener\">CC BY-NC-SA<\/td><tr><td>QUB IDF<\/td><td style='font-weight: normal'>2223-052<\/td><\/tr>\n<tr><td>DOI<\/td><td><a target='_blank' href='https:\/\/doi.org\/10.5281\/zenodo.7820812' rel=\"noopener\">10.5281\/zenodo.7820812<\/a><\/td><\/tr>\n<tr><td>Citation<\/td><td style='font-weight:normal'><pre>Mohsen Koohi Esfahani, Sebastiano Vigna, \nPaolo Boldi, Hans Vandierendonck, Peter Kilpatrick, March 13, 2024, \n\"MS-BioGraphs: Trillion-Scale Sequence Similarity Graph Datasets\", \nIEEE Dataport, doi: https:\/\/doi.org\/10.21227\/gmd9-1534.<\/pre><\/td><\/tr>\n<tr><td>Bibtex<\/td><td style='font-weight:normal'><pre>@data{gmd9-1534-24,\ndoi = {10.21227\/gmd9-1534},\nurl = {https:\/\/doi.org\/10.21227\/gmd9-1534},\nauthor = {Koohi Esfahani, Mohsen and Vigna, Sebastiano and Boldi, \nPaolo and Vandierendonck, Hans and Kilpatrick, Peter},\npublisher = {IEEE Dataport},\ntitle = {MS-BioGraphs: Trillion-Scale Sequence Similarity Graph Datasets},\nyear = {2024} }<\/pre><\/td><\/tr>\n<\/table><br><br><h2>Files<\/h2><table id='files'>\n\n\t\t<tr>\n\t\t\t<td>Underlying Graph<\/td>\n\t\t\t<td>\n\t\t\t\tThe underlying graph in WebGraph format: \n\t\t\t\t<ul>\n\t\t\t\t\t<li>File: MS200-underlying.graph, Size: 1,459,981,767,426 Bytes<\/li>\n\t\t\t\t\t<li>File: MS200-underlying.offsets, Size: 3,174,012,489 Bytes<\/li>\n\t\t\t\t\t<li>File: MS200-underlying.properties, Size: 1,515 Bytes<\/li>\n\t\t\t\t\t\n\t\t\t\t<\/ul>\n\t\t\t\tTotal Size: 1,463,155,781,430 Bytes<br>\n\t\t\t\tThese files are validated using &#8216;Edge Blocks SHAs File&#8217; as follows.\n\t\t\t<\/td>\n\t\t<\/tr>\n\n\t\t<tr>\n\t\t\t<td>Weights (Labels)<\/td>\n\t\t\t<td>\n\t\t\t\tThe weights of the graph in WebGraph format: \n\t\t\t\t<ul>\n\t\t\t\t\t<li>File: MS200-weights.labels, Size: 1,199,053,831,206 Bytes<\/li>\n\t\t\t\t\t<li>File: MS200-weights.labeloffsets, Size: 3,090,041,102 Bytes<\/li>\n\t\t\t\t\t<li>File: MS200-weights.properties, Size: 186 Bytes<\/li>\n\t\t\t\t<\/ul>\n\t\t\t\tTotal Size: 1,202,143,872,494 Bytes<br>\n\t\t\t\tThese files are validated using &#8216;Edge Blocks SHAs File&#8217; as follows.\n\t\t\t<\/td>\n\t\t<\/tr>\n\n\t\t<tr>\n\t\t\t<td>Edge Blocks SHAs File (Text) <\/td>\n\t\t\t<td>\n\t\t\t\tThis file contains the shasums of edge blocks where each block contains \n\t\t\t\t64 Million continuous edges and has one shasum for its 64M endpoints and \n\t\t\t\tone for its 64M edge weights. <br>\n\t\t\t\tThe file is used to validate the underlying graph and the weights.\n\t\t\t\tFor further explanation about validation process,  please visit\n\t\t\t\tthe <a href='https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-Validation'>\n\t\t\t\thttps:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-Validation<\/a>.\n\n\t\t\t\t<ul>\n\t\t\t\t\t<li>Name: MS200_edges_shas.txt<\/li>\n\t\t\t\t\t<li>Size: 899,640 Bytes<\/li>\n\t\t\t\t\t<li>SHASUM: 5bb635fc94aea3ee7b2b6a4aecbbb1fc6f77e1b5<\/li>\n\t\t\t\t\t\n\t\t\t\t<\/ul>\n\t\t\t<\/td>\n\t\t<\/tr>\n\n\t\t<tr>\n\t\t\t<td>Offsets (Binary)<\/td>\n\t\t\t<td>\n\t\t\t\tThe offsets array of the CSX (Compressed Sparse Rows\/Columns) graph in binary \n\t\t\t\tformat and little endian order. It consists of |V|+1 8-Bytes elements. <br>\n\t\t\t\tThe first and last values are 0 and |E|, respectively.<br>\n\t\t\t\tThis array helps converting the graph (or parts of it) from WebGraph format \n\t\t\t\tto binary format by one pass over (related) edges.<br>\n\t\t\t\t<ul>\n\t\t\t\t\t<li>Name: MS200_offsets.bin<\/li>\n\t\t\t\t\t<li>Size: 11,315,947,600 Bytes<\/li>\n\t\t\t\t\t<li>SHASUM: 9192158aab65e1ca536a46183411d87452cd9ee3<\/li>\n\t\t\t\t\t\n\t\t\t\t<\/ul>\n\t\t\t<\/td>\n\t\t<\/tr>\n\n\t\t<tr>\n\t\t\t<td>WCC (Binary)<\/td>\n\t\t\t<td>\n\t\t\t\tThe Weakly-Connected Compontent (WCC) array in binary format and little endian order.<br>\n\t\t\t\tThis array consists of |V| 4-Bytes elements\n\t\t\t\tThe vertices in the same component have the same values in the WCC array.<br>\n\t\t\t\t<ul>\n\t\t\t\t\t<li>Name: MS200-wcc.bin<\/li>\n\t\t\t\t\t<li>Size: 5,657,973,796 Bytes<\/li>\n\t\t\t\t\t<li>SHASUM: 027e1b826659b5ec0f62921a4eb3ecd6c83fa76a<\/li>\n\t\t\t\t\t\n\t\t\t\t<\/ul>\n\t\t\t<\/td>\n\t\t<\/tr>\n\n\t\t<tr>\n\t\t\t<td>Names (tar.gz) <\/td>\n\t\t\t<td>\n\t\t\t\tThis compressed file contains 120 files in CSV format using &#8216;;&#8217; as the separator.\n\t\t\t\tEach row has two columns: ID of vertex and name of the sequence.<br>\n\t\t\t\tNote: If the graph has a &#8216;N2O Reordering&#8217; file, the n2o array should \n\t\t\t\tbe used to convert the vertex ID to old vertex ID which is used for identifying\n\t\t\t\tname of the protein in the `names.tar.gz` file.\n\t\t\t\t\n\t\t\t\t<ul>\n\t\t\t\t\t<li>Name: names.tar.gz<\/li>\n\t\t\t\t\t<li>Size: 27,130,045,933 Bytes<\/li>\n\t\t\t\t\t<li>SHASUM: ba00b58bbb2795445554058a681b573c751ef315<\/li>\n\t\t\t\t\t\n\t\t\t\t<\/ul>\n\t\t\t<\/td>\n\t\t<\/tr>\n\n\t\t\t<tr>\n\t\t\t\t<td>N2O Reordering (Binary)<\/td>\n\t\t\t\t<td>\n\t\t\t\t\tThe New to Old (N2O) reordering array of the graph in binary format and little endian order.<br>\n\t\t\t\t\tIt consists of |V| 4-Bytes elements and identifies the old ID of each vertex\n\t\t\t\t\twhich is used in searching the name of vertex (protein) in the names.tar.gz file .\n\t\t\t\t\t<ul>\n\t\t\t\t\t\t<li>Name: MS200-n2o.bin<\/li>\n\t\t\t\t\t\t<li>Size: 5,657,973,796 Bytes<\/li>\n\t\t\t\t\t\t<li>SHASUM: de833f1c36011af07c165f53760b82a49715537d<\/li>\n\t\t\t\t\t\t\n\t\t\t\t\t<\/ul>\n\t\t\t\t<\/td>\n\t\t\t<\/tr>\n\n\t\t<tr>\n\t\t\t<td>OJSON<\/td>\n\t\t\t<td>\n\t\t\t\tThe charactersitics of the graph and shasums of the files.<br>\n\t\t\t\tIt is in the open json format and needs a closing brace (}) to be appended\n\t\t\t\tbefore being passed to a json parser.\n\n\t\t\t\t<ul>\n\t\t\t\t\t<li>Name: MS200.ojson<\/li>\n\t\t\t\t\t<li>Size: 757 Bytes<\/li>\n\t\t\t\t\t<li>SHASUM: 540c0bded9ab8d334574ed7dd7909435b617ecf3<\/li>\n\t\t\t\t\t\n\t\t\t\t<\/ul>\n\t\t\t<\/td>\n\t\t<\/tr>\n<\/table><br><br><h2>Plots<\/h2><p style='font-size:1.2em'>For the explanation about the plots, please refer to the <a href='https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-Sequence-Similarity-Graph-Datasets'>MS-BioGraphs paper<\/a>.<br>To have a better resolution, please click on the images.<\/p><table id='plots'><tr><td>Degree Distribution<\/td><td><a target='_blank' href='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-1.png' rel=\"noopener\"><img src='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-1.png' \/><\/a><\/td><\/tr>\n<tr><td>Weight Distribution<\/td><td><a target='_blank' href='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-2.png' rel=\"noopener\"><img src='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-2.png' \/><\/a><\/td><\/tr>\n<tr><td>Vertex-Relative Weight Distribution<\/td><td><a target='_blank' href='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-3.png' rel=\"noopener\"><img src='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-3.png' \/><\/a><\/td><\/tr>\n<tr><td>Degree Decomposition<\/td><td><a target='_blank' href='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-4.png' rel=\"noopener\"><img src='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-4.png' \/><\/a><\/td><\/tr>\n<tr><td>Cell-Binned Average Weight Degree Distribution<\/td><td><a target='_blank' href='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-5.png' rel=\"noopener\"><img src='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/MS200-5.png' \/><\/a><\/td><\/tr>\n<tr><td>Weakly-Connected Components Size Distribution<\/td><td><a target='_blank' href='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/wcc-sym.png' rel=\"noopener\"><img src='https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/wcc-sym.png' \/><\/a><\/td><\/tr>\n<\/table><\/div>\n\n\n\n<p class=\"has-medium-font-size\"><br><strong><a rel=\"noreferrer noopener\" href=\"https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs\/\" target=\"_blank\">MS-BioGraphs<\/a><\/strong><\/p>\n\n\n\n<p class=\"has-medium-font-size\"><br><strong>Related Posts<\/strong><\/p>\n\n\n<ul class=\"wp-block-latest-posts__list has-dates wp-block-latest-posts\"><li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2024\/08\/trees-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/minimum-spanning-forest-of-ms-biographs\/\">Minimum Spanning Forest of MS-BioGraphs<\/a><time datetime=\"2024-08-09T14:11:36+01:00\" class=\"wp-block-latest-posts__post-date\">9 August 2024<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2024\/04\/ivy-2-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-on-ieee-dataport\/\">MS-BioGraphs on IEEE DataPort<\/a><time datetime=\"2024-04-17T07:26:23+01:00\" class=\"wp-block-latest-posts__post-date\">17 April 2024<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2024\/02\/poplar2-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/paragrapher-source-code-for-webgraph-types\/\">ParaGrapher Source Code For WebGraph Types<\/a><time datetime=\"2024-02-16T08:13:13+00:00\" class=\"wp-block-latest-posts__post-date\">16 February 2024<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/11\/goldcrest-1-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/on-overcoming-hpc-challenges-of-trillion-scale-real-world-graph-datasets\/\">On Overcoming HPC Challenges of  Trillion-Scale Real-World Graph Datasets \u2013 BigData&#8217;23 (Short Paper)<\/a><time datetime=\"2023-12-15T02:47:00+00:00\" class=\"wp-block-latest-posts__post-date\">15 December 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/10-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/dataset-announcement-ms-biographs-trillion-scale-public-real-world-sequence-similarity-graphs\/\">Dataset Announcement: MS-BioGraphs, Trillion-Scale Public Real-World Sequence Similarity Graphs &#8211; IISWC&#8217;23 (Poster)<\/a><time datetime=\"2023-10-02T00:26:00+01:00\" class=\"wp-block-latest-posts__post-date\">2 October 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/2-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-sequence-similarity-graph-datasets\/\">MS-BioGraphs: Sequence Similarity Graph Datasets<\/a><time datetime=\"2023-08-30T06:52:00+01:00\" class=\"wp-block-latest-posts__post-date\">30 August 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/1-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-ms\/\">MS-BioGraphs MS<\/a><time datetime=\"2023-08-10T09:53:42+01:00\" class=\"wp-block-latest-posts__post-date\">10 August 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/6-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-msa500\/\">MS-BioGraphs MSA500<\/a><time datetime=\"2023-08-10T09:52:00+01:00\" class=\"wp-block-latest-posts__post-date\">10 August 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/3-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-ms200\/\">MS-BioGraphs MS200<\/a><time datetime=\"2023-08-10T09:51:00+01:00\" class=\"wp-block-latest-posts__post-date\">10 August 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/7-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-msa200\/\">MS-BioGraphs MSA200<\/a><time datetime=\"2023-08-10T09:50:00+01:00\" class=\"wp-block-latest-posts__post-date\">10 August 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/4-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-ms50\/\">MS-BioGraphs MS50<\/a><time datetime=\"2023-08-10T09:49:00+01:00\" class=\"wp-block-latest-posts__post-date\">10 August 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/8-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-msa50\/\">MS-BioGraphs MSA50<\/a><time datetime=\"2023-08-10T09:48:00+01:00\" class=\"wp-block-latest-posts__post-date\">10 August 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/9-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-msa10\/\">MS-BioGraphs MSA10<\/a><time datetime=\"2023-08-10T09:44:41+01:00\" class=\"wp-block-latest-posts__post-date\">10 August 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/5-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-ms1\/\">MS-BioGraphs MS1<\/a><time datetime=\"2023-08-10T09:41:14+01:00\" class=\"wp-block-latest-posts__post-date\">10 August 2023<\/time><\/li>\n<li><div class=\"wp-block-latest-posts__featured-image alignleft\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/11-150x150.jpg\" class=\"attachment-thumbnail size-thumbnail wp-post-image\" alt=\"\" style=\"max-width:60px;max-height:60px;\" \/><\/div><a class=\"wp-block-latest-posts__post-title\" href=\"https:\/\/blogs.qub.ac.uk\/dipsa\/ms-biographs-validation\/\">MS-BioGraphs Validation<\/a><time datetime=\"2023-08-10T09:40:00+01:00\" class=\"wp-block-latest-posts__post-date\">10 August 2023<\/time><\/li>\n<\/ul>\n\n\n<p><\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Name MS-BioGraphs &#8211; MS200 URL https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-MS200 Download Link https:\/\/doi.org\/10.21227\/gmd9-1534 Script for Downloading All Files https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-on-IEEE-DataPort\/ Validating and Sample Code https:\/\/blogs.qub.ac.uk\/DIPSA\/MS-BioGraphs-Validation\/ Graph Explanation Vertices represent proteins and each edge represents the sequence similarity between its two endpoints Edge Weighted Yes Directed No Number of Vertices 1,414,493,449 Number of Edges 502,930,788,612 Maximum Degree 745,735 Minimum Weight 460 [&hellip;]<\/p>\n","protected":false},"author":1315,"featured_media":2277,"comment_status":"open","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[63],"tags":[68,67,35,38,64,66,65],"class_list":{"0":"post-2276","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-ms-biographs","8":"tag-dataset","9":"tag-graph-datasets","10":"tag-graph-processing","11":"tag-high-performance-computing","12":"tag-high-performance-graph-processing","13":"tag-real-world-graphs","14":"tag-sequence-similarity-graphs","15":"czr-hentry"},"jetpack_featured_media_url":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-content\/uploads\/sites\/14\/2023\/08\/3.jpg","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts\/2276","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/users\/1315"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/comments?post=2276"}],"version-history":[{"count":11,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts\/2276\/revisions"}],"predecessor-version":[{"id":3101,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts\/2276\/revisions\/3101"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/media\/2277"}],"wp:attachment":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/media?parent=2276"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/categories?post=2276"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/tags?post=2276"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}