Name: MS-BioGraphs – MSA50
URL: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA50
Download Link: https://doi.org/10.21227/gmd9-1534
Script for Downloading All Files: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-on-IEEE-DataPort/
Validating and Sample Code: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-Validation/
Graph Explanation: Vertices represent proteins and each edge represents the sequence similarity between its two endpoints
Edge Weighted: Yes
Directed: Yes
Number of Vertices: 1,757,323,526
Number of Edges: 125,312,536,732
Maximum […]
graph datasets
Name: MS-BioGraphs – MSA10
URL: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MSA10
Download Link: https://doi.org/10.21227/gmd9-1534
Script for Downloading All Files: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-on-IEEE-DataPort/
Validating and Sample Code: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-Validation/
Graph Explanation: Vertices represent proteins and each edge represents the sequence similarity between its two endpoints
Edge Weighted: Yes
Directed: Yes
Number of Vertices: 1,757,323,526
Number of Edges: 25,236,632,682
Maximum […]
Name: MS-BioGraphs – MS1
URL: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-MS1
Download Link: https://doi.org/10.21227/gmd9-1534
Script for Downloading All Files: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-on-IEEE-DataPort/
Validating and Sample Code: https://blogs.qub.ac.uk/DIPSA/MS-BioGraphs-Validation/
Graph Explanation: Vertices represent proteins and each edge represents the sequence similarity between its two endpoints
Edge Weighted: Yes
Directed: No
Number of Vertices: 43,144,218
Number of Edges: 2,660,495,200
Maximum […]
Repository: https://github.com/DIPSA-QUB/MS-BioGraphs-Validation
Explanation: We provide a shell script, validation.sh, and a Java program, EdgeBlockSHA.java, to verify the correctness of the graphs. Each graph has a .ojson file whose shasum is checked against the value retrieved from our server. Files such as offsets.bin, wcc.bin, n2o.bin, trans_offsets.bin, and edges_shas.txt have shasum […]
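The checksum comparison described above can be illustrated with a minimal Python sketch. This is not the project's validation.sh or EdgeBlockSHA.java; the function names file_digest and verify are hypothetical, and the default of SHA-1 is an assumption based on the default of the shasum command-line tool, so the algorithm is left as a parameter. Where the expected digest comes from (the server value mentioned above) is also outside this sketch.

```python
import hashlib


def file_digest(path, algorithm="sha1", chunk_size=1 << 20):
    """Return the hex digest of a file, read in fixed-size chunks.

    Chunked reading keeps memory use constant, which matters for
    multi-gigabyte files. The algorithm name is an assumption; pass
    e.g. "sha256" if the published digests use a different hash.
    """
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def verify(path, expected_hex, algorithm="sha1"):
    """Return True if the file's digest matches the expected hex string.

    The comparison ignores surrounding whitespace and letter case in
    the expected value.
    """
    return file_digest(path, algorithm) == expected_hex.strip().lower()
```

A call such as verify("offsets.bin", expected) would return True only when the local file matches the digest supplied by the caller; a mismatch suggests an incomplete or corrupted download.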