{"id":408,"date":"2021-01-20T20:36:00","date_gmt":"2021-01-20T20:36:00","guid":{"rendered":"https:\/\/blogs.qub.ac.uk\/dipsa\/invited-talk-parallel-algorithms-for-density-based-and-structural-clustering\/"},"modified":"2021-01-20T20:36:00","modified_gmt":"2021-01-20T20:36:00","slug":"invited-talk-parallel-algorithms-for-density-based-and-structural-clustering","status":"publish","type":"post","link":"https:\/\/blogs.qub.ac.uk\/dipsa\/invited-talk-parallel-algorithms-for-density-based-and-structural-clustering\/","title":{"rendered":"Invited Talk: Parallel Algorithms for Density-Based and Structural Clustering &#8211; Dr Julian Shun"},"content":{"rendered":"\n<p>Dr Julian Shun, Massachusetts Institute of Technology <br>14 January 2021<\/p>\n\n\n\n<p>Abstract: This talk presents new parallel algorithms for density-based<br>spatial clustering (DBSCAN) on point sets and structural clustering<br>(SCAN) on graphs, two problems that have received significant<br>attention due to their applicability in a variety of data analysis<br>tasks.<br><br>Existing parallel algorithms for DBSCAN require much more work than<br>their sequential counterparts, making them inefficient for large<br>datasets.&nbsp; We bridge the gap between theory and practice of parallel<br>DBSCAN by presenting new parallel algorithms for Euclidean exact<br>DBSCAN and approximate DBSCAN that match the work bounds of their<br>sequential counterparts, and are highly parallel (polylogarithmic<br>depth).&nbsp; For graphs, we present the first parallel index-based SCAN<br>algorithm, based a recent sequential algorithm, which enables users to<br>efficiently explore many different parameter settings for cluster<br>generation. Our parallel algorithm has a better work bound than the<br>sequential algorithm, and achieves logarithmic depth. We also apply<br>locality-sensitive hashing to design a novel approximate SCAN<br>algorithm and prove guarantees for its clustering quality.&nbsp; We present<br>optimized implementations of our algorithms which achieve good<br>parallel scalability and outperform existing parallel implementations<br>by up to several orders of magnitude.<br><br>Bio: Julian Shun is the Douglas T. Ross Career Development Assistant<br>Professor of Software Technology in EECS and CSAIL at MIT.&nbsp; Prior to<br>coming to MIT, he was a Miller Research Fellow at UC Berkeley.&nbsp; He<br>received his Ph.D. from Carnegie Mellon University and his B.A. from<br>UC Berkeley.&nbsp; His research focuses on the theory and practice of<br>parallel algorithms and programming frameworks.&nbsp; He has received the<br>NSF CAREER Award, DOE Early Career Award, ACM Doctoral Dissertation<br>Award, CMU School of Computer Science Doctoral Dissertation Award,<br>Facebook Graduate Fellowship, Google Faculty Research Award, and best&nbsp;<br>paper awards at PLDI, SPAA, and DCC.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Dr Julian Shun, Massachusetts Institute of Technology 14 January 2021 Abstract: This talk presents new parallel algorithms for density-basedspatial clustering (DBSCAN) on point sets and structural clustering(SCAN) on graphs, two problems that have received significantattention due to their applicability in a variety of data analysistasks. Existing parallel algorithms for DBSCAN require much more work thantheir [&hellip;]<\/p>\n","protected":false},"author":974,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1],"tags":[33],"class_list":{"0":"post-408","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-uncategorised","7":"tag-seminars","8":"czr-hentry"},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts\/408","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/users\/974"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/comments?post=408"}],"version-history":[{"count":0,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/posts\/408\/revisions"}],"wp:attachment":[{"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/media?parent=408"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/categories?post=408"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.qub.ac.uk\/dipsa\/wp-json\/wp\/v2\/tags?post=408"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}