Logo image
GPU-accelerated Hungarian algorithms for the Linear Assignment Problem
Journal article   Peer reviewed

GPU-accelerated Hungarian algorithms for the Linear Assignment Problem

Ketan Date and Rakesh Nagi
Parallel computing, Vol.57, pp.52-72
01/09/2016

Abstract

Computer Science Computer Science, Theory & Methods Science & Technology Technology
In this paper, we describe parallel versions of two different variants (classical and alternating tree) of the Hungarian algorithm for solving the Linear Assignment Problem (LAP). We have chosen Compute Unified Device Architecture (CUDA) enabled NVIDIA Graphics Processing Units (GPU) as the parallel programming architecture because of its ability to perform intense computations on arrays and matrices. The main contribution of this paper is an efficient parallelization of the augmenting path search phase of the Hungarian algorithm. Computational experiments on problems with up to 25 million variables reveal that the GPU-accelerated versions are extremely efficient in solving large problems, as compared to their CPU counterparts. Tremendous parallel speedups are achieved for problems with up to 400 million variables, which are solved within 13 seconds on average. We also tested multi-GPU versions of the two variants on up to 16 GPUs, which show decent scaling behavior for problems with up to 1.6 billion variables and dense cost matrix structure. (C) 2016 Elsevier B.V. All rights reserved.

Metrics

1 Record Views

Details

Logo image