"Parallel implementation of a gradient boosted trees algorithm"
Druzhkov P.N.

A software implementation of a parallel gradient boosted trees algorithm is described. The implementation requires distributed data storage and is intended mainly for large-scale machine learning tasks. Computational experiments on large datasets show that the proposed implementation has an advantage in performance and scalability over several other open-source implementations. Experimental quality evaluations are also given, showing that the proposed implementation is competitive. The paper was recommended for publication by the Program Committee of the HPC-2012 Forum (http://agora.guru.ru/hpc2012).

Keywords: decision tree, gradient boosting, parallel computation, MPI, distributed memory

Druzhkov P.N., e-mail: druzhkov.paul@gmail.com – Lobachevsky State University of Nizhni Novgorod; prospect Gagarina 23, Nizhni Novgorod, 603950, Russia