Publications/Projects
My interests
are Parallel computing mainly using Message Passing and GPGPU techniques, and mathematical modelling/simulation of complex systems. My works combine strong theory with applications.
Publications
 Rahmat hidayat, Davide Spataro, Elisa De Giorgio, William Spataro, Donato D'Ambrosio, MultiAgent System with Multiple Group Modelling for Bird Flocking on GPU, Proceedings of The 2016 International Conference on Parallel, Distributed and NetworkBased Processing (PDP), Crete, Greece.
 Giuseppe Filippone, William Spataro, Donato D'Ambrosio, Davide Spataro, Davide Marocco,Giuseppe A. Trunfio, CUDA Dynamic Active Thread List Strategy to Accelerate Debris Flow Simulations, Proceedings of The 2015 International Conference on Parallel, Distributed and NetworkBased Processing (PDP), Turku, Finland, pp 330338.
 Spataro D., D’Ambrosio D., Filippone G., Spataro W.,The new SCIARAfv3 numerical model and acceleration by GPGPU strategies, International Journal of High Performance and Applications
 Spataro W., D’Ambrosio D., Filippone G.,Spataro D., G. Iovine, D. Marocco, Lava flow modeling by the SCIARAfv3 parallel numerical model, Proceedings of The 2014 International Conference on Parallel, Distributed and NetworkBased Processing (PDP), Turin, Italy, Feb.1214, 2014, pp 330338.
 G. Filippone, R. Parise, D. Spataro, D. D’Ambrosio, R. Rongo, and W. Spataro, Evolutionary applications to Cellular Automata models for volcano risk mitigation, Proceedings of The 2014 International Conference on Workshop on Artificial Life and Evolutionary Computation (WIVACE), May 1415 2014, Vietri sul Mare, Salerno, Italy.
Projects
Here listed a few projects/works I'm related with:
A CUDA header only library for performing stream compaction on newest devices (Compute Capability >=3.5) that uses ballotting intrinsic functions to achieve high performance. Tests shows that this implementation is faster than the Thrust library (see image, thanks to Zhang Jingrong). The library's implementation and its usage are described in this blog article (here). Freely available on github (here)

CuCCAl
acronym for “Cuda complex cellular automata library”, capable of hiding all the complexity behind the GPU programming process. The aim of this library is to make life easier to researchers that need to parallelize using GPU cellular automata. The programmer has only to write the elementary processes that compose the transition function, register substates and set up some parameters (for example, the number of steps). Memory management is completely behind the scene, letting the programmer concentrate only on the model he needs to write. (Full article)

MSc Thesis: accelerating the new SCRIARAfv3 numerical model by different GPGPU strategies (click to read)
A parallel version of the model SCIARAfv3 was designed and implemented using GeneralPurpose Computation with Graphics Processing Units and specifically adopting CUDA framework in order to improve the overall execution time. It involves the design and the application of strategies that allow to avoid incorrect computation results due to race conditions of any type and at the same time to achieve the best performance and occupancy of the underlying available hardware. Carried out experiments show that significant performance improvement in terms of speedup are achieved also thanks to some original optimizations strategies adopted, confirming the validity of graphics hardware as an alternative to much more expensive solutions for the simulation of cellular automata models.
Download as latex source or read the html version