Academic Authority

Publications

The list below is a representative cross-section of our topics and collaborations. Use sorting and filtering to quickly locate work by year, theme, or application area.

Publications

Focus areas include DSL design, CFD acceleration, reproducibility, mixed precision, public health simulation, and medical imaging workloads.

Filter by keyword


2012	OP2: An active library framework for solving unstructured mesh-based applications on multi-core and many-core architectures	2012 Innovative Parallel Computing (InPar) - IEEE	Search
2012	Efficient sparse matrix-vector multiplication on cache-based GPUs	2012 Innovative Parallel Computing (InPar) - IEEE	Search
2013	Designing OP2 for GPU architectures	Journal of Parallel and Distributed Computing - Academic Press	Search
2013	Design and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems	Parallel Computing - North-Holland	Search
2012	An analytical study of loop tiling for a large-scale unstructured mesh application	2012 SC Companion: High Performance Computing, Networking Storage and Analysis - IEEE	Search
2014	Trends in high-performance computing for engineering calculations	Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences - The Royal Society	Search
2014	Vectorizing unstructured mesh computations for many-core architectures	Proceedings of Programming Models and Applications on Multicores and Manycores	Search
2015	Acceleration of a full-scale industrial CFD application with OP2	IEEE Transactions on Parallel and Distributed Systems - IEEE	Search
2015	Finite element algorithms and data structures on graphical processing units	International Journal of Parallel Programming - Springer US Boston	Search
2015	A comparison between parallelization approaches in molecular dynamics simulations on GPUs	Journal of computational chemistry	Search
2014	The OPS domain specific abstraction for multi-block structured grid computations	2014 Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing - IEEE	Search
2014	Rolls Royce Hydra CFD Code on GPUs using OP2 Abstraction	GPU Technology Conference (GTC)	Search
2014	GPU implementation of finite difference solvers	2014 Seventh Workshop on High Performance Computational Finance - IEEE	Search
2012	Op2 airfoil example	URL https://citeseerx. ist. psu. edu/document	Search
2014	Abstraction and Implementation of Unstructured Grid Algorithms on Massively Parallel Heterogeneous Architectures	Pazmany Peter Katolikus Egyetem	Search
2014	Performance analysis of a high-level abstractions-based hydrocode on future computing systems	International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems - Springer International Publishing Cham	Search
2012	OP2 Developers Guide-Distributed Memory (MPI) Parallelisation	Publication details unavailable	Search
2014	High-level abstractions for performance, portability and continuity of scientific software on future computing systems	Technical report, April	Search
2015	High-level Abstractions for Performance, Portability and Continuity of Scientific Software on Future Computing Systems-CloverLeaf 3D	Publication details unavailable	Search
2015	Analysis of parallel processor architectures for the solution of the Black-Scholes PDE	2015 IEEE International Symposium on Circuits and Systems (ISCAS) - IEEE	Search
2016	Block-structured compressible Navier-Stokes solution using the OPS high-level abstraction	International Journal of Computational Fluid Dynamics - Taylor & Francis	Search
2015	Design and development of domain specific active libraries with proxy applications	2015 IEEE International Conference on Cluster Computing - IEEE	Search
2015	AmgX: A library for GPU accelerated algebraic multigrid and preconditioned iterative methods	SIAM Journal on Scientific Computing - Society for Industrial and Applied Mathematics	Search
2015	Benchmarking the IBM Power8 processor	CASCON	Search
2016	Auto-vectorizing a large-scale production unstructured-mesh CFD application	Proceedings of the 3rd Workshop on Programming Models for SIMD/Vector Processing	Search
2013	Op2 c++ user's manual	Publication details unavailable	Search
2013	Tsunami simulation using the OP2 parallel framework	Publication details unavailable	Search
2016	High performance computing on the ibm power8 platform	International Conference on High Performance Computing - Springer International Publishing Cham	Search
2016	Development of strategy and procedure for assessing portability of industrial codes to GPUs hardware: the TITAN example	Publication details unavailable	Search
2017	Loop tiling in large-scale stencil codes at run-time with OPS	IEEE Transactions on Parallel and Distributed Systems - IEEE	Search
2019	Low complexity algorithmic trading by feedforward neural networks	Computational Economics - Springer US New York	Search
2017	Achieving performance portability for a heat conduction solver mini-application on modern multi-core systems	2017 IEEE International Conference on Cluster Computing (CLUSTER) - IEEE	Search
2012	Designing op2 for gpu architectures	Journal of Parallel and Distributed Computing	Search
2017	Beyond 16GB: out-of-core stencil computations	Proceedings of the Workshop on Memory Centric Programming for HPC	Search
2017	Comparison of parallelisation approaches, languages, and compilers for unstructured mesh algorithms on GPUs	International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems - Springer International Publishing Cham	Search
2019	Locality optimized unstructured mesh algorithms on GPUs	Journal of Parallel and Distributed Computing - Academic Press	Search
2018	The VOLNA-OP2 tsunami code (version 1.5)	Geoscientific Model Development - Copernicus GmbH	Search
2016	A fast and flexible toolbox for tracking brain connections in diffusion MRI datasets using GPUs	22nd Annual Meeting of the Organization for Human Brain Mapping (OHBM), Geneva, Switzerland	Search
2019	Using GPUs to accelerate computational diffusion MRI: From microstructure estimation to tractography and connectomes	Neuroimage - Academic Press	Search
2018	An abstraction for local computations on structured meshes and its extension to handling multiple materials	CNNA 2018; The 16th International Workshop on Cellular Nanoscale Networks and their Applications - VDE	Search
2014	Cache-Blocking Tiling of Large Stencil Codes at Runtime	Performance Computing	Search
2018	Op2-clang: A source-to-source translator using clang/llvm libtooling	2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC) - IEEE	Search
2018	Heterogeneous cpu-gpu execution of stencil applications	2018 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC) - IEEE	Search
2015	OP2-TPDS2015-DATA	University of Oxford	Search
2019	Improving resilience of scientific software through a domain-specific approach	Journal of Parallel and Distributed Computing - Academic Press	Search
2019	Large-scale performance of a DSL-based multi-block structured-mesh application for Direct Numerical Simulation	Journal of Parallel and Distributed Computing - Academic Press	Search
2019	Batch solution of small PDEs with the OPS DSL	International Conference on High Performance Computing - Springer International Publishing Cham	Search
2015	rand	Publication details unavailable	Search
2019	PPCU Sam: Open-source face recognition framework	Procedia Computer Science - Elsevier	Search
2019	Performance portability of multi-material kernels	2019 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC) - IEEE	Search
2019	GPU support for automatic generation of finite-differences Stencil Kernels	Latin American High Performance Computing Conference - Springer International Publishing Cham	Search
2020	Productivity, performance, and portability for computational fluid dynamics applications	Computers & Fluids - Pergamon	Search
2019	GPU Support for Automatic Generation of Finite-Differences Stencil Kernels	arXiv e-prints	Search
2020	Performance portability of the mg-cfd mini-app with sycl	Proceedings of the International Workshop on OpenCL	Search
2020	Bitwise Reproducible task execution on unstructured mesh applications	2020 20th IEEE/ACM international symposium on cluster, cloud and Internet computing (CCGRID) - IEEE	Search
2020	Automatic parallel implementations of adjoint codes for structured mesh applications	2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID) - IEEE	Search
2020	GPU Support for Automatic Generation	High Performance Computing: 6th Latin American Conference, CARLA 2019, Turrialba, Costa Rica, September 25-27, 2019, Revised Selected Papers - Springer Nature	Search
2021	High-level FPGA accelerator design for structured-mesh-based explicit numerical solvers	2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS) - IEEE	Search
2020	Modernising an industrial cfd application	2020 Eighth International Symposium on Computing and Networking Workshops (CANDARW) - IEEE	Search
2015	reguly/volna: VOLNA-OP2	Github	Search
2011	OP-DSL/OP2-Common: OP2: open-source framework for the execution of unstructured grid applications on clusters of GPUs or multi-core CPUs	Github	Search
2021	Under the hood of sycl-an initial performance analysis with an unstructured-mesh cfd application	International Conference on High Performance Computing - Springer International Publishing Cham	Search
2022	Microsimulation based quantitative analysis of COVID-19 management strategies	PLoS computational biology - Public Library of Science San Francisco, CA USA	Search
2021	Automatic Parallelisation of Sturctured Mesh Computations with SYCL	2021 IEEE International Conference on Cluster Computing (CLUSTER) - IEEE	Search
2021	Scalable many-core algorithms for tridiagonal solvers	Computing in Science & Engineering - IEEE	Search
2021	Predictive analysis of large-scale coupled cfd simulations with the cpx mini-app	2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC) - IEEE	Search
2022	High throughput multidimensional tridiagonal system solvers on FPGAs	Proceedings of the 36th ACM International Conference on Supercomputing	Search
2022	Loop analysis quantifying human impact in a river ecosystem model	Ecological Complexity - Elsevier	Search
2022	FPGA acceleration of structured-mesh-based explicit and implicit numerical solvers using SYCL	Proceedings of the 10th International Workshop on OpenCL	Search
2023	Integral representation method based efficient rule optimizing framework for anti-money laundering	Journal of Money Laundering Control - Emerald Publishing Limited	Search
2022	Towards virtual certification of gas turbine engines with performance-portable simulations	2022 IEEE International Conference on Cluster Computing (CLUSTER) - IEEE	Search
2021	Query complexity in modern database DSLs	ACM Transactions on Information Systems	Search
2022	The design and utilisation of PanSim, a portable pandemic simulator	2022 First Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS) - IEEE	Search
2022	Virtual certification of gas turbine engines-visualizing the DLR Rig250 compressor	Publication details unavailable	Search
2023	Wastewater-based modeling, reconstruction, and prediction for COVID-19 outbreaks in Hungary caused by highly immune evasive variants	Water Research - Pergamon	Search
2023	Communication-avoiding optimizations for large-scale unstructured-mesh applications with op2	Proceedings of the 52nd International Conference on Parallel Processing	Search
2023	Quantifying and comparing the impact of combinations of non-pharmaceutical interventions on the spread of COVID-19	2023 31st Mediterranean Conference on Control and Automation (MED) - IEEE	Search
2023	Comparative evaluation of bandwidth-bound applications on the intel xeon cpu max series	Proceedings of the SC'23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis	Search
2023	Evaluating the performance portability of SYCL across CPUs and GPUs on bandwidth-bound applications	Proceedings of the SC'23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis	Search
2024	Enabling Bitwise Reproducibility for the Unstructured Computational Motif	Applied Sciences - MDPI	Search
2024	Computational tools to predict context-specific protein complexes	Current Opinion in Structural Biology - Elsevier Current Trends	Search
2010	Fadinges csatornamodellek es csatornabecslo protokollok vezetek nelkuli erzekelo halozatok szamara	ppke	Search
2024	Benchmarking the Evolution of Performance and Energy Efficiency Across Recent Generations of Intel Xeon Processors	SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis - IEEE	Search
2025	Performance and efficiency: A multi-generational benchmark of modern processors on bandwidth-bound HPC applications	Future Generation Computer Systems - North-Holland	Search
2025	Smart epidemic control: A hybrid model blending ODEs and agent-based simulations for optimal, real-world intervention planning	PLOS Computational Biology - Public Library of Science San Francisco, CA USA	Search
2025	Reduced and mixed precision turbulent flow simulations using explicit finite difference schemes	arXiv preprint arXiv:2505.20911	Search
2025	Anomaly Detection Algorithms for Real-Time Log Data Analysis at Scale	IEEE Access - IEEE	Search
2025	Agens alapu modellek jelentosege a koronavirus-jarvany kezeleseben= The Importance of Agent-Based Models in the Management of the Coronavirus Epidemic	HUMAN INNOVACIOS SZEMLE	Search
2019	PPCU Sam: Open-source face recognition framework	PROCEDIA COMPUTER SCIENCE - Elsevier BV	Search
2025	Digital Twin Approaches for Interpretable Side Effect Prediction in Drug Discovery	bioRxiv - Cold Spring Harbor Laboratory	Search
2012	GPU acceleration of medical ultrasound imaging	ppke	Search
2025	OPS-SENGA+: A Performance-Portable Solver for High-Fidelity Reacting Flow Simulations on CPUs and GPUs	International Conference on Numerical Combustion ICNC 2025 - Newcastle University	Search

Collaboration Footprint

Regular collaboration channels include the University of Oxford, Imperial College London, and the University of Warwick.

Industrial Relevance

Publication themes are linked to production-level challenges in aerospace simulation, uncertainty-aware modeling, and data-intensive imaging pipelines.

Open Access Strategy

We prioritize repositories and discoverability pathways that make methods reusable by academic and industrial practitioners.