Academic Authority

Publications

The list below is a representative cross-section of our topics and collaborations. Use sorting and filtering to quickly locate work by year, theme, or application area.

Publications

Focus areas include DSL design, CFD acceleration, reproducibility, mixed precision, public health simulation, and medical imaging workloads.

2012 OP2: An active library framework for solving unstructured mesh-based applications on multi-core and many-core architectures 2012 Innovative Parallel Computing (InPar) - IEEE Search
2012 Efficient sparse matrix-vector multiplication on cache-based GPUs 2012 Innovative Parallel Computing (InPar) - IEEE Search
2013 Designing OP2 for GPU architectures Journal of Parallel and Distributed Computing - Academic Press Search
2013 Design and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems Parallel Computing - North-Holland Search
2012 An analytical study of loop tiling for a large-scale unstructured mesh application 2012 SC Companion: High Performance Computing, Networking Storage and Analysis - IEEE Search
2014 Trends in high-performance computing for engineering calculations Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences - The Royal Society Search
2014 Vectorizing unstructured mesh computations for many-core architectures Proceedings of Programming Models and Applications on Multicores and Manycores Search
2015 Acceleration of a full-scale industrial CFD application with OP2 IEEE Transactions on Parallel and Distributed Systems - IEEE Search
2015 Finite element algorithms and data structures on graphical processing units International Journal of Parallel Programming - Springer US Boston Search
2015 A comparison between parallelization approaches in molecular dynamics simulations on GPUs Journal of computational chemistry Search
2014 The OPS domain specific abstraction for multi-block structured grid computations 2014 Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing - IEEE Search
2014 Rolls Royce Hydra CFD Code on GPUs using OP2 Abstraction GPU Technology Conference (GTC) Search
2014 GPU implementation of finite difference solvers 2014 Seventh Workshop on High Performance Computational Finance - IEEE Search
2012 Op2 airfoil example URL https://citeseerx. ist. psu. edu/document Search
2014 Abstraction and Implementation of Unstructured Grid Algorithms on Massively Parallel Heterogeneous Architectures Pazmany Peter Katolikus Egyetem Search
2014 Performance analysis of a high-level abstractions-based hydrocode on future computing systems International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems - Springer International Publishing Cham Search
2012 OP2 Developers Guide-Distributed Memory (MPI) Parallelisation Publication details unavailable Search
2014 High-level abstractions for performance, portability and continuity of scientific software on future computing systems Technical report, April Search
2015 High-level Abstractions for Performance, Portability and Continuity of Scientific Software on Future Computing Systems-CloverLeaf 3D Publication details unavailable Search
2015 Analysis of parallel processor architectures for the solution of the Black-Scholes PDE 2015 IEEE International Symposium on Circuits and Systems (ISCAS) - IEEE Search
2016 Block-structured compressible Navier-Stokes solution using the OPS high-level abstraction International Journal of Computational Fluid Dynamics - Taylor & Francis Search
2015 Design and development of domain specific active libraries with proxy applications 2015 IEEE International Conference on Cluster Computing - IEEE Search
2015 AmgX: A library for GPU accelerated algebraic multigrid and preconditioned iterative methods SIAM Journal on Scientific Computing - Society for Industrial and Applied Mathematics Search
2015 Benchmarking the IBM Power8 processor CASCON Search
2016 Auto-vectorizing a large-scale production unstructured-mesh CFD application Proceedings of the 3rd Workshop on Programming Models for SIMD/Vector Processing Search
2013 Op2 c++ user's manual Publication details unavailable Search
2013 Tsunami simulation using the OP2 parallel framework Publication details unavailable Search
2016 High performance computing on the ibm power8 platform International Conference on High Performance Computing - Springer International Publishing Cham Search
2016 Development of strategy and procedure for assessing portability of industrial codes to GPUs hardware: the TITAN example Publication details unavailable Search
2017 Loop tiling in large-scale stencil codes at run-time with OPS IEEE Transactions on Parallel and Distributed Systems - IEEE Search
2019 Low complexity algorithmic trading by feedforward neural networks Computational Economics - Springer US New York Search
2017 Achieving performance portability for a heat conduction solver mini-application on modern multi-core systems 2017 IEEE International Conference on Cluster Computing (CLUSTER) - IEEE Search
2012 Designing op2 for gpu architectures Journal of Parallel and Distributed Computing Search
2017 Beyond 16GB: out-of-core stencil computations Proceedings of the Workshop on Memory Centric Programming for HPC Search
2017 Comparison of parallelisation approaches, languages, and compilers for unstructured mesh algorithms on GPUs International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems - Springer International Publishing Cham Search
2019 Locality optimized unstructured mesh algorithms on GPUs Journal of Parallel and Distributed Computing - Academic Press Search
2018 The VOLNA-OP2 tsunami code (version 1.5) Geoscientific Model Development - Copernicus GmbH Search
2016 A fast and flexible toolbox for tracking brain connections in diffusion MRI datasets using GPUs 22nd Annual Meeting of the Organization for Human Brain Mapping (OHBM), Geneva, Switzerland Search
2019 Using GPUs to accelerate computational diffusion MRI: From microstructure estimation to tractography and connectomes Neuroimage - Academic Press Search
2018 An abstraction for local computations on structured meshes and its extension to handling multiple materials CNNA 2018; The 16th International Workshop on Cellular Nanoscale Networks and their Applications - VDE Search
2014 Cache-Blocking Tiling of Large Stencil Codes at Runtime Performance Computing Search
2018 Op2-clang: A source-to-source translator using clang/llvm libtooling 2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC) - IEEE Search
2018 Heterogeneous cpu-gpu execution of stencil applications 2018 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC) - IEEE Search
2015 OP2-TPDS2015-DATA University of Oxford Search
2019 Improving resilience of scientific software through a domain-specific approach Journal of Parallel and Distributed Computing - Academic Press Search
2019 Large-scale performance of a DSL-based multi-block structured-mesh application for Direct Numerical Simulation Journal of Parallel and Distributed Computing - Academic Press Search
2019 Batch solution of small PDEs with the OPS DSL International Conference on High Performance Computing - Springer International Publishing Cham Search
2015 rand Publication details unavailable Search
2019 PPCU Sam: Open-source face recognition framework Procedia Computer Science - Elsevier Search
2019 Performance portability of multi-material kernels 2019 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC) - IEEE Search
2019 GPU support for automatic generation of finite-differences Stencil Kernels Latin American High Performance Computing Conference - Springer International Publishing Cham Search
2020 Productivity, performance, and portability for computational fluid dynamics applications Computers & Fluids - Pergamon Search
2019 GPU Support for Automatic Generation of Finite-Differences Stencil Kernels arXiv e-prints Search
2020 Performance portability of the mg-cfd mini-app with sycl Proceedings of the International Workshop on OpenCL Search
2020 Bitwise Reproducible task execution on unstructured mesh applications 2020 20th IEEE/ACM international symposium on cluster, cloud and Internet computing (CCGRID) - IEEE Search
2020 Automatic parallel implementations of adjoint codes for structured mesh applications 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID) - IEEE Search
2020 GPU Support for Automatic Generation High Performance Computing: 6th Latin American Conference, CARLA 2019, Turrialba, Costa Rica, September 25-27, 2019, Revised Selected Papers - Springer Nature Search
2021 High-level FPGA accelerator design for structured-mesh-based explicit numerical solvers 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS) - IEEE Search
2020 Modernising an industrial cfd application 2020 Eighth International Symposium on Computing and Networking Workshops (CANDARW) - IEEE Search
2015 reguly/volna: VOLNA-OP2 Github Search
2011 OP-DSL/OP2-Common: OP2: open-source framework for the execution of unstructured grid applications on clusters of GPUs or multi-core CPUs Github Search
2021 Under the hood of sycl-an initial performance analysis with an unstructured-mesh cfd application International Conference on High Performance Computing - Springer International Publishing Cham Search
2022 Microsimulation based quantitative analysis of COVID-19 management strategies PLoS computational biology - Public Library of Science San Francisco, CA USA Search
2021 Automatic Parallelisation of Sturctured Mesh Computations with SYCL 2021 IEEE International Conference on Cluster Computing (CLUSTER) - IEEE Search
2021 Scalable many-core algorithms for tridiagonal solvers Computing in Science & Engineering - IEEE Search
2021 Predictive analysis of large-scale coupled cfd simulations with the cpx mini-app 2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC) - IEEE Search
2022 High throughput multidimensional tridiagonal system solvers on FPGAs Proceedings of the 36th ACM International Conference on Supercomputing Search
2022 Loop analysis quantifying human impact in a river ecosystem model Ecological Complexity - Elsevier Search
2022 FPGA acceleration of structured-mesh-based explicit and implicit numerical solvers using SYCL Proceedings of the 10th International Workshop on OpenCL Search
2023 Integral representation method based efficient rule optimizing framework for anti-money laundering Journal of Money Laundering Control - Emerald Publishing Limited Search
2022 Towards virtual certification of gas turbine engines with performance-portable simulations 2022 IEEE International Conference on Cluster Computing (CLUSTER) - IEEE Search
2021 Query complexity in modern database DSLs ACM Transactions on Information Systems Search
2022 The design and utilisation of PanSim, a portable pandemic simulator 2022 First Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS) - IEEE Search
2022 Virtual certification of gas turbine engines-visualizing the DLR Rig250 compressor Publication details unavailable Search
2023 Wastewater-based modeling, reconstruction, and prediction for COVID-19 outbreaks in Hungary caused by highly immune evasive variants Water Research - Pergamon Search
2023 Communication-avoiding optimizations for large-scale unstructured-mesh applications with op2 Proceedings of the 52nd International Conference on Parallel Processing Search
2023 Quantifying and comparing the impact of combinations of non-pharmaceutical interventions on the spread of COVID-19 2023 31st Mediterranean Conference on Control and Automation (MED) - IEEE Search
2023 Comparative evaluation of bandwidth-bound applications on the intel xeon cpu max series Proceedings of the SC'23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis Search
2023 Evaluating the performance portability of SYCL across CPUs and GPUs on bandwidth-bound applications Proceedings of the SC'23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis Search
2024 Enabling Bitwise Reproducibility for the Unstructured Computational Motif Applied Sciences - MDPI Search
2024 Computational tools to predict context-specific protein complexes Current Opinion in Structural Biology - Elsevier Current Trends Search
2010 Fadinges csatornamodellek es csatornabecslo protokollok vezetek nelkuli erzekelo halozatok szamara ppke Search
2024 Benchmarking the Evolution of Performance and Energy Efficiency Across Recent Generations of Intel Xeon Processors SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis - IEEE Search
2025 Performance and efficiency: A multi-generational benchmark of modern processors on bandwidth-bound HPC applications Future Generation Computer Systems - North-Holland Search
2025 Smart epidemic control: A hybrid model blending ODEs and agent-based simulations for optimal, real-world intervention planning PLOS Computational Biology - Public Library of Science San Francisco, CA USA Search
2025 Reduced and mixed precision turbulent flow simulations using explicit finite difference schemes arXiv preprint arXiv:2505.20911 Search
2025 Anomaly Detection Algorithms for Real-Time Log Data Analysis at Scale IEEE Access - IEEE Search
2025 Agens alapu modellek jelentosege a koronavirus-jarvany kezeleseben= The Importance of Agent-Based Models in the Management of the Coronavirus Epidemic HUMAN INNOVACIOS SZEMLE Search
2019 PPCU Sam: Open-source face recognition framework PROCEDIA COMPUTER SCIENCE - Elsevier BV Search
2025 Digital Twin Approaches for Interpretable Side Effect Prediction in Drug Discovery bioRxiv - Cold Spring Harbor Laboratory Search
2012 GPU acceleration of medical ultrasound imaging ppke Search
2025 OPS-SENGA+: A Performance-Portable Solver for High-Fidelity Reacting Flow Simulations on CPUs and GPUs International Conference on Numerical Combustion ICNC 2025 - Newcastle University Search

Collaboration Footprint

Regular collaboration channels include the University of Oxford, Imperial College London, and the University of Warwick.

Industrial Relevance

Publication themes are linked to production-level challenges in aerospace simulation, uncertainty-aware modeling, and data-intensive imaging pipelines.

Open Access Strategy

We prioritize repositories and discoverability pathways that make methods reusable by academic and industrial practitioners.