Collaboration Footprint
Regular collaboration channels include the University of Oxford, Imperial College London, and the University of Warwick.
The list below is a representative cross-section of our topics and collaborations. Use sorting and filtering to quickly locate work by year, theme, or application area.
Focus areas include DSL design, CFD acceleration, reproducibility, mixed precision, public health simulation, and medical imaging workloads.
| 2012 | OP2: An active library framework for solving unstructured mesh-based applications on multi-core and many-core architectures | 2012 Innovative Parallel Computing (InPar) - IEEE | Search |
| 2012 | Efficient sparse matrix-vector multiplication on cache-based GPUs | 2012 Innovative Parallel Computing (InPar) - IEEE | Search |
| 2013 | Designing OP2 for GPU architectures | Journal of Parallel and Distributed Computing - Academic Press | Search |
| 2013 | Design and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems | Parallel Computing - North-Holland | Search |
| 2012 | An analytical study of loop tiling for a large-scale unstructured mesh application | 2012 SC Companion: High Performance Computing, Networking Storage and Analysis - IEEE | Search |
| 2014 | Trends in high-performance computing for engineering calculations | Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences - The Royal Society | Search |
| 2014 | Vectorizing unstructured mesh computations for many-core architectures | Proceedings of Programming Models and Applications on Multicores and Manycores | Search |
| 2015 | Acceleration of a full-scale industrial CFD application with OP2 | IEEE Transactions on Parallel and Distributed Systems - IEEE | Search |
| 2015 | Finite element algorithms and data structures on graphical processing units | International Journal of Parallel Programming - Springer US Boston | Search |
| 2015 | A comparison between parallelization approaches in molecular dynamics simulations on GPUs | Journal of computational chemistry | Search |
| 2014 | The OPS domain specific abstraction for multi-block structured grid computations | 2014 Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing - IEEE | Search |
| 2014 | Rolls Royce Hydra CFD Code on GPUs using OP2 Abstraction | GPU Technology Conference (GTC) | Search |
| 2014 | GPU implementation of finite difference solvers | 2014 Seventh Workshop on High Performance Computational Finance - IEEE | Search |
| 2012 | Op2 airfoil example | URL https://citeseerx. ist. psu. edu/document | Search |
| 2014 | Abstraction and Implementation of Unstructured Grid Algorithms on Massively Parallel Heterogeneous Architectures | Pazmany Peter Katolikus Egyetem | Search |
| 2014 | Performance analysis of a high-level abstractions-based hydrocode on future computing systems | International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems - Springer International Publishing Cham | Search |
| 2012 | OP2 Developers Guide-Distributed Memory (MPI) Parallelisation | Publication details unavailable | Search |
| 2014 | High-level abstractions for performance, portability and continuity of scientific software on future computing systems | Technical report, April | Search |
| 2015 | High-level Abstractions for Performance, Portability and Continuity of Scientific Software on Future Computing Systems-CloverLeaf 3D | Publication details unavailable | Search |
| 2015 | Analysis of parallel processor architectures for the solution of the Black-Scholes PDE | 2015 IEEE International Symposium on Circuits and Systems (ISCAS) - IEEE | Search |
| 2016 | Block-structured compressible Navier-Stokes solution using the OPS high-level abstraction | International Journal of Computational Fluid Dynamics - Taylor & Francis | Search |
| 2015 | Design and development of domain specific active libraries with proxy applications | 2015 IEEE International Conference on Cluster Computing - IEEE | Search |
| 2015 | AmgX: A library for GPU accelerated algebraic multigrid and preconditioned iterative methods | SIAM Journal on Scientific Computing - Society for Industrial and Applied Mathematics | Search |
| 2015 | Benchmarking the IBM Power8 processor | CASCON | Search |
| 2016 | Auto-vectorizing a large-scale production unstructured-mesh CFD application | Proceedings of the 3rd Workshop on Programming Models for SIMD/Vector Processing | Search |
| 2013 | Op2 c++ user's manual | Publication details unavailable | Search |
| 2013 | Tsunami simulation using the OP2 parallel framework | Publication details unavailable | Search |
| 2016 | High performance computing on the ibm power8 platform | International Conference on High Performance Computing - Springer International Publishing Cham | Search |
| 2016 | Development of strategy and procedure for assessing portability of industrial codes to GPUs hardware: the TITAN example | Publication details unavailable | Search |
| 2017 | Loop tiling in large-scale stencil codes at run-time with OPS | IEEE Transactions on Parallel and Distributed Systems - IEEE | Search |
| 2019 | Low complexity algorithmic trading by feedforward neural networks | Computational Economics - Springer US New York | Search |
| 2017 | Achieving performance portability for a heat conduction solver mini-application on modern multi-core systems | 2017 IEEE International Conference on Cluster Computing (CLUSTER) - IEEE | Search |
| 2012 | Designing op2 for gpu architectures | Journal of Parallel and Distributed Computing | Search |
| 2017 | Beyond 16GB: out-of-core stencil computations | Proceedings of the Workshop on Memory Centric Programming for HPC | Search |
| 2017 | Comparison of parallelisation approaches, languages, and compilers for unstructured mesh algorithms on GPUs | International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems - Springer International Publishing Cham | Search |
| 2019 | Locality optimized unstructured mesh algorithms on GPUs | Journal of Parallel and Distributed Computing - Academic Press | Search |
| 2018 | The VOLNA-OP2 tsunami code (version 1.5) | Geoscientific Model Development - Copernicus GmbH | Search |
| 2016 | A fast and flexible toolbox for tracking brain connections in diffusion MRI datasets using GPUs | 22nd Annual Meeting of the Organization for Human Brain Mapping (OHBM), Geneva, Switzerland | Search |
| 2019 | Using GPUs to accelerate computational diffusion MRI: From microstructure estimation to tractography and connectomes | Neuroimage - Academic Press | Search |
| 2018 | An abstraction for local computations on structured meshes and its extension to handling multiple materials | CNNA 2018; The 16th International Workshop on Cellular Nanoscale Networks and their Applications - VDE | Search |
| 2014 | Cache-Blocking Tiling of Large Stencil Codes at Runtime | Performance Computing | Search |
| 2018 | Op2-clang: A source-to-source translator using clang/llvm libtooling | 2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC) - IEEE | Search |
| 2018 | Heterogeneous cpu-gpu execution of stencil applications | 2018 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC) - IEEE | Search |
| 2015 | OP2-TPDS2015-DATA | University of Oxford | Search |
| 2019 | Improving resilience of scientific software through a domain-specific approach | Journal of Parallel and Distributed Computing - Academic Press | Search |
| 2019 | Large-scale performance of a DSL-based multi-block structured-mesh application for Direct Numerical Simulation | Journal of Parallel and Distributed Computing - Academic Press | Search |
| 2019 | Batch solution of small PDEs with the OPS DSL | International Conference on High Performance Computing - Springer International Publishing Cham | Search |
| 2015 | rand | Publication details unavailable | Search |
| 2019 | PPCU Sam: Open-source face recognition framework | Procedia Computer Science - Elsevier | Search |
| 2019 | Performance portability of multi-material kernels | 2019 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC) - IEEE | Search |
| 2019 | GPU support for automatic generation of finite-differences Stencil Kernels | Latin American High Performance Computing Conference - Springer International Publishing Cham | Search |
| 2020 | Productivity, performance, and portability for computational fluid dynamics applications | Computers & Fluids - Pergamon | Search |
| 2019 | GPU Support for Automatic Generation of Finite-Differences Stencil Kernels | arXiv e-prints | Search |
| 2020 | Performance portability of the mg-cfd mini-app with sycl | Proceedings of the International Workshop on OpenCL | Search |
| 2020 | Bitwise Reproducible task execution on unstructured mesh applications | 2020 20th IEEE/ACM international symposium on cluster, cloud and Internet computing (CCGRID) - IEEE | Search |
| 2020 | Automatic parallel implementations of adjoint codes for structured mesh applications | 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID) - IEEE | Search |
| 2020 | GPU Support for Automatic Generation | High Performance Computing: 6th Latin American Conference, CARLA 2019, Turrialba, Costa Rica, September 25-27, 2019, Revised Selected Papers - Springer Nature | Search |
| 2021 | High-level FPGA accelerator design for structured-mesh-based explicit numerical solvers | 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS) - IEEE | Search |
| 2020 | Modernising an industrial cfd application | 2020 Eighth International Symposium on Computing and Networking Workshops (CANDARW) - IEEE | Search |
| 2015 | reguly/volna: VOLNA-OP2 | Github | Search |
| 2011 | OP-DSL/OP2-Common: OP2: open-source framework for the execution of unstructured grid applications on clusters of GPUs or multi-core CPUs | Github | Search |
| 2021 | Under the hood of sycl-an initial performance analysis with an unstructured-mesh cfd application | International Conference on High Performance Computing - Springer International Publishing Cham | Search |
| 2022 | Microsimulation based quantitative analysis of COVID-19 management strategies | PLoS computational biology - Public Library of Science San Francisco, CA USA | Search |
| 2021 | Automatic Parallelisation of Sturctured Mesh Computations with SYCL | 2021 IEEE International Conference on Cluster Computing (CLUSTER) - IEEE | Search |
| 2021 | Scalable many-core algorithms for tridiagonal solvers | Computing in Science & Engineering - IEEE | Search |
| 2021 | Predictive analysis of large-scale coupled cfd simulations with the cpx mini-app | 2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC) - IEEE | Search |
| 2022 | High throughput multidimensional tridiagonal system solvers on FPGAs | Proceedings of the 36th ACM International Conference on Supercomputing | Search |
| 2022 | Loop analysis quantifying human impact in a river ecosystem model | Ecological Complexity - Elsevier | Search |
| 2022 | FPGA acceleration of structured-mesh-based explicit and implicit numerical solvers using SYCL | Proceedings of the 10th International Workshop on OpenCL | Search |
| 2023 | Integral representation method based efficient rule optimizing framework for anti-money laundering | Journal of Money Laundering Control - Emerald Publishing Limited | Search |
| 2022 | Towards virtual certification of gas turbine engines with performance-portable simulations | 2022 IEEE International Conference on Cluster Computing (CLUSTER) - IEEE | Search |
| 2021 | Query complexity in modern database DSLs | ACM Transactions on Information Systems | Search |
| 2022 | The design and utilisation of PanSim, a portable pandemic simulator | 2022 First Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS) - IEEE | Search |
| 2022 | Virtual certification of gas turbine engines-visualizing the DLR Rig250 compressor | Publication details unavailable | Search |
| 2023 | Wastewater-based modeling, reconstruction, and prediction for COVID-19 outbreaks in Hungary caused by highly immune evasive variants | Water Research - Pergamon | Search |
| 2023 | Communication-avoiding optimizations for large-scale unstructured-mesh applications with op2 | Proceedings of the 52nd International Conference on Parallel Processing | Search |
| 2023 | Quantifying and comparing the impact of combinations of non-pharmaceutical interventions on the spread of COVID-19 | 2023 31st Mediterranean Conference on Control and Automation (MED) - IEEE | Search |
| 2023 | Comparative evaluation of bandwidth-bound applications on the intel xeon cpu max series | Proceedings of the SC'23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis | Search |
| 2023 | Evaluating the performance portability of SYCL across CPUs and GPUs on bandwidth-bound applications | Proceedings of the SC'23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis | Search |
| 2024 | Enabling Bitwise Reproducibility for the Unstructured Computational Motif | Applied Sciences - MDPI | Search |
| 2024 | Computational tools to predict context-specific protein complexes | Current Opinion in Structural Biology - Elsevier Current Trends | Search |
| 2010 | Fadinges csatornamodellek es csatornabecslo protokollok vezetek nelkuli erzekelo halozatok szamara | ppke | Search |
| 2024 | Benchmarking the Evolution of Performance and Energy Efficiency Across Recent Generations of Intel Xeon Processors | SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis - IEEE | Search |
| 2025 | Performance and efficiency: A multi-generational benchmark of modern processors on bandwidth-bound HPC applications | Future Generation Computer Systems - North-Holland | Search |
| 2025 | Smart epidemic control: A hybrid model blending ODEs and agent-based simulations for optimal, real-world intervention planning | PLOS Computational Biology - Public Library of Science San Francisco, CA USA | Search |
| 2025 | Reduced and mixed precision turbulent flow simulations using explicit finite difference schemes | arXiv preprint arXiv:2505.20911 | Search |
| 2025 | Anomaly Detection Algorithms for Real-Time Log Data Analysis at Scale | IEEE Access - IEEE | Search |
| 2025 | Agens alapu modellek jelentosege a koronavirus-jarvany kezeleseben= The Importance of Agent-Based Models in the Management of the Coronavirus Epidemic | HUMAN INNOVACIOS SZEMLE | Search |
| 2019 | PPCU Sam: Open-source face recognition framework | PROCEDIA COMPUTER SCIENCE - Elsevier BV | Search |
| 2025 | Digital Twin Approaches for Interpretable Side Effect Prediction in Drug Discovery | bioRxiv - Cold Spring Harbor Laboratory | Search |
| 2012 | GPU acceleration of medical ultrasound imaging | ppke | Search |
| 2025 | OPS-SENGA+: A Performance-Portable Solver for High-Fidelity Reacting Flow Simulations on CPUs and GPUs | International Conference on Numerical Combustion ICNC 2025 - Newcastle University | Search |
Regular collaboration channels include the University of Oxford, Imperial College London, and the University of Warwick.
Publication themes are linked to production-level challenges in aerospace simulation, uncertainty-aware modeling, and data-intensive imaging pipelines.
We prioritize repositories and discoverability pathways that make methods reusable by academic and industrial practitioners.