2024 OFA Virtual Workshop Sessions

The Annual OFA Workshop is a premier means of fostering collaboration among those who develop fabrics, deploy fabrics, and create applications that rely on fabrics. It is the only event of its kind where fabric developers and users can discuss emerging fabric technologies, collaborate on future industry requirements, and address problems that exist today.

Day 1
Monday, April 22
8:00 am - 2:00 pm PT

Opening Remarks
Phil Cayton, Intel
8:00-8:05 am PT


Pavan Balaji, Meta
8:05-9:00 am PT


Session 1
“OFI 2.0 Update” 

Jianxin Xiong, Intel
9:00-9:30 am PT


15-minute break
9:30-9:45 am PT

Session 2
“Status of OpenFabrics Interfaces (OFI) Support in MPICH”

Yanfei Guo, Argonne National Laboratory
9:45-10:15 am PT


Session 3
"Accelerating MPI AllReduce Communication with Efficient GPU-Based Compression Schemes on Modern GPU Clusters"

Hari Subramoni and Qinghua Zhou, The Ohio State University
10:15-10:45 am PT


15-minute break
10:45-11:00 am PT

Session 4
"High Performance & Scalable MPI library over Broadcom RoCE"

Mustafa Abduljabbar, The Ohio State University; Hemal Shah, Broadcom Inc; and Shulei Xu, The Ohio State University
11:00-11:30 am PT


Lunch and email break
11:30 am - 1:00 pm PT

Session 5
"Scaling Large Language Model Training using Hybrid GPU-based Compression in MVAPICH"

Speakers: Aamir Shafi and Lang Xu, The Ohio State University
1:00-1:30 pm PT


Session 6
"OFI Integrated Shared Memory Offload"

Speakers: Alexia Ingerson, Intel; Shi Jin, Amazon; and Amir Shehata, Oak Ridge National Laboratory
1:30-2:00 pm PT


Day 2
Tuesday, April 23
8:00 am - 2:00 pm PT

Session 7
"Managing Composable Disaggregated Infrastructure With OFA Sunfish"

Christian Pinto, IBM Research Europe; Michael Aguilar, Sandia National Laboratories; Phil Cayton, Intel; Russ Herrell, Hewlett Packard Enterprise; and Brian Pan, H3 Platform
8:00-8:30 am PT


Session 8
"An Integrated Deep Reinforcement Learning Agent for Sunfish and HPC Workload Manager Composable Disaggregated Resource Scheduling"

Speakers: Catherine Appleby and Michael Aguilar, Sandia National Laboratories
8:30-9:00 am PT


Session 9
"Cornelis Networks CN5000 Adapter and Software Update"

Dennis Dalessandro, Cornelis Networks
9:00-9:30 am PT


15-minute break
9:30-9:45 am PT

Session 10
"System Composability Using CXL"

Kurtis Bowman, CXL Consortium MWG Co-Chair
9:45-10:15 am PT


Session 11
"Optimized All-to-all Connection Establishment for High-Performance MPI Libraries over
Mustafa Abduljabbar and Dhabaleswar Panda, The Ohio State University
10:15-10:45 am PT


Session 12
"RecoNIC: RDMA-enabled Compute Offloading on FPGA-based SmartNIC"
Speaker: Guanwen Zhong, AMD
10:45-11:15 am PT


Session 13
"Designing In-Network Computing Aware Reduction Collectives in MPI"
Speakers: Dhabaleswar Panda and Bharath Ramesh, The Ohio State University
11:15-11:45 am PT


Lunch and email break
11:45 am - 1:00 pm PT

"How to setup RDMA CI using the FSDP cluster" and "How to do manual RDMA testing using the FSDP cluster"
Doug Ledford, Red Hat; and Jeremy Spewock, UNH InterOperability Lab (IOL)
1:00-1:55 pm PT


Closing Remarks
Doug Ledford
1:55-2:00 pm PT