Title: Dynamic Analysis of Data Inconsistencies and Data Races in OpenMP Programs

 

Date: Monday, May 6, 2024

Time: 2:00 pm - 4:00 pm ET

Location: Klaus 3402 and Virtual (https://gatech.zoom.us/j/3092779704?pwd=SDVZaTlkTFpsZDRoYUU3Y0pzdmNvdz09)

Lechen Yu

Ph.D. Student

School of Computer Science

Georgia Institute of Technology

 

Committee:

Dr. Vivek Sarkar (Advisor) – School of Computer Science, Georgia Institute of Technology

Dr. Santosh Pande – School of Computer Science, Georgia Institute of Technology

Dr. Qirun Zhang – School of Computer Science, Georgia Institute of Technology

Dr. Hyesoon Kim – School of Computer Science, Georgia Institute of Technology

Dr. Tiago Cogumbreiro – Department of Computer Science, University of Massachusetts Boston

 

Abstract:

 

OpenMP is a popular intra-node parallel programming model that supports several

problem decomposition approaches, including task parallelism, data parallelism, and heterogeneous 

parallelism. When OpenMP introduces new parallel paradigms or features, it must

ensure that these additions align with the existing constructs to maintain consistency and

avoid unspecified behaviors. New features can result in revisions to the behavior of existing

constructs, which may in turn require programmers to re-evaluate their understanding of

existing constructs and adapt their code accordingly. Consequently, writing correct OpenMP

programs can be challenging even for experienced programmers.

 

To alleviate the intricacy of writing correct OpenMP programs, this proposal outlines

several dynamic analysis techniques that help programmers identify programming errors in

OpenMP programs. First, we describe various studies on device offloading, a recent OpenMP

feature still in its developmental phase. Our studies reveal discrepancies between the LLVM

implementation and the intended runtime behavior of device offloading. Additionally,

erroneous usage of device offloading constructs can lead to an assortment of memory

anomalies. Since these memory anomalies can generate disparities between host variables

and their corresponding counterparts on accelerator devices, we classify such bugs as data

inconsistencies. By establishing permissible memory accesses on the host and accelerator,

this proposal introduces a dynamic approach to detect data inconsistencies automatically.

The dynamic approach leverages a per-variable state transition model, which can be used to

establish the validity of the memory location before executing any memory accesses. Beyond

data inconsistencies, this proposal also delves into novel dynamic approaches for identifying

data races in OpenMP programs. By extending the SPD3 race detection algorithm, originally

designed for async-finish task parallelism, our dynamic race detection approach can handle a

large subset of parallel constructs in OpenMP, thereby checking more precise happens-before

relations among tasks relative to existing per-thread vector-clock-based approaches.