Automatic Fence Insertion: A Static Analysis Approach | Schemes and Mind Maps Computer Science

Don’t sit on the fence⋆

A static analysis approach to automatic fence insertion

Jade Alglave1, Daniel Kroening2, Vincent Nimal2, and Daniel Poetzl2

1University College London 2University of Oxford

Abstract Modern architectures rely on memory fences to prevent undesired weak-

enings of memory consistency. As the fences’ semantics may be subtle, the au-

tomation of their placement is highly desirable. But precise methods for restoring

consistency do not scale to deployed systems code. We choose to trade some pre-

cision for genuine scalability: our technique is suitable for large code bases. We

implement it in our new musketeer tool, and detail experiments on more than

350 executables of packages found in Debian Linux 7.1, e.g. memcached (about

10000 LoC).

1 Introduction

Concurrent programs are hard to design and implement, especially when running on

multiprocessor architectures. Multiprocessors implement weak memory models, which

feature e.g. instruction reordering,store buffering (both appearing on x86), or store

atomicity relaxation (a particularity of Power and ARM). Hence, multiprocessors allow

more behaviours than Lamport’s Sequential Consistency (SC) [20], a theoretical model

where the execution of a program corresponds to an interleaving of the different threads.

This has a dramatic effect on programmers, most of whom learned to program with SC.

Fortunately, architectures provide special fence (or barrier) instructions to prevent

certain behaviours. Yet both the questions of where and how to insert fences are con-

tentious, as fences are architecture-specific and expensive.

Attempts at automatically placing fences include Visual Studio 2013, which offers

an option to guarantee acquire/release semantics (we study the performance impact of

this policy in Sec. 2). The C++11 standard provides an elaborate API for inter-thread

communication, giving the programmer some control over which fences are used, and

where. But the use of such APIs might be a hard task, even for expert programmers. For

example, Norris and Demsky reported a bug found in a published C11 implementation

of a work-stealing queue [27].

We address here the question of how to synthesise fences, i.e. automatically place

them in a pro gram to enforce robustness/stability [9,5] (which implies SC). This should

lighten the programmer’s burden. The fence synthesis tool needs to be based on a pre-

cise model of weak memory. In verification, models commonly adopt an operational

style, where an execution is an interleaving of transitions accessing the memory (as

in SC). To address weaker architectures, the models are augmented with buffers and

⋆Supported by SRC/2269.002, EPSRC/H017585/1 and ERC/280053.

Automatic Fence Insertion: A Static Analysis Approach, Schemes and Mind Maps of Computer Science

Related documents

Partial preview of the text

Download Automatic Fence Insertion: A Static Analysis Approach and more Schemes and Mind Maps Computer Science in PDF only on Docsity!

Don’t sit on the fence⋆

A static analysis approach to automatic fence insertion

1 Introduction

2 Motivation

3 Related work

4 Axiomatic memory model

5 Static detection of critical cycles

6 Synthesis

7 Implementation and Experiments

References