The Ghost Threading Compiler Infrastructure

Welcome to the Ghost Threading Compiler project!

Overview

This repository contains the source code for an LLVM-based compiler that enables Ghost Threading — a software-only prefetching mechanism that utilizes idle Simultaneous Multithreading (SMT) contexts to launch lightweight helper threads. The technique is described in the paper:

Ghost Threading: Helper-Thread Prefetching for Real Systems
Yuxin Guo, Alexandra W. Chadwick, Márton Erdős,, Utpal Bora, Akshay Bhosale, Giacomo Gabrielli and Timothy M. Jones
International Symposium on Microarchitecture (MICRO)
October 2025

Please cite this paper if you produce any work that uses this repository.

Getting the Source Code and Building the compiler

Consult the Getting Started with LLVM page for information on building and running LLVM.

Prerequisites

You can install the required dependencies using the following commands:

For Ubuntu/Debian:

sudo apt-get update
sudo apt-get install python3 llvm clang lld ninja-build cmake

For macOS:

brew install llvm lld python ninja cmake

Getting the Ghost Threading compiler

Clone the repository:

git clone https://github.com/CompArchCam/GhostThreadingCompiler.git

Build the compiler

cd GhostThreadingCompiler
mkdir build && cd build
export LLVMDIR="/llvm/install/path"
cmake -G Ninja \
  -DCMAKE_INSTALL_PREFIX="${LLVMDIR}" \
  -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ \
  -DCMAKE_BUILD_TYPE=Release \
  -DLLVM_OPTIMIZED_TABLEGEN=On \
  -DLLVM_ENABLE_PROJECTS="clang;lld;openmp" \
  -DLLVM_TARGETS_TO_BUILD="X86" \
  -DLLVM_PARALLEL_COMPILE_JOBS=6 \
  -DLLVM_PARALLEL_LINK_JOBS=4 \
  -DLLVM_USE_LINKER=lld \
  ../llvm

ninja install
export PATH="${LLVMDIR}/bin:$PATH"

Testing Installation

Once installed, you can verify the installation by running the following command:

opt --help-hidden | grep ghostthreading
clang --version

This should print the version of the compiler that you have installed.

Compiling test cases

The compiler is pragma driven and an expensive memory load in a loop can be annotated with prefetch intrinsic as show below:

// test.c
#include<stdio.h>
extern int foo(int);
extern int **Data;
extern unsigned Length;
int main() {
  long int Sum = 0;
  #pragma ghost_threading sync_frequency(14) skip_iter(8) serial_max_threshold(100) serial_min_threshold(10)
  for(unsigned i = 0; i < Length; i++) {
    __builtin_prefetch(Data[i+64]);
    Sum += foo(*Data[i]);
  }
  printf("Sum %ld\n", Sum);
}

The hyper-parameters are described in detail in the paper and must be tuned for each workload and target machine to achieve optimal performance.

The Ghost Threading pass is enabled by defaul but can be enabled/disabled with the command line flag -ghostthreading=[true|false].

clang -O3 -w -Wall -std=c11 \
  -mtune=native -march=native \
  -mllvm -ghostthreading=true \
  test.c -o test.out

Compiling and Running Benchmarks

The benchmarks or workloads used to evaluate this automated technique can be fetched from the repository ghost-threading-bmk. These are workloads from gap and htpf as described in the aforementioned MICRO paper. The scripts to compile and execute the baseline, automatic Ghost Threading, and manual Ghost Threading technique can be found in the directory workdir.

The script config.sh sets the relevant flags for each of the techniques.

Acknowledgements

This work was supported by the Engineering and Physical Sciences Research Council (EPSRC), grant EP/W00576X/1, and Arm. Additional data related to this publication is available in the repository at url.

Contribute

We welcome contributions from the community! If you want to improve the project or add new features, follow these steps:

Fork the repository.
Create a new branch (git checkout -b feature-name).
Implement your feature or bug fix.
Run the benchmarks and ensure all tests pass.
Commit your changes and push them to your fork.
Create a pull request describing your changes.

License

LLVM License

Name		Name	Last commit message	Last commit date
Latest commit History 526,003 Commits
.ci		.ci
.github		.github
bolt		bolt
clang-tools-extra		clang-tools-extra
clang		clang
cmake		cmake
compiler-rt		compiler-rt
cross-project-tests		cross-project-tests
flang		flang
kernels		kernels
libc		libc
libclc		libclc
libcxx		libcxx
libcxxabi		libcxxabi
libunwind		libunwind
lld		lld
lldb		lldb
llvm-libgcc		llvm-libgcc
llvm		llvm
mlir		mlir
offload		offload
openmp		openmp
polly		polly
pstl		pstl
runtimes		runtimes
third-party		third-party
utils/bazel		utils/bazel
.clang-format		.clang-format
.clang-tidy		.clang-tidy
.git-blame-ignore-revs		.git-blame-ignore-revs
.gitattributes		.gitattributes
.gitignore		.gitignore
.mailmap		.mailmap
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.TXT		LICENSE.TXT
README.md		README.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

The Ghost Threading Compiler Infrastructure

Table of Contents

Overview

Getting the Source Code and Building the compiler

Prerequisites

For Ubuntu/Debian:

For macOS:

Getting the Ghost Threading compiler

Testing Installation

Compiling test cases

Compiling and Running Benchmarks

Acknowledgements

Contribute

License

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

CompArchCam/GhostThreadingCompiler

Folders and files

Latest commit

History

Repository files navigation

The Ghost Threading Compiler Infrastructure

Table of Contents

Overview

Getting the Source Code and Building the compiler

Prerequisites

For Ubuntu/Debian:

For macOS:

Getting the Ghost Threading compiler

Testing Installation

Compiling test cases

Compiling and Running Benchmarks

Acknowledgements

Contribute

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages