Please read these instructions before posting any event on Fermilab Indico

Indico will be unavailable on Wednesday, Nov 20th from 7-7:30 CST due to server maintenance.

May 9 – 12, 2022
Virtual
US/Central timezone
Join us shape the future

Awkward Arrays to RDataFrame and back

May 10, 2022, 8:55 AM
15m
One West (Virtual)

One West

Virtual

**This meeting is held virtually** Registered participants received the video conferencing link on Sunday, 8th May 2022.
Presentation Second Session

Speaker

Ianna Osborne

Description

Awkward Arrays and RDataFrame provide two very different ways of performing calculations at scale. By adding the ability to zero-copy convert between them, users get the best of both. It gives users a better flexibility in mixing different packages and languages in their analysis.

In Awkward Array version 2, the ak.to_rdataframe function presents a view of an Awkward Array as an RDataFrame source. This view is generated on demand and the data is not copied. The column readers are generated based on the run-time type of the views. The readers are passed to a generated source derived from ROOT::RDF::RDataSource.

The ak.from_rdataframe function converts the selected columns as native Awkward Arrays.

We discuss the details of the implementation exploiting JIT techniques. We present examples of analysis of data stored in Awkward Arrays via a high-level interface of an RDataFrame.

We show a few examples of the column definition, applying user-defined filters written in C++, and plotting or extracting the columnar data as Awkward Arrays.

We discuss current limitations and future plans.

Primary authors

Ianna Osborne Jim Pivarski (Fermilab)

Presentation materials