DUNE Global Computing Sites

US/Pacific
https://fnal.zoom.us/j/636941598

Andrew McNab, Heidi Schellman (Oregon State), Kenneth Herner (Fermilab), Michael Kirby (FNAL), Steven Timm (Fermilab), Stuart Fuess (Fermilab), Peter Clarke (University of Edinburgh)
Description
Weekly meeting for sites doing DUNE computing
    • 07:00 - 07:20
      General discussion 20m
      Speakers: Dr Andrew McNab (University of Manchester), Heidi Schellman (Oregon State), Dr Michael Kirby (FNAL), Dr Steven Timm (Fermilab), Prof. Peter Clarke (University of Edinburgh)

      Sites meeting

      Manchester: more files are being transferred now; we are not sure why.

       

      RAL user area: want to manage it with sam4users; won’t release

       

      Ernesto Kemp (Unicamp)

      There is a proposal for several Brazilian institutions to get access to the supercomputing center.

      Several science institutions are being brought together to make the proposal, mostly for DUNE.

       

      Production 

      A fairly large fraction of jobs seems to be failing with read buffer errors.

      This is not specific to RAL.

       

      It can be one of several things:

      Either the stream is interrupted due to network problems,

      or we spend a very long time on one event, longer than the timeout, and the xrootd connection dies.

      We have lengthened the timeout.

       

      It can also happen without hitting that timeout: the connection is closed if no reply arrives within 60 s.

       

      Will try to lengthen that timeout as well.

       

      In the recovery pass we read in the whole file instead of streaming.
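
      A minimal sketch of that stream-first, whole-file-on-recovery pattern, for illustration only. The function and its three callables (process_stream, fetch_whole_file, process_local) are hypothetical names, not part of any DUNE or xrootd tooling; it also assumes a streaming read failure surfaces as an OSError.

```python
import os
import tempfile


def process_with_recovery(url, process_stream, fetch_whole_file, process_local):
    """Try to process a remote file by streaming; on a read error, copy the
    whole file to local scratch and process the local copy instead.

    All three callables are placeholders supplied by the caller:
      process_stream(url)          -> result, may raise OSError on a read failure
      fetch_whole_file(url, path)  -> copies the complete file to `path`
      process_local(path)          -> result, reads the local copy
    """
    try:
        # First pass: stream events directly from the remote source.
        return process_stream(url)
    except OSError:
        # Recovery pass: no long-lived remote connection is kept open while
        # events are processed, so slow events cannot trip a read timeout.
        with tempfile.TemporaryDirectory() as scratch:
            local_path = os.path.join(scratch, "recovered_input")
            fetch_whole_file(url, local_path)
            return process_local(local_path)
```

      The trade-off is extra scratch space and an up-front copy in exchange for not holding a remote connection open during slow events.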

       

      Update on perfSONAR matrix: no update; J. Hays will try to get one by next week.

       

      Anything on CRIC?

       

      A. McNab: There will be a DUNE-dedicated instance sometime within the week.

       

      Site renaming: tagging with DUNE_GLIDEINSite

       

      Now that production is a bit lighter, will start requesting this.

       

      Ken shows the site distribution plot from recent production to date.