Hello,
Does anyone have an idea why the scheduler sometimes selects very few files for writing from a tapepool despite having thousands in the queue? In this example, we have around 23k files queued, but only 13 get selected for a mount, which then leads to a lot of mounts and dismounts:
Oct 24 11:54:12 tpm102 cta-taped: LVL="INFO" PID="22130" TID="22130" MSG="In Scheduler::getNextMountDryRun(): Found a potential mount (archive)" thread="MainThread" tapeDrive="rz1-2,3,1" tapePool="MID-202302" tapeVid="L09876" mountType="ARCHIVE_FOR_USER" existingMountsDistinctTypeForThisTapepool="0" existingMountsBasicTypeForThisVo="0" bytesQueued="196757071817215" minBytesToWarrantMount="500000000000" filesQueued="23300" minFilesToWarrantMount="1000" oldestJobAge="310740" youngestJobAge="879" minArchiveRequestAge="1000" getMountInfoTime="0.016226" getTapeInfoTime="0.043179" candidateSortingTime="0.005268" getTapeForWriteTime="0.006439" decisionTime="0.000078" schedulerDbTime="0.016226" catalogueTime="0.049618"
Oct 24 11:54:13 tpm102 cta-taped: LVL="INFO" PID="22130" TID="22130" MSG="In Algorithms::popNextBatch(): did one round of elements retrieval." thread="MainThread" tapeDrive="rz1-2,3,1" tapeVid="L09876" mountId="2392" C="ArchiveQueueToTransferForUser" tapepool="MID-202302" containerAddress="ArchiveQueueToTransferForUser-MID-202302-Frontend-cta-frontend.desy.de-60178-20231019-14:15:59-0-7" filesAdded="13" bytesAdded="108163318755" filesBefore="0" bytesBefore="0" filesAfter="13" bytesAfter="108163318755" queueJobsBefore="23300" queueBytesBefore="196757071817215" queueJobsAfter="23287" queueBytesAfter="196648908498460" findLockFetchQueueTime="0.001930" jobSelectionTime="0.054522" ownershipAdditionTime="0.000037" asyncUpdateLaunchTime="0.005853" asyncUpdateCompletionTime="0.069838" updateResultProcessingTime="0.000224" containerUpdateTime="0.198190" containerTrimmingTime="0.000057" containerUnlockTime="0.000431" structureProcessingTime="0.004213"
Oct 24 11:54:13 tpm102 cta-taped: LVL="INFO" PID="22130" TID="22130" MSG="In Algorithms::popNextBatch(): elements retrieval complete." thread="MainThread" tapeDrive="rz1-2,3,1" tapeVid="L09876" mountId="2392" C="ArchiveQueueToTransferForUser" tapepool="MID-202302" files="13" findLockFetchQueueTime="0.001930" jobSelectionTime="0.054522" ownershipAdditionTime="0.000037" asyncUpdateLaunchTime="0.005853" asyncUpdateCompletionTime="0.069838" updateResultProcessingTime="0.000224" containerUpdateTime="0.198190" containerTrimmingTime="0.000057" containerUnlockTime="0.000431" structureProcessingTime="0.004213" schedulerDbTime="0.336694" iterationCount="1"
It does not happen often, but it mostly affects tapepools with a lot of data.
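In case it helps to compare batches across mounts, here is a minimal Python sketch that pulls the key="value" fields out of a popNextBatch() line like the one above and works out how much of the queue was actually popped. It is not part of CTA; the helper names `parse_fields` and `batch_summary` are made up for illustration, and the sample line is a shortened copy of the log above that keeps only the fields used.

```python
import re

# Matches key="value" pairs as they appear in the cta-taped log lines above.
FIELD_RE = re.compile(r'(\w+)="([^"]*)"')


def parse_fields(line: str) -> dict:
    """Return the key="value" fields of one log line as a dict of strings."""
    return dict(FIELD_RE.findall(line))


def batch_summary(line: str) -> dict:
    """Summarise one popNextBatch() line (hypothetical helper, not a CTA tool)."""
    f = parse_fields(line)
    files_added = int(f["filesAdded"])
    bytes_added = int(f["bytesAdded"])
    queue_jobs = int(f["queueJobsBefore"])
    queue_bytes = int(f["queueBytesBefore"])
    return {
        "files_added": files_added,
        "bytes_added": bytes_added,
        # Share of the queued files that ended up in this batch.
        "fraction_of_files": files_added / queue_jobs if queue_jobs else 0.0,
        # Average size of the popped files versus the whole queue.
        "avg_popped_file_bytes": bytes_added / files_added if files_added else 0.0,
        "avg_queued_file_bytes": queue_bytes / queue_jobs if queue_jobs else 0.0,
    }


# Shortened copy of the popNextBatch() line above (only the fields used here).
sample = (
    'MSG="In Algorithms::popNextBatch(): did one round of elements retrieval." '
    'tapepool="MID-202302" filesAdded="13" bytesAdded="108163318755" '
    'queueJobsBefore="23300" queueBytesBefore="196757071817215"'
)

summary = batch_summary(sample)
for key, value in summary.items():
    print(key, value)
```

For the line above, this gives 13 files totalling about 108 GB, roughly 8.3 GB per file, against a queue-wide average of about 8.4 GB per file, so the popped files are not unusually large compared with the rest of the queue.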
Mwai