We had some files belonging to three tapepools on the disk buffer. The EOS redundancy was d1::t0, but some of these files were already on tape while others were not.
We tried to resubmit closew for all files, but we got the following error:
“In ArchiveMount::reportJobsBatchTransferred(): got an exception.”
Then we tried to remove sys.archive.error and resubmit closew, but it was not useful.
Next, we removed sys.archive.error and sys.cta.archive.objectstore.id (set to null) and resubmitted closew, but again there was no result.
Also I checked cta-admin fr ls --log, and apart from the reportJobs exception, there were no other exceptions or errors.
We dedicated one tape server to one of these tapepools, and In the last attempt, I changed the taped ArchiveFlushBytesFiles parameter from
32000000000,1000
to
32000000000,1,
and then restarted cta-taped.
After this change, all files went to tape, and the EOS redundancy of the files already on tape was updated to d0:t1.
An interesting point is that other files that were stuck on disk but belonged to other tapepools also went to tape.(from different tapeserver)
I am wondering: when we changed this parameter to 1, what exactly happened?
I would appreciate any help in understanding what happened and any clues about the root cause.