Summary
Since Thursday of last week, tapes have been mounted and unmounted continuously. Normally a drive holds on to a tape until all data has been written. Now it seems to unmount and remount the tape all the time, even when there is more data to write to that tape.
Details
CTA version: 5.11.9.0-1
Operating System and version: AlmaLinux 9.6
Xrootd version: Using dCache
Objectstore backend: Ceph
Relevant logs and/or screenshots
{
“epoch_time”: 1752020946.612130255,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “ERROR”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “In Agent::deleteAndUnregisterSelf: agent still owns objects. Here is a part of the list.”,
“drive_name”: “F1_X”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“agentObject”: “Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0”,
“objects”: “RetrieveRequest-Frontend-ctax-3454028-20250618-15:54:03-0-3553295 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3554330 RetrieveRequest-Frontend-ctax-3454028-20250618-15:54:03-0-3554358 RetrieveRequest-Frontend-ctax-3454028-20250618-15:54:03-0-3554413 RetrieveRequest-Frontend-ctax-3454028-20250618-15:54:03-0-3554477 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3554616 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3554648 RetrieveRequest-Frontend-ctax-3454028-20250618-15:54:03-0-3554934 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3555007 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3555011 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3555100 RetrieveRequest-Frontend-ctax-3454028-20250618-15:54:03-0-3556555 RetrieveRequest-Frontend-ctax-3454028-20250618-15:54:03-0-3556860 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3556867 RetrieveRequest-Frontend-ctax-3454028-20250618-15:54:03-0-3556870 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3556913 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3559215 RetrieveRequest-Frontend-ctax-3454028-20250618-15:54:03-0-3559232 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3559365 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3561331 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3561333 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3561342 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3561347 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3561378 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3586491”,
“startIndex”: 1150,
“endIndex”: 1174,
“totalObjects”: 1290
}
{
“epoch_time”: 1752020946.612216583,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “ERROR”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “In Agent::deleteAndUnregisterSelf: agent still owns objects. Here is a part of the list.”,
“drive_name”: “F1_xx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“agentObject”: “Maintenance-tpsrvx-2765584-20250628-07:25:38-0”,
“objects”: “RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3587928 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3587959 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3587981 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3588033 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3588068 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3588157 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3588390 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3589441 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3589443 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3589663 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3589679 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3589855 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3592652 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3593300 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3593315 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3593330 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3593352 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3595085 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3595093 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3595097 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3595101 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3595102 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3596976 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3596989 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3596996”,
“startIndex”: 1175,
“endIndex”: 1199,
“totalObjects”: 1290
}
{
“epoch_time”: 1752020946.612307238,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “ERROR”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “In Agent::deleteAndUnregisterSelf: agent still owns objects. Here is a part of the list.”,
“drive_name”: “F1_xxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“agentObject”: “Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0”,
“objects”: “RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3597044 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3597124 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3597190 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3597202 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3597204 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3663139 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3663894 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3665562 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3693289 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3693606 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3704977 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3705518 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3705579 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3783350 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3783351 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3784940 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3785691 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3786215 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3787441 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3787443 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3787444 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3787445 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-3787448 RetrieveRequest-Frontend-ctaxx-3454028-20250618-15:54:03-0-3787449 RetrieveRequest-Frontend-ctaxx-3953274-20250618-15:58:28-0-308850”,
“startIndex”: 1200,
“endIndex”: 1224,
“totalObjects”: 1290
}
{
“epoch_time”: 1752020946.612408135,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “ERROR”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “In Agent::deleteAndUnregisterSelf: agent still owns objects. Here is a part of the list.”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“agentObject”: “Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0”,
“objects”: “RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-319324 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-319327 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-329565 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-330413 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-331946 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-332601 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-333843 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-335897 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-335899 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-340469 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-340475 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-340491 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-344336 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-344337 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-344927 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-350100 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-350171 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-350257 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-350483 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-350833 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-351068 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-357115 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-359094 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-359114 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-359177”,
“startIndex”: 1225,
“endIndex”: 1249,
“totalObjects”: 1290
}
{
“epoch_time”: 1752020946.612528872,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “ERROR”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “In Agent::deleteAndUnregisterSelf: agent still owns objects. Here is a part of the list.”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“agentObject”: “Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0”,
“objects”: “RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-359228 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-359266 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-359342 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-362085 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-364585 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-364590 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-365381 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-365392 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-365477 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-365485 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-371067 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-371101 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-371192 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-372136 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-372167 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-372189 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-372549 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-372606 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-372624 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-372635 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-372650 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-372659 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-374349 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-375986 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-377269”,
“startIndex”: 1250,
“endIndex”: 1274,
“totalObjects”: 1290
}
{
“epoch_time”: 1752020946.612654423,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “ERROR”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “In Agent::deleteAndUnregisterSelf: agent still owns objects. Here is a part of the list.”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“agentObject”: “Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0”,
“objects”: “RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-378097 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383441 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383468 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383507 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383518 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383534 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383542 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383556 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383899 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383945 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383956 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383973 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383975 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-383983 RetrieveRequest-Frontend-cta02-3953274-20250618-15:58:28-0-384077”,
“startIndex”: 1275,
“endIndex”: 1289,
“totalObjects”: 1290
}
{
“epoch_time”: 1752020946.617172154,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “CRIT”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “In BackendPopulator::~BackendPopulator(): error deleting agent (cta::exception::Exception). Backtrace follows.”,
“drive_name”: “F1_xxxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”
}
{
“epoch_time”: 1752020946.617291622,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 0,
“traceFrame”: “/lib64/libctacommon.so.0(cta::exception::Backtrace::Backtrace(bool)+0x6b) [0x7f62b8ba9c49]”
}
{
“epoch_time”: 1752020946.617343915,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 1,
“traceFrame”: “/lib64/libctacommon.so.0(cta::exception::Exception::Exception(std::basic_string_view<char, std::char_traits >, bool)+0x91) [0x7f62b8baad11]”
}
{
“epoch_time”: 1752020946.617406205,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 2,
“traceFrame”: “/lib64/libctaobjectstore.so.0(cta::objectstore::Agent::AgentStillOwnsObjects::Exception(std::basic_string_view<char, std::char_traits >, bool)+0x4c) [0x7f62c04bb430]”
}
{
“epoch_time”: 1752020946.617452489,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 3,
“traceFrame”: “/lib64/libctaobjectstore.so.0(cta::objectstore::Agent::removeAndUnregisterSelf(cta::log::LogContext&)+0x67c) [0x7f62c04b9984]”
}
{
“epoch_time”: 1752020946.617496075,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 4,
“traceFrame”: “/lib64/libctaobjectstore.so.0(cta::objectstore::BackendPopulator::~BackendPopulator()+0x2d3) [0x7f62c060a001]”
}
{
“epoch_time”: 1752020946.617539342,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 5,
“traceFrame”: “/lib64/libctaobjectstore.so.0(cta::objectstore::BackendPopulator::~BackendPopulator()+0x27) [0x7f62c060a469]”
}
{
“epoch_time”: 1752020946.617587795,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 6,
“traceFrame”: “/usr/bin/cta-taped() [0x4c16b9]”
}
{
“epoch_time”: 1752020946.617629478,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 7,
“traceFrame”: “/usr/bin/cta-taped() [0x4bd99b]”
}
{
“epoch_time”: 1752020946.617670567,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 8,
“traceFrame”: “/usr/bin/cta-taped() [0x4c22a3]”
}
{
“epoch_time”: 1752020946.617711434,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 9,
“traceFrame”: “/usr/bin/cta-taped() [0x4eaf94]”
}
{
“epoch_time”: 1752020946.617752635,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 10,
“traceFrame”: “/usr/bin/cta-taped() [0x4e9778]”
}
{
“epoch_time”: 1752020946.617798997,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 11,
“traceFrame”: “/usr/bin/cta-taped() [0x4ee867]”
}
{
“epoch_time”: 1752020946.617850908,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 12,
“traceFrame”: “/usr/bin/cta-taped() [0x4ed811]”
}
{
“epoch_time”: 1752020946.617892623,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 13,
“traceFrame”: “/usr/bin/cta-taped() [0x4a69a0]”
}
{
“epoch_time”: 1752020946.617934091,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 14,
“traceFrame”: “/usr/bin/cta-taped() [0x4a66e0]”
}
{
“epoch_time”: 1752020946.617975535,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 15,
“traceFrame”: “/usr/bin/cta-taped() [0x4a61a1]”
}
{
“epoch_time”: 1752020946.618022150,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 16,
“traceFrame”: “/usr/bin/cta-taped() [0x490f05]”
}
{
“epoch_time”: 1752020946.618064386,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 17,
“traceFrame”: “/usr/bin/cta-taped() [0x491ab8]”
}
{
“epoch_time”: 1752020946.618105806,
“local_time”: “2025-07-08T19:29:06-0500”,
“hostname”: “tpsrvf2205”,
“program”: “cta-taped”,
“log_level”: “INFO”,
“pid”: 2765584,
“tid”: 2765584,
“message”: “Stack trace”,
“drive_name”: “F1_xxxx”,
“instance”: “prd”,
“sched_backend”: “cephUser”,
“errorMessage”: “In Agent::removeAndUnregisterSelf: agent (agentObject=Maintenance-tpsrvf2205-2765584-20250628-07:25:38-0) still owns objects. Here’s the first few: RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280460 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1280563 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1283258 RetrieveRequest-Frontend-cta01-3454028-20250618-15:54:03-0-1940365 [… trimmed at 3 of 1290]”,
“traceFrameNumber”: 18,
“traceFrame”: “/lib64/libc.so.6(+0x295d0) [0x7f62b7a295d0]”
}
Possible causes
Based on the critical error message, I suspect that Ceph and CTA may have gotten out of sync, but I am not sure how. I do know roughly when it started: around Wednesday/Thursday of last week.
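
In case it helps with triage, here is a minimal Python sketch for summarising the paginated "agent still owns objects" entries above. It only parses the pasted log text and does not talk to CTA or Ceph. The function names are made up for this post. It assumes that each log entry is one JSON object, that the typographic quotes in the paste stand in for plain ASCII quotes, and that the part of an object name before the last "-" identifies the process that created the request (a guess based on the names in the log, not on CTA documentation).

```python
import json
from collections import Counter


def normalize_quotes(text):
    """Turn typographic quotes (as rendered by the forum) back into ASCII quotes."""
    return (text.replace("\u201c", '"').replace("\u201d", '"')
                .replace("\u2018", "'").replace("\u2019", "'"))


def iter_json_objects(text):
    """Yield every top-level JSON object found in a blob of concatenated log entries."""
    decoder = json.JSONDecoder()
    pos = 0
    while True:
        start = text.find("{", pos)
        if start == -1:
            return
        try:
            obj, end = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            # Not a parsable object at this brace: skip it and keep scanning.
            pos = start + 1
            continue
        yield obj
        pos = end


def summarize_owned_objects(text):
    """Group the listed object names by agent, keeping the reported total."""
    summary = {}
    for entry in iter_json_objects(normalize_quotes(text)):
        if "agentObject" not in entry or "objects" not in entry:
            continue  # e.g. the stack trace entries
        agent = summary.setdefault(
            entry["agentObject"],
            {"listed": set(), "total": entry.get("totalObjects")},
        )
        agent["listed"].update(entry["objects"].split())
    return summary


def count_by_creator(object_names):
    """Count objects per name prefix (everything before the last '-').

    Assumption: the prefix identifies the creating process; this is inferred
    from the names in the log only.
    """
    return Counter(name.rsplit("-", 1)[0] for name in object_names)


# Example use (the path is a placeholder):
#   with open("cta-taped.log", encoding="utf-8") as f:
#       for agent, info in summarize_owned_objects(f.read()).items():
#           print(agent, len(info["listed"]), "listed of", info["total"])
#           print(count_by_creator(info["listed"]).most_common())
```

Comparing the number of listed objects with `totalObjects` shows whether the excerpt covers the whole list, and the per-prefix counts show which creating processes the stuck requests came from.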
