You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
With multi-valued metadata (#38) the payload metadata "Actual-Content-Length" and "Trailing-Slop-Length" is duplicated in WAT records stemming from WARC metadata and WARC response records. Here one example:
this is also cumbersome, because other WARC types, e.g. WARC request, have the first value (4), as the value is set only once and is never overwritten.
unfortunately, the documentation of "Trailing-Slop-Length" ("Number of trailing slop bytes" in the WAT spec is not really useful to understand which of the two values is the correct one. Both make sense: 4 bytes (\r\n\r\n) used as WARC record separator resp. zero superfluous bytes.
With multi-valued metadata (#38) the payload metadata "Actual-Content-Length" and "Trailing-Slop-Length" is duplicated in WAT records stemming from WARC metadata and WARC response records. Here one example:
The reason is that these values are set (or appended) from the classes WARCResource and WARCMetaDataResourceFactory resp. HTTPHeadersResourceFactory.
4is set in WARCResource while0is set in the factory classes. Before Make MetaData multi-valued to preserve values of repeating WARC and HTTP headers #38 the last value0made into the WAT record.4), as the value is set only once and is never overwritten.\r\n\r\n) used as WARC record separator resp. zero superfluous bytes.Anyway, the duplicated values should be dropped.