Following @mlichtenberg prompt response to #354, 9,884 out of 13,291 bhl part pdf urls produced HTTP OK responses and content could be accessed over period 2025-08-06/2025-08-07. For detailed tracking logs, see hash://md5/7ebb8d8fd4a1a4cd7131a6e4cb38c5d1.
However, 3,407 bhl part pdf urls appear to continue to produce a redirect to a 500 server error, preventing pdf content from being tracked.
With the first unresolved bhl part pdf (i.e. https://www.biodiversitylibrary.org/partpdf/101203) accessed via:
preston cat\
--remote https://linker.bio\
hash://md5/7ebb8d8fd4a1a4cd7131a6e4cb38c5d1\
| grep hasVersion\
| grep well-known\
| head -1\
| grep -oE "https://www.biodiversitylibrary.org/partpdf/[0-9]+"\
| xargs -L1 curl -IL
HTTP/1.1 302 Found
Cache-Control: private
Content-Type: text/html; charset=utf-8
Location: /error
Set-Cookie: ASP.NET_SessionId=atdrq0bp3suni1535ax0oppr; path=/; HttpOnly; SameSite=Lax
X-Server-Name: BHL1
Content-Security-Policy: frame-ancestors 'self'
Date: Thu, 07 Aug 2025 20:59:59 GMT
Set-Cookie: TS010ecf5c=01a3504f4ca09643f9d5686900e504f4effcce24d3fc0822e213e918f0a6eb7b2a48b79fa752a6ba0e0da289239dac1f84fe60eb10f545ce34050190ac477e8d8ebc1ceadb; Path=/; Secure; HttpOnly
Transfer-Encoding: chunked
HTTP/1.1 500 Internal Server Error
Cache-Control: private
Content-Length: 13465
Content-Type: text/html; charset=utf-8
Set-Cookie: ASP.NET_SessionId=5pygb1uj1m005cfaejfleap5; path=/; HttpOnly; SameSite=Lax
X-Server-Name: BHL1
Content-Security-Policy: frame-ancestors 'self'
Date: Thu, 07 Aug 2025 20:59:59 GMT
Set-Cookie: TS010ecf5c=01a3504f4c68dc5547de84856895c76c0f6fcf717dab6a0539c3f2248a215cdd3cc30bd4c83e55c50fb664e84ffa7c04f920eec4102d4d225d8ad5de63421d1aee5f1a148c; Path=/; Secure; HttpOnly
bhl-blr-20250806-20250807-bhl-partpdf-resolved.txt
bhl-blr-20250806-20250807-bhl-partpdf-unresolved.txt
Following @mlichtenberg prompt response to #354, 9,884 out of 13,291 bhl part pdf urls produced HTTP OK responses and content could be accessed over period 2025-08-06/2025-08-07. For detailed tracking logs, see hash://md5/7ebb8d8fd4a1a4cd7131a6e4cb38c5d1.
However, 3,407 bhl part pdf urls appear to continue to produce a redirect to a 500 server error, preventing pdf content from being tracked.
With the first unresolved bhl part pdf (i.e.
https://www.biodiversitylibrary.org/partpdf/101203) accessed via:bhl-blr-20250806-20250807-bhl-partpdf-resolved.txt
bhl-blr-20250806-20250807-bhl-partpdf-unresolved.txt