Eleven Missing SEC Filings
Background
I reprocessed the SEC on Monday, using 500 CF Containers to process 18.48 million filings in ~2 hours. It cost $166.50 in R2 Class A operations, $87.77 in Containers Egress, $33.01 in Container Memory, and $13.01 in Container vCPU.
Eleven filings failed to process, due to not existing on the SEC website.
Missing
These filings have index urls, but are not accessible
https://www.sec.gov/Archives/edgar/data/000000000007021812/0000000000-07-021812-index.html
https://www.sec.gov/Archives/edgar/data/000000000006058958/0000000000-06-058958-index.html
https://www.sec.gov/Archives/edgar/data/000090946595000005/0000909465-95-000005-index.html
https://www.sec.gov/Archives/edgar/data/000000000005035484/0000000000-05-035484-index.html
https://www.sec.gov/Archives/edgar/data/000000000007018503/0000000000-07-018503-index.html
https://www.sec.gov/Archives/edgar/data/000095013406020010/0000950134-06-020010-index.html
https://www.sec.gov/Archives/edgar/data/000089873394000138/0000898733-94-000138-index.html
https://www.sec.gov/Archives/edgar/data/000006510395000120/0000065103-95-000120-index.html
https://www.sec.gov/Archives/edgar/data/000000000012026764/0000000000-12-026764-index.html
https://www.sec.gov/Archives/edgar/data/000000000014004389/0000000000-14-004389-index.html
This filing has no index url: 000091010826000021. It is notable because it is no longer exists in the SEC's metadata.json or online. My archive still has it, albeit the sgml was malformed. Fixable.
> Note: this is seperate issue from e.g. https://www.sec.gov/Archives/edgar/data/1556739/000162828025000960/0001628280-25-000960-index.html. Where the link to the processed files does not work, but you can inject the other cik to make it work https://www.sec.gov/Archives/edgar/data/1573298/000162828025000960/wk-form4_1736373069.xml.
000000000007021812

- has a missing sgml file.
000000000006058958

- has a missing sgml file.
- all other files missing as well.
000090946595000005

- has a missing sgml file.
- all other files missing as well.
- I tried multiple cik links.
000000000005035484

- has a missing sgml file.
- all other files missing as well.
000000000007018503

- has a missing sgml file.
- all other files missing as well.
000095013406020010

- has a missing sgml file.
- all other files missing as well.
- I checked this against the daily archive uploads. Also missing there
000089873394000138

- has a missing sgml file.
000006510395000120

- has a missing sgml file.
- I tried both cik links (sometimes one is broken but not the other)
- https://www.sec.gov/Archives/edgar/data/65100/0000065103-95-000120.txt
- https://www.sec.gov/Archives/edgar/data/919549/0000065103-95-000120.txt
000000000012026764

- has a missing sgml file.
- has a corrupted pdf file that could be salvaged.
000000000014004389

- has a missing sgml file.
- has a corrupted pdf file that could be salvaged.