PubTables-1M: Towards comprehensive table extraction from unstructured documents (2021-09-30T00:00:00.000000Z)