Updated: How to extract a date from a PDF file using Python


I have updated the script to support a new parameter. Here is my use case:

Why this update? When I generate a PDF from a web-page, the resulting PDF often contains the current date at the very top, before all other dates. However, that’s not the date that I am interested in. I introduced a new before-parameter that will return the first date that is before the given date.

I have updated the previous post with the changes. (Maybe it’s time to publish this as a github project, but it still feels like overkill.)


See also