I would like to parse form fields, for example checkboxes, from signed PDFs. I have already tried several Python libraries (PyPDF2, pikepdf and even pdfminer), but I only get the text out and not the form fields. I'm already thinking about trying OCR, but that seems very complicated to me and there might be an easier way.
Does anyone have an idea how I can parse the form fields out of a signed PDF?
Thanks in advance!
You can extract (and also manipulate) form fields with PyMuPDF, whether the PDF is signed or not:
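As a minimal sketch of what that could look like: PyMuPDF exposes form fields as "widgets" on each page, and each widget carries a name, a type and a value. The helper names `describe_widget` and `extract_form_fields` and the file name `signed.pdf` are made up for this example. How a checkbox reports its state can differ between PyMuPDF versions and between PDFs, so the `checked` logic below is an assumption you should verify against your own files.

```python
def describe_widget(widget):
    """Turn one PyMuPDF widget (form field) into a plain dict."""
    value = widget.field_value
    info = {
        "name": widget.field_name,
        "type": widget.field_type_string,  # e.g. "Text", "CheckBox", "Signature"
        "value": value,
    }
    if widget.field_type_string == "CheckBox":
        # Assumption: an unchecked box reports False, None, "" or "Off";
        # anything else (True, "Yes", a custom on-state name) counts as checked.
        info["checked"] = value not in (False, None, "", "Off")
    return info


def extract_form_fields(pdf_path):
    """Collect the form fields of every page in the PDF at pdf_path."""
    import fitz  # PyMuPDF; recent releases can also be imported as "pymupdf"

    fields = []
    with fitz.open(pdf_path) as doc:
        for page in doc:
            # page.widgets() yields the form fields located on this page
            for widget in page.widgets():
                fields.append({"page": page.number, **describe_widget(widget)})
    return fields


if __name__ == "__main__":
    # "signed.pdf" is a placeholder path
    for field in extract_form_fields("signed.pdf"):
        print(field)
```

The signature itself should show up as a widget of type "Signature", so you can filter it out or inspect it like any other field.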