PDF
Extract Text
There is an easy way to extract PDF text: use the Extract Text action to pull the text out of your PDF documents. If you have wondered how to extract text from a PDF, you can't go wrong using Procesio.

The result (the extracted text) contains 2 parts:
- Text: all text from the PDF, concatenated into a single string.
- Words: each word from the PDF, extracted separately into a list of JSON objects, each with details such as id, row number, position, font, etc.

How to configure the Extract Text action?

1. Create a process and give it a name.
2. Drag the Extract Text action to the canvas and link it to the other actions.
3. Create the variables needed for the configuration of the action, then add them to the configuration panel.
4. Save, validate and run the process.
5. The process will ask for an input file (a PDF file).
6. Click Run.
7. Click Check Instance to view the results. You will see the extracted text in the output variable.

Example

This is the PDF we used for this example.

This is a preview of the extracted text.
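
If you want to post-process the two-part result (the text string and the words list) outside of Procesio, the sketch below shows one possible approach: grouping the words by row to rebuild the lines of the page. The key names used here ("text", "words", "id", "row", "value") and the sample data are assumptions made for illustration only; check the actual output of your process instance for the real field names.

```python
import json
from collections import defaultdict

# Hypothetical sample shaped like the result described above.
# The key names ("text", "words", "id", "row", "value") are assumptions
# for illustration, not a documented Procesio schema.
sample_result = json.loads("""
{
  "text": "Invoice 42 Total 100",
  "words": [
    {"id": 1, "row": 1, "value": "Invoice"},
    {"id": 2, "row": 1, "value": "42"},
    {"id": 3, "row": 2, "value": "Total"},
    {"id": 4, "row": 2, "value": "100"}
  ]
}
""")


def words_by_row(result):
    """Group word values by row number, keeping the order they arrive in."""
    rows = defaultdict(list)
    for word in result["words"]:
        rows[word["row"]].append(word["value"])
    return dict(rows)


def rebuild_lines(result):
    """Join the words of each row into one line, in ascending row order."""
    rows = words_by_row(result)
    return [" ".join(rows[row]) for row in sorted(rows)]


lines = rebuild_lines(sample_result)
print(lines)
```

The text part is enough when you only need the full content as one string; the words part is the one to use when layout matters, since each entry carries its own row and position details.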