Hi, i´m looking for a way to get the content of a pdf in a text format. Any ideas? thanks

It very much depends on what kind of pdfs you are trying to read, I use something like this to scrape pdf bank statements: {pdf_as_text,_} = System.cmd("pdftotext", ~w[-raw bank_statement.pdf -], cd: "/Users/myuser/hello_phoenix/pdfs/") pdf_as_text |> String.split("\n") |> Stream.map(fn line -> …

Lib for pdf processing

Questions / Help

AstonJ December 27, 2016, 12:58pm 2

Have you looked at:

More PDF libraries here.

7 Likes

Parsing pdf file