Power Query Does Not Import Multiple PDF Pages

legalhustler

Well-known Member
Joined
Jun 5, 2014
Messages
1,062
Hi,

I'm using Power Query for Power BI Desktop and when I connect to a single PDF file the last record only shows the data for the first page and not the rest of the pages. The rest of the data is the same as the first page. Can Power Query read multiple pages for one PDF? If so, how can I get it to work?

Thanks in advance.
 
Last edited:

sandy666

Well-known Member
Joined
Oct 24, 2015
Messages
3,278
I connect pdf with over 800 pages (tables) and it works well
but PBD is useless to me :LOL:
 
Last edited:

legalhustler

Well-known Member
Joined
Jun 5, 2014
Messages
1,062
So you can see all the records once you connect to the single PDF, including the last record on the last page?

Why do you find PBD useless? I don't think Power Query for Excel can connect to a PDF data source. I was going to transform the PDF data in PBD then export the data to Excel.
 
Last edited:

sandy666

Well-known Member
Joined
Oct 24, 2015
Messages
3,278
So you can see all the records once you connect to the single PDF, including the last record on the last page?
Yes I see as I said before
Why do you find PBD useless?
Just because :) :) :) it's too heavy and problem with high resolution. All what I need I have in Excel: PQ, PowerPivot
I don't think Power Query for Excel can connect to a PDF data source.
Right PQ in Excel cannot but there is a few ways to do it, OCR or Matt Allington advice and some more
I was going to transform the PDF data in PBD then export the data to Excel.
Via csv not directly to Excel
 
Last edited:

legalhustler

Well-known Member
Joined
Jun 5, 2014
Messages
1,062
I think I see the problem. I'm not technically connecting using "PDF" data source, I'm using "Folder" and when I do that it shows PDF file with 10 objects, of which 5 are tables (the one's I need to transform) and 5 non-important stuff. How do I connect Power Query to the 5 table objects only? Can't seem to select multiple objects. The Sample File drop down shows all my PDF files with different number of table objects....can Power Query combine table objects for each PDF automatically?
 
Last edited:

sandy666

Well-known Member
Joined
Oct 24, 2015
Messages
3,278
remove not necessary rows then TransformData and choose from Attribute columns you need
 
Last edited:

legalhustler

Well-known Member
Joined
Jun 5, 2014
Messages
1,062
Once I connect via "Folder" data source I can select Combine, Load, or Transform.

If I select Combine it will allow for either "combine & transform" or "load". If I select "combine & transform", I have to select a sample file and it only allows me to select one table object (can't select multiple) then when I load it, it only shows the first table from each PDF instead of all the tables from each PDF.

If I select Transform, I have the Content and Attributes column but I don't see anyway to select the table objects from each PDF.

I can connect via "PDF" data source but I have to manually select each file select the checkbox next to each object, then I would probably have to combine all the tables. This seems like it will work but I don't know how to combine each table. Table.Combine?
 
Last edited:

sandy666

Well-known Member
Joined
Oct 24, 2015
Messages
3,278


TransformData (post#6) == Transform

btw. I know what is there so describe what are you doing only not whole layout
 
Last edited:

legalhustler

Well-known Member
Joined
Jun 5, 2014
Messages
1,062
I don't want to assume what you know or don't know, so I like to describe in detail what I'm observing so you or anyone else can comprehend the issue. Kind of like your quote "I know you know but I forgot my Crystal Ball and don't know what you know" :)

If I click the double arrow to expand, it doesn't shows the table objects, it shows "Content", "Kind", "Size", "Hidden", "System", etc.....don't see anything related to the objects.
 

sandy666

Well-known Member
Joined
Oct 24, 2015
Messages
3,278
because there are columns of the tables
Expand all
then transform/clean data and stay with data what you need

PBD (PQ) is not a Click&Go, sometimes you need to know what are you want to achieve

maybe it will help: Link
 
Last edited:

Forum statistics

Threads
1,078,137
Messages
5,338,462
Members
399,234
Latest member
WaddoAU

Some videos you may like

This Week's Hot Topics

Top