Help me overcome this nightmare: PDF to Excel

I have been out of IB for almost a year, but cannot shake this recurring nightmare: people asking me to build models from financials in PDF. It was cute at first, but it has quickly grown burdensome because my industry has complex financials (e.g. dozens of line items just for revenue and 30+ year projections). Please help me keep my sanity.

What is the best software for converting PDFs to Excel? Cost is of no concern to me, but it has to be a software package that I can install on my computer.

 

I was going to say I think MrExcel.com recommended the first one. If you really did want to automate a delimiter macro, you could check out some freelance programming sites (e.g. www.fiverr.com). Obviously don't send them anything real, but send something close enough that what you get back will actually be useful. Otherwise there is plenty of legacy code out there to modify, but this would require some knowledge of VBA. If I find anything more substantial I will PM you.

 

I frequently use PDF2XL by CogniView at my current internship, and it works great! You simply mark out the rows and columns by moving the margins around, and at the touch of a button it pops out in an Excel worksheet.

cogniview.com/pdf-to-excel/pdf2xl-basic

 

Writing a VBA macro is probably the best way to go.

However, if that's not an option, you could also try converting the PDF to Word first, and then see if the raw data in Word is easier to manipulate than the PDF. It probably will be, since Word and Excel both run on VBA at least.

Another nice thing is that if you're tech-stupid, Word has a handy tool that lets you "record" a macro, which basically means you hit "record", do some actions in Word, then hit "record" again. Any actions you performed will be recorded automatically as a VBA script, so if you're creative enough, you may be able to use VBA without actually understanding its mechanics. I have definitely used this trick before to bring some order to unstructured PDF text.
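
If a recorded macro feels too fiddly, the same "bring some order to unstructured text" idea can be sketched in a few lines of Python. This is only a rough illustration, not anything the tools above do: it assumes the text pasted out of the PDF separates columns with a tab or at least two spaces, and the names `split_rows` and `to_csv` and the sample figures are made up for the example.

```python
import csv
import io
import re

# Assumption: columns in the pasted text are separated by a tab or by
# two or more whitespace characters; single spaces stay inside a label.
COLUMN_BREAK = re.compile(r"\s{2,}|\t")

def split_rows(raw_text):
    """Split pasted PDF text into rows of cells, skipping blank lines."""
    rows = []
    for line in raw_text.splitlines():
        line = line.strip()
        if line:
            rows.append(COLUMN_BREAK.split(line))
    return rows

def to_csv(rows):
    """Render the rows as CSV text that Excel can open directly."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()

# Hypothetical sample data, standing in for text copied out of a PDF.
sample = """
Product revenue      1,200    1,350
Service revenue        300      410
"""
print(to_csv(split_rows(sample)))
```

Save the printed text as a .csv file and Excel should open it with one line item per row. Layouts that use single spaces between columns would need a different splitting rule.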

 

Try the premium Adobe Acrobat (I believe it is called DC). You can export files in a number of formats, including Excel. It has worked fairly well for me the few times I've used it.

 

There are software packages, but since you say cost is not a problem for you, I recommend outsourcing to a group in India. My team uses an outsourcing company in India for $1,800 / month to do literally every mundane thing you can imagine. We have pre-built something like 100 company and news screens with them and they just send us weekly one-off reports, they screen massive company lists for us, and yes, they will turn around any PDF financial statement document within about 24 hours. They also tend to do this stuff while you're sleeping, given time zones.

It's a different type of solution but for my team it's low cost to outsource stuff that isn't super time sensitive.

"If you want to succeed in this life, you need to understand that duty comes before rights and that responsibility precedes opportunity."
 
TheBigBambino:

There are software packages, but since you say cost is not a problem for you, I recommend outsourcing to a group in India. My team uses an outsourcing company in India for $1,800 / month to do literally every mundane thing you can imagine. We have pre-built something like 100 company and news screens with them and they just send us weekly one-off reports, they screen massive company lists for us, and yes, they will turn around any PDF financial statement document within about 24 hours. They also tend to do this stuff while you're sleeping, given time zones.

It's a different type of solution but for my team it's low cost to outsource stuff that isn't super time sensitive.

Who do you use? I haven't had rates quoted at that level before, so very interested.

 

Can you not just google "PDF to Excel" and use all the free ones online?

My only worry is that you have no idea what they're doing with the documents, but especially if they're public financials it shouldn't matter.

I literally just used one of those free online "PDF to XXX" converters to get an industry rankings list into Excel.

"I did it for me...I liked it...I was good at it. And I was really... I was alive."
 

Assuming that the PDFs you're looking at aren't images, you do realize that within Adobe Reader, if you hold down Alt, it will allow you to highlight any column in a straight line by selecting with your mouse (regardless of any formatting), for easy copy-pasting into Excel? Doing that should cut down your time by something like 20x vs. manually typing out numbers or trying to organize an unformatted mess.
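
Once a column is pasted, the numbers often arrive as text ("(350)", "$ 5,000.50") rather than values. Here is a minimal Python sketch of cleaning one pasted column. The function name `parse_amount` and the sample cells are hypothetical, and treating a blank or a lone dash as zero is an assumption about the statement's conventions that you should check against your own source.

```python
import re

def parse_amount(cell):
    """Convert one pasted cell such as '(1,234)' or '$ 5,000.50' to a float.

    Returns None for cells that are not numeric so they can be reviewed by hand.
    """
    text = cell.strip().replace("$", "").replace(",", "").replace(" ", "")
    if text in ("", "-"):
        return 0.0  # assumption: a blank or lone dash means zero
    # Accounting style: parentheses mark a negative number.
    negative = text.startswith("(") and text.endswith(")")
    if negative:
        text = text[1:-1]
    if not re.fullmatch(r"-?\d+(\.\d+)?", text):
        return None
    value = float(text)
    return -value if negative else value

# Hypothetical column, as it might look after an Alt-select copy and paste.
pasted_column = """1,200
(350)
-
$ 5,000.50
n/a"""
print([parse_amount(c) for c in pasted_column.splitlines()])
# prints [1200.0, -350.0, 0.0, 5000.5, None]
```

Anything that comes back as None is worth eyeballing against the PDF rather than guessing.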

As a poster above alluded to, no software solution will be 100% accurate. You're better off with either a quasi-manual approach or farming it out to some other humans.
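
Since nothing will be 100% accurate, one cheap safeguard is to check that the extracted line items still add up to the total printed in the PDF. A rough Python sketch follows. The function name `reconcile`, the 0.5 rounding tolerance, and the figures are all made up for illustration.

```python
def reconcile(line_items, reported_total, tolerance=0.5):
    """Return (ok, difference) comparing summed line items to the stated total.

    The tolerance allows for rounding in the source document; 0.5 is an
    arbitrary assumption and should be tuned to the statement's units.
    """
    difference = sum(line_items) - reported_total
    return abs(difference) <= tolerance, difference

# Hypothetical extracted revenue lines and the total shown in the PDF.
revenue_lines = [1200.0, 300.0, 45.0]
print(reconcile(revenue_lines, 1545.0))  # prints (True, 0.0)
print(reconcile(revenue_lines, 1554.0))  # prints (False, -9.0)
```

A failed check usually points to a dropped row or a misread digit, which narrows down where to look.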

 

I had the same problem and bought Adobe Acrobat Reader DC. Just download it from their site; it requires a subscription (equates to ~$30/yr) and has a function that converts PDFs to Excel, Word, PowerPoint, etc. It saves a ton of time for PDFs with huge tables.

 

