Regression/ML Modeling in Commodities

Currently delving into a python project to build a fully automatable U.S. S&D model for crude. I'm using the EIA api to quickly pull and filter data but I'm struggling with what data to actually use as inputs for the model. Should I use both supply and demand data as inputs or is just inventories fine? I guess I'm struggling with what the best practice actually is...I know using rolling regressions is somewhat commonplace in S&T at banks but can any traders or analysts comment on what kind of inputs I should be using, what kind of ML model makes the most sense, key things to keep in mind when creating such a model, etc. I don't want to create anything overly complicated just a bit lost on what sort of analysis is actually considered valuable on the trade floor. Thanks!

5 Comments
 
Most Helpful

Start with the variable you are interested in predicting, do some EDA, and find out what variables you have on the supply and demand side that correlate to this variable. This will give you a sense of linear relationships between your target and the data you have. 

If you want to look at non-linear relationships among your data then look at the Mutual Independence. 

In terms of the type of model, start with a simple linear regression using multiple variables and use this as a benchmark score once you have identified the right variables to include. Then, if you want to and have time, explore using more complex models like trees (CatBoost is a great start). 

This is how I would approach this from a data science perspective, where I know nothing of the relationships between target and data. Obviously, with experienced traders around mentoring you, there is the benefit of their input on model creation, but when doing so blindly or when working in a new domain, adopt this type of systematic approach. 

Hope this helps and Merry Christmas.

 

Crude doesn’t really lend itself to a lot of convenient statistical modeling.  There are some pockets where there are decent input-output relationships that you can model purely statistically.  But much of it involves understanding how oil moves around physically, the internal logic of market participants, and modelling the supply chain by adapting what you know about the commodity.  And this can also be done in a systematic way, but not necessarily an ML way. 

 

Odit accusantium fugit nemo quia sunt omnis. Dolorum fuga optio sequi sunt voluptatibus qui saepe provident.

Quasi adipisci numquam eum esse facilis nemo officiis odio. Hic debitis laborum nobis architecto. Est nihil aut quo aut. Animi perspiciatis voluptatum dolor ut ut illum corrupti. Minus natus ea ea non reprehenderit. Aut sint consequatur culpa aut.

Ipsa voluptatem velit aut repellat consequatur voluptatem consequatur. Quaerat voluptas enim facilis commodi et illo laudantium. Rerum corrupti sequi a eligendi modi quidem quam. Qui eos enim atque voluptatem.

Career Advancement Opportunities

June 2026 Investment Banking

  • Evercore 01 99.4%
  • Moelis & Company 01 98.8%
  • JPMorgan 01 98.2%
  • Guggenheim Partners 01 97.7%
  • Morgan Stanley 07 97.1%

Overall Employee Satisfaction

June 2026 Investment Banking

  • Moelis & Company No 99.4%
  • Morgan Stanley 01 98.8%
  • Evercore 01 98.2%
  • BMO Capital Markets 12 97.6%
  • Banco Santander 01 97.1%

Professional Growth Opportunities

June 2026 Investment Banking

  • Moelis & Company No 99.4%
  • Evercore No 98.8%
  • Morgan Stanley 05 98.2%
  • JPMorgan No 97.7%
  • BMO Capital Markets 12 97.1%

Total Avg Compensation

June 2026 Investment Banking

  • Vice President (14) $434
  • Associates (43) $259
  • 3rd+ Year Analyst (8) $210
  • 2nd Year Analyst (22) $179
  • Intern/Summer Associate (13) $156
  • 1st Year Analyst (75) $151
  • Intern/Summer Analyst (65) $101
notes
16 IB Interviews Notes

“... there’s no excuse to not take advantage of the resources out there available to you. Best value for your $ are the...”

Leaderboard

1
redever's picture
redever
99.2
2
Secyh62's picture
Secyh62
99.0
3
BankonBanking's picture
BankonBanking
99.0
4
kanon's picture
kanon
99.0
5
DrApeman's picture
DrApeman
98.9
6
dosk17's picture
dosk17
98.9
7
CompBanker's picture
CompBanker
98.9
8
GameTheory's picture
GameTheory
98.9
9
Betsy Massar's picture
Betsy Massar
98.9
10
numi's picture
numi
98.8
success
From 10 rejections to 1 dream investment banking internship

“... I believe it was the single biggest reason why I ended up with an offer...”