Ultimately, strategies become black boxes, not because black boxes are desirable, but in spite of black boxes being undesirable. Suppose we had a model that predicted the frequency of Uber rides using a binary indicator of whether the ground was wet, and suppose it performed extremely well.

Obviously, wet ground has no direct bearing on Uber rides. Indirectly, though, rain causes the ground to be wet and also causes more people to want to take an Uber. Even though our spurious model performs well, it is exposed to tail risk. If a water pipe bursts in one section of the city, or there is natural flooding, the ground will be wet and we would wrongly predict that Uber rides should increase in frequency in that area. In general, when A implies B (A => B) and A implies C (A => C), a model of B => C might work, but only incidentally. So it is imperative that predictive relationships conform with intuition and common sense. It is not enough to blindly data mine for strong predictive signals; we should aim to untangle any confounding factors before the signals aggregate into a black box, at which point those factors become increasingly difficult to untangle.
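The wet-ground example can be illustrated with a small simulation. This is only a sketch under made-up assumptions: the 30% rain probability, the baseline of 100 rides, the uplift of 50 rides when it rains, and the helper name `predict_rides` are all hypothetical and chosen purely for illustration.

```python
import random
from statistics import mean

random.seed(0)
N = 10_000

# Hypothetical data-generating process: rain is the common cause of both
# wet ground and higher ride demand. All numbers here are made up.
rain = [random.random() < 0.3 for _ in range(N)]
wet_ground = list(rain)  # in the sample, the ground is wet only when it rains
rides = [100 + 50 * r + random.gauss(0, 5) for r in rain]

# Spurious model: predict rides from wet ground alone, using group means.
rides_when_wet = mean(y for y, w in zip(rides, wet_ground) if w)
rides_when_dry = mean(y for y, w in zip(rides, wet_ground) if not w)

def predict_rides(ground_is_wet: bool) -> float:
    """Return the in-sample average ride count for the given ground state."""
    return rides_when_wet if ground_is_wet else rides_when_dry

# Tail event outside the sample: a burst pipe wets the ground without rain.
# Under the assumed process, true expected demand stays at the no-rain level.
true_expected_rides = 100.0
prediction = predict_rides(True)
overestimate = prediction - true_expected_rides

print(f"in-sample means: wet={rides_when_wet:.1f}, dry={rides_when_dry:.1f}")
print(f"burst pipe: predicted {prediction:.1f}, expected {true_expected_rides:.1f}")
```

In-sample, the wet-ground indicator separates high-demand from low-demand periods almost perfectly, because it is a stand-in for rain. The model only fails once the ground is wet for a reason other than rain, which is exactly the case the historical data never contained.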

Pair EURUSD.

A model of A => C will work but is inferior to a model of B => C. First, A => C leaves some money on the table because A may not be the only thing that causes (in the Granger-causal sense) B; perhaps A' also causes B. Second, if the relationship A => B breaks down, the A => C model breaks down with it, while the B => C model still works. Moving to multi-factor models, features should ideally be as orthogonal to each other as possible.

For example, suppose we were investigating ice cream price as a function of sugar price and milk price.

Perhaps a better model would use sugar price and season (spring, summer, fall, winter). The features of the former model are linked by inflation, both fall into the categories "food primitives/inputs" and "consumables", and both come from the supply side of ice cream production, whereas the latter model has two features that are much more orthogonal (one from the supply side and one from the demand side). Obviously, using all three features (sugar price, milk price, and season) would make a more accurate model, but as the dimensionality of the model increases, calibrations will take at least super-linearly longer, if not exponentially longer.
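A rough way to check how orthogonal two candidate features are is to look at their sample correlation. The sketch below uses synthetic data, not real prices: the shared inflation trend, the noise levels, the summer dummy, and the helper `pearson` are all assumptions made for illustration.

```python
import math
import random

random.seed(1)

def pearson(xs, ys):
    """Sample Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)

# Hypothetical monthly data over 20 years. A shared inflation trend drives
# both input prices, so the two supply-side features move together.
months = 240
inflation = [1.002 ** t for t in range(months)]
sugar = [p * (1 + random.gauss(0, 0.02)) for p in inflation]
milk = [p * (1 + random.gauss(0, 0.02)) for p in inflation]

# Demand-side feature: 1.0 in (assumed) summer months, 0.0 otherwise.
summer = [1.0 if t % 12 in (5, 6, 7) else 0.0 for t in range(months)]

corr_sugar_milk = pearson(sugar, milk)
corr_sugar_summer = pearson(sugar, summer)

print(f"corr(sugar, milk)   = {corr_sugar_milk:.2f}")
print(f"corr(sugar, summer) = {corr_sugar_summer:.2f}")
```

Under these assumptions, sugar and milk are close to collinear, so adding milk to a model that already has sugar contributes little new information, while the seasonal dummy is nearly uncorrelated with sugar and covers a different driver.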

By the time you have 20 features, certain optimization methods, such as exhaustive grid search, become intractable, so feature selection is key. We should drop correlated features in favor of more orthogonal ones.

Both empiricism and deductive reasoning are valuable in the context of designing quantitative models. One flaw of a purely empirical approach is that we cannot run controlled experiments in the markets. We cannot fix a point in time and try two different actions to see which performs better. In other words, there are no true counterfactuals in the soft/social sciences, unlike in the hard sciences. In trading in particular, we are also unable to measure precisely the effect of our own actions on the market. During a historical period when we were not actively trading, we cannot know how the order book and flow would have behaved had we been trading; during a period when we were actively trading, we cannot know how they would have behaved had we stayed out of the market.

Another flaw of empiricism is that for any given historical pattern, there are infinitely many models that conform to the pattern, yet each could make an entirely different prediction of the future. Hume, Wittgenstein, Kripke, Quine, Popper, and Taleb have all written critiques and defenses of logical empiricism and the problem of induction that expound further on these ideas.
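The point that many models can fit the same history yet disagree about the future can be made concrete with a toy construction. In the sketch below, the five observed points and the helper `make_rival_model` are hypothetical: each rival model adds a term that vanishes at every observed point, so it matches the history exactly while forecasting something different for the next point.

```python
# Observed "history": five points that lie exactly on the line y = x.
xs = [0, 1, 2, 3, 4]
ys = [0, 1, 2, 3, 4]

def linear_model(x: float) -> float:
    """The simplest model consistent with the history."""
    return x

def make_rival_model(c: float):
    """Build y = x + c*(x-0)(x-1)(x-2)(x-3)(x-4).

    The extra term is zero at every observed x, so the model agrees with
    the history for any choice of c.
    """
    def model(x: float) -> float:
        bump = c
        for xi in xs:
            bump *= (x - xi)
        return x + bump
    return model

rivals = {c: make_rival_model(c) for c in (-0.5, 0.1, 1.0)}

# Every rival reproduces the observed history exactly...
fits_history = {
    c: all(m(x) == y for x, y in zip(xs, ys)) for c, m in rivals.items()
}

# ...but each one forecasts the next point (x = 5) differently.
forecasts = {"linear": linear_model(5)}
forecasts.update({f"c={c}": m(5) for c, m in rivals.items()})

for name, value in forecasts.items():
    print(f"{name:>8}: forecast at x=5 is {value}")
```

Since `c` can be any real number, the historical data alone cannot pick among these models; some prior reasoning about which functional forms are plausible is needed to do that.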

One issue with pure deductive reasoning is that we as humans are error-prone. Any mistake of logic along a chain of deduction would immediately void the result. Furthermore, soundness of a conclusion requires not just that each logical step along the way is valid but also that the premises we assume are themselves true and in accordance with reality.

Since models must be tractable to be useful, they are often simplifications of the world and make assumptions which do not hold against reality. Suppose you were looking to run a Monte Carlo simulation for the trajectory of an asset price.

