| Thread Navigator

Canvas & Ratio

Choose your destination platform format

Layout Template

Choose a content structure for your slides

Preset Themes

Typography & Sizing

Font Family

Title Font Size36px

Body Font Size18px

Header & Footer Size12px

Brand Kit Customization

AGENCY

Configure brand assets for headers & footers

MULTI-PROFILES (AGENCY)

Active Brand Profile

Show Brand Watermark

Brand Watermark Text

Social Handle

Brand Logo URL (PNG) AGENCY

SAVE PRESETS (AGENCY)

Save current as Preset

Outro Slide CTA

Customize your closing call-to-action slide

CTA Title

CTA Message & Emojis

Custom CTA Buttons

Background Pattern

Source Content

Build Your Carousel

Drag and drop any post card below onto a slide, or use the quick buttons to insert content/images instantly!

Drag Post #1

Gappy (Giuseppe Paleologo)

@__paleologo

It's been a long time since I wrote anything remotely informative on X. I have something minor that sits somewhere in between [finance] signal analysis and statistics, which is also very practical, simple, and with a familiar ending. A loong 🧵 👇. 1/18

Drag Post #2

Gappy (Giuseppe Paleologo)

@__paleologo

Say that you have excess independent asset returns📉📈, i.e., you have removed common components of returns so that your n-dimensional return vector z[t] has independent elements. Also, z-score them so that they have unit volatility. There are T investment epochs/year. 2/18

Drag Post #3

Gappy (Giuseppe Paleologo)

@__paleologo

Your holy grail is to find a signal vector x[t], such that the cross-sectional correlation IC:=corr(x, z) is as high as possible (IC stands for "Information Coefficient"). Then the Sharpe Ratio is SR=IC*sqrt(n*T). Fundamental Law of Active Management. So far, so good. 3/18

Drag Post #4

Gappy (Giuseppe Paleologo)

@__paleologo

However, say that you have a model in mind, where some asset-specific variables determine returns. Most common variable: EPS - baseline, were the baseline is consensus EPS, or something like it (because consensus is a bit too easy to beat!). 4/18

Drag Post #5

Gappy (Giuseppe Paleologo)

@__paleologo

EPS is only one choice. The variable depends a lot on asset class, investment class, etc. It could be drug approval; or credit downgrade, etc. You get the idea. And there might be more than one (but we stick to one). 5/18

Drag Post #6

Gappy (Giuseppe Paleologo)

@__paleologo

The important thing is: the variable, at some point, is observed. We can estimate the relationship variable-returns, after the it's been observed. You estimate IC_ideal := corr(y, z). If you choose the variable well, this correlation can be quite high. If you only knew y. 6/18

Drag Post #7

Gappy (Giuseppe Paleologo)

@__paleologo

... you'd be rich. At least, you have a different problem: forecasting the variable y. You might like this problem, because you can encode a lot of expertise in the forecast. Say that you have forecasts x. The R^2 of the model y ~ x is Rsq. You'd like to use x as signals... 7/18

Drag Post #8

Gappy (Giuseppe Paleologo)

@__paleologo

... as that is your best available information. You know that corr(x, y)=sqrt(Rsq). What you need is to know is the Sharpe Ratio of the strategy: corr(x, z)*sqrt(n). 8/18

Drag Post #9

Gappy (Giuseppe Paleologo)

@__paleologo

The statistics part is this: if I know corr(x,y)=rho_1 and corr(y, z) = rho_2, what is corr(x, z)? 9/18

Drag Post #10

Gappy (Giuseppe Paleologo)

@__paleologo

You have seen this at some point. A necessary condition for the correlation is that the determinant of the correlation matrix be positive definite. 10/18

Apply Image

Drag Post #11

Gappy (Giuseppe Paleologo)

@__paleologo

From which we get a worst-case bound: r:=sqrt[(1-rho_1^2)*(1-rho_2^2)] and |rho_1*rho_2 - rho_xz| <= r. The problem is that they are usually not binding. A realistic example: rho_1=0.1, rho_2=0.6, then rho_1*rho_2=0.06 with a range of 0.8. Too wide! 11/18

Drag Post #12

Gappy (Giuseppe Paleologo)

@__paleologo

An alternative way to obtain the same bound (and more) is via partial correlation. The partial correlation of forecast and returns, given the realized variable, is below, and must be in [-1, 1]. 12/18

Apply Image

Drag Post #13

Gappy (Giuseppe Paleologo)

@__paleologo

Geometrical view: rho_xz = rho_1*rho_2 + r*cos(theta), with theta = cor(x, z|y) for high n, it is reasonable to assume that partial correlation concentrates around 0. Given the realization of the variable, the forecast contains no information about the returns. 13/18

Apply Image

Drag Post #14

Gappy (Giuseppe Paleologo)

@__paleologo

So we have an expectation identity: E[rho_xz]=rho_1*rho_2. Now you can factor the Information Coefficient into the product of the sqrt(Rsq) of your forecast, and the IC_ideal, under knowledge of the key variables. 14/18

Drag Post #15

Gappy (Giuseppe Paleologo)

@__paleologo

The annualized Sharpe Ratio is Sharpe = sqrt(Rsq)*IC_ideal*sqrt(breath)*sqrt(# decisions/yr) So there you have it. 15/18

Drag Post #16

Gappy (Giuseppe Paleologo)

@__paleologo

Sharpe decompositions appear often in quantitative investment, but they have very different meanings. This one generalizes the Transfer Coefficient. Replace y with the *true* expected returns, x with the forecasts, and you get the TC identity in the Clarke et al. paper. 16/18

Drag Post #17

Gappy (Giuseppe Paleologo)

@__paleologo

Last thing: I told the story with y as variables that predict returns from their historical realizations. But y and x can be anything. For example, y could be forecasts and x can be the noisy, corrupted estimates of these forecasts. Worst things can happen. 17/18

Drag Post #18

Gappy (Giuseppe Paleologo)

@__paleologo

The story-time interpretation, though, is useful. We should use Oracles more, and then predict those Oracles, and combine them. People do this without thinking carefully about pitfalls and power-ups. Underrated and overrrated at the same time. And that is the end. 18/18