A coherent Bayesian framework for cross-sectional equity ranking. Twelve academic factor families flow into one posterior-weighted composite with honest uncertainty disclosure. Every math layer cross-references the others — no arithmetic averaging, no hand-waving.
This page explains the concepts we use and cites the primary literature. Exact hyperparameters, shrinkage constants, bin thresholds, process-noise settings and value-function internals are proprietary and omitted. Live diagnostic panels show current aggregate outputs as evidence that each layer actually runs end-to-end.
APEX is organised as a pipeline where one online posterior over the current market regime feeds several downstream decisions simultaneously — factor weighting, covariance shaping, prediction-interval width, and confidence scaling. The same posterior powers all of them, so the outputs are coherent rather than independently averaged.
Twelve academic factor families are composed into a per-ticker 0–100 score. A confluence engine then applies literature-backed multi-factor pattern overrides (e.g. Sloan + gross-margin crack; Asquith short-squeeze setup). Every override carries a published effect size and is conditionally fired only when the participating factors show genuine joint tail-dependence — not merely high average correlation.
The engine is Bayesian end-to-end: priors come from the literature, posteriors update from live residuals (once forward returns accumulate), and uncertainty is reported alongside every point estimate as a conformal interval.
Each factor returns a 0–100 score per ticker per day; higher is more bullish. Factor scorers are self-normalising against the live universe so a 70 means "top-quintile today", not against a stale literature baseline.
Twelve factors is a headline number; the effective breadth after removing redundancy is lower. We report that explicitly — the composite optimiser penalises correlated signals rather than double-counting them.
Linear correlation describes average co-movement; in crises what matters is what factors do together specifically in the tails. A Gaussian copula with ρ = 0.5 has zero tail-dependence; a Student-t copula at the same ρ has meaningful joint-crash probability. We estimate lower- and upper-tail dependence non-parametrically for every factor pair, per regime, and shrink toward an independence prior so thin-sample pairs do not dominate.
The Confluence Engine uses this directly: a pattern's confidence is amplified when its participating factors also show genuine tail-alignment in history, and damped when the alignment is weak. This prevents "textbook pattern, no real co-movement" overrides.
Method: Schmidt & Stadtmüller 2006; Frahm, Junker & Schmidt 2005; Embrechts, McNeil & Straumann 2002.
Standard mean-variance (Markowitz) optimisation solves for weights that minimise factor-score variance for a given information-coefficient vector. Using Spearman ρ as the covariance input under-estimates correlated crash risk — exactly the risk that matters for sizing. APEX blends the correlation matrix with the tail-dependence matrix, with the blend weight determined by the current regime posterior: crash-dominated in risk-off, squeeze-dominated in risk-on, unbiased in calm regimes.
A Tikhonov ridge auto-tunes to guarantee positive-definiteness even when the empirical tail-dependence matrix is near-singular on thin samples. The same regime posterior that decides the blend also supplies the factor weights themselves — one posterior, two coherent uses.
Method: Grinold & Kahn 1999; Embrechts, McNeil & Straumann 2002; Higham 2002 on nearest-correlation projection.
Hidden Markov Models give discrete regime labels that flip abruptly. APEX uses a Bayesian online changepoint detector that maintains a full posterior over how long the current regime has been running. From the run-length posterior we derive a probability vector across three market regimes — risk-on, transition, risk-off — so the engine is honest about uncertainty near turning points rather than forcing a binary decision.
Fresh regime (low expected run-length) ⇒ the engine admits ignorance and leans on transition priors. Mature regime ⇒ directional bias takes over. The same posterior powers factor-weight blending, covariance blending, interval width, and dynamic exposure updates (sections 3, 5, 6). One posterior, four coherent decisions.
Method: Adams & MacKay 2007; Hamilton 1989 for baseline hazard rate.
Marginal conformal intervals guarantee coverage on AVERAGE across the universe — over-covering easy tickers and under-covering hard ones. APEX uses a bin-conditional (Mondrian) variant that partitions tickers by tail-alignment strength and regime, then calibrates one width per partition. The result: the stated 90% interval is close to 90% inside each bin, not only on average.
Until enough forward-return observations accumulate per partition, APEX falls back to a marginal interval inflated by average tail-dependence. This is the single place where the engine publicly admits the difference between "calibrated" and "prior" — every ticker page shows it.
Method: Vovk, Lindsay, Mammen & Vovk 2003 (Mondrian Confidence Machine); Angelopoulos & Bates 2021 for introduction.
Static factor weights lag regime shifts. APEX tracks factor exposures as a Kalman dynamic linear model where the per-state random walk is accelerated on days the regime detector signals a change-point. Process noise is driven directly by the change-point posterior, so exposures pivot fastest on precisely the days they need to — again the same posterior powering yet another decision.
The DLM output is blended with the regime-aware Markowitz solution. Early in calibration the blend leans on Markowitz priors; as the DLM accumulates observations it takes over. Both are reported; the blend is transparent.
Method: Kalman 1960; West & Harrison 1986, 1997 on Bayesian forecasting and dynamic models.
When the Confluence Engine overrides the linear composite (e.g. a value-trap pattern pulls a superficially-cheap ticker down), which factor deserves the credit? The naïve linear answer is misleading — overrides fire only under joint conditions, so part of each factor's contribution comes from its role in enabling the pattern, not from its standalone score.
APEX uses a Shapley decomposition of the score attributable to each factor. Every contribution splits cleanly into a linear share (the coefficient-style piece) and an interaction share (the pattern-enabling non-linearity). The four uniqueness axioms guarantee a consistent story across tickers. Per-ticker attribution is available in the authenticated research API.
Method: Shapley 1953; Štrumbelj & Kononenko 2014 for ML interpretability.
Adams & MacKay (2007). Bayesian Online Changepoint Detection, arXiv:0710.3742.
Angelopoulos & Bates (2021). A Gentle Introduction to Conformal Prediction, arXiv:2107.07511.
Asness, Moskowitz & Pedersen (2013). Value and Momentum Everywhere, Journal of Finance 68 (3).
Asquith, Pathak & Ritter (2005). Short Interest, Institutional Ownership, and Stock Returns, JFE 78.
Bernard & Thomas (1989). Post-Earnings-Announcement Drift, Journal of Accounting Research 27.
Cohen & Frazzini (2008). Economic Links and Predictable Returns, Journal of Finance 63 (4).
Embrechts, McNeil & Straumann (2002). Correlation and dependence in risk management: properties and pitfalls, in Risk Management: Value at Risk and Beyond.
Fama & French (1992). The Cross-Section of Expected Stock Returns, Journal of Finance 47.
Frahm, Junker & Schmidt (2005). Estimating the tail-dependence coefficient, Insurance Math. Econ. 37.
Grinold & Kahn (1999). Active Portfolio Management, 2nd ed., McGraw-Hill.
Hamilton (1989). A New Approach to the Economic Analysis of Nonstationary Time Series and the Business Cycle, Econometrica 57.
Higham (2002). Computing the Nearest Correlation Matrix, IMA J. Numer. Anal. 22.
Jegadeesh & Titman (1993). Returns to Buying Winners and Selling Losers, Journal of Finance 48.
Kalman (1960). A New Approach to Linear Filtering and Prediction Problems, J. Basic Engineering 82.
Lakonishok, Shleifer & Vishny (1994). Contrarian Investment, Extrapolation, and Risk, Journal of Finance 49.
Li (2008). Annual Report Readability, Current Earnings, and Earnings Persistence, JAE 45.
Loughran & McDonald (2011). When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10-Ks, Journal of Finance 66.
Moskowitz & Grinblatt (1999). Do Industries Explain Momentum?, Journal of Finance 54.
Novy-Marx (2013). The Other Side of Value: The Gross Profitability Premium, JFE 108.
Pan & Poteshman (2006). The Information in Option Volume for Future Stock Prices, RFS 19.
Schmidt & Stadtmüller (2006). Non-parametric Estimation of Tail Dependence, Scand. J. Stat. 33 (2).
Seyhun (1998). Investment Intelligence from Insider Trading, MIT Press.
Shapley (1953). A Value for n-Person Games, Annals of Mathematical Studies 28.
Sloan (1996). Do Stock Prices Fully Reflect Information in Accruals and Cash Flows about Future Earnings?, Accounting Review 71.
Štrumbelj & Kononenko (2014). Explaining prediction models and individual predictions with feature contributions, KIS 41.
Vovk, Gammerman & Shafer (2005). Algorithmic Learning in a Random World, Springer.
Vovk, Lindsay, Mammen & Vovk (2003). Mondrian Confidence Machine, COLT.
West & Harrison (1986, 1997). Bayesian Forecasting and Dynamic Models, Springer.