Q22 Quick Start S&P500
Introduction to S&P500 stocks, a quick-start example strategy that dynamically selects low-volatility stocks and allocates capital based on their transaction volume.
You can clone and edit this example from the Examples tab.
Introduction to S&P500 stocks
This template demonstrates how to build a trading strategy, specifically designed for use with a dataset containing historical constituents of the S&P 500. The strategy can only trade stocks that are part of the S&P 500 index, and only when the stock is considered liquid. A company is deemed liquid if it is included in the S&P 500 at the observed point in time.
The strategy takes a low-risk approach by allocating weights to stocks based on their volatility. It assigns capital only when the ratio of the Average True Range (ATR(14)) to the closing price is below 0.0205, signaling lower volatility. The allocation is further refined by the stock's transaction money flow, where stocks with higher liquidity and trading volume receive a larger portion of the capital (weight). This approach ensures that more capital is directed towards assets with higher market activity, while also maintaining a risk-aware allocation.
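The volatility filter described above can be sketched in plain NumPy. This is only an illustration: it assumes the ATR is a simple moving average of the true range (the smoothing used by qnta.atr may differ), and the helper names and price series are hypothetical.

```python
import numpy as np

def true_range(high, low, close):
    """True range per bar. The first bar has no previous close, so its own close is used."""
    prev_close = np.concatenate(([close[0]], close[:-1]))
    return np.maximum.reduce([high - low,
                              np.abs(high - prev_close),
                              np.abs(low - prev_close)])

def atr_sma(high, low, close, period=14):
    """Assumption: ATR as a simple moving average of the true range (NaN during warm-up)."""
    tr = true_range(high, low, close)
    atr = np.full(len(tr), np.nan)
    atr[period - 1:] = np.convolve(tr, np.ones(period) / period, mode="valid")
    return atr

limit = 0.0205
close = np.full(20, 100.0)  # hypothetical flat closing prices

# A calm asset with a 2-point daily range and a volatile one with a 6-point range.
calm_ratio = atr_sma(close + 1.0, close - 1.0, close) / close
wild_ratio = atr_sma(close + 3.0, close - 3.0, close) / close

# Same condition as in the strategy: zero weight when the ratio exceeds the limit.
calm_weight = 0 if calm_ratio[-1] > limit else 1
wild_weight = 0 if wild_ratio[-1] > limit else 1
print(calm_ratio[-1], calm_weight, wild_ratio[-1], wild_weight)
```

With these invented numbers the calm asset has a ratio of about 0.02 and keeps its weight, while the volatile asset has a ratio of about 0.06 and is excluded.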
Important Considerations:
- Trade only on historical S&P 500 stocks, and only when they are considered liquid (i.e., part of the index at the time of observation).
- The in-sample period begins on 2006-01-01. Earlier data may be used for testing and training purposes.
- The strategy must achieve a minimum in-sample Sharpe Ratio of 0.70 to be considered valid.
- The maximum allocation to any single asset is capped at 10% of total capital. If a weight exceeds this, it will be limited to 0.1.
- Manual stock selection or direct hand-picking is not permitted. The allocation process must be automatic.
- The strategy can open both long and short positions.
For official Q22 rules, click here.
Accessing the S&P 500 Stocks Dataset - Data loading
For the Q22 contest, a new dataset of S&P 500 stocks is available. You can obtain this dataset by calling the function stocks_load_spx_data() from the data module:
snp_stocks_data = qndata.stocks_load_spx_data(min_date='2005-06-01')
To check the list of S&P 500 companies starting from 2006-01-01, along with additional stock information, use the stocks_load_spx_list() function:
snp_stocks_list = qndata.stocks_load_spx_list()
Quantiacs also provides data for other instruments, such as cryptocurrencies, indices, futures, as well as additional fundamental stock data and macroeconomic data. Note that there is also a Nasdaq-100 dataset available, and it is not a subset of the S&P 500. However, there is a significant overlap between the two.
The data is provided in xarray.DataArray format. Check here for more details on manipulating xarray data.
Technical analysis - indicators
Once the data is loaded, various technical indicators from the qnt.ta module can be used to generate trading signals. For a complete list of indicators and examples of their implementation, check the documentation page.
Trading strategy - algorithm
# Necessary imports
import xarray as xr
import numpy as np
import qnt.stats as qnstats
import qnt.data as qndata
import qnt.output as qnout
import qnt.ta as qnta
import qnt.backtester as qnbt
import qnt.graph as qngraph
def strategy(data, state=None, mfsp=135, limit=0.0205):  # parameter state is used in the backtester (multi-pass) for stateful strategies
    vol = data.sel(field="vol")  ### Volume values
    liq = data.sel(field="is_liquid")  ### Liquidity values, 1.0 or 0.0 -> True or False
    close = data.sel(field="close")  ### close prices
    high = data.sel(field="high")  ### daily high
    low = data.sel(field="low")  ### daily low
    atrs = qnta.atr(high=high, low=low, close=close, ma=14)  ### Average True Range of 14 bars (working days)
    ratio = atrs / close  ### indicator
    weights = xr.where(ratio > limit, 0, 1)  ### Strategy condition - zero weight if ratio is bigger than limit
    money_vol = vol * liq * close  ### Daily money flow per liquid asset
    total_money_vol = money_vol.sum(dim='asset', skipna=True)  ### Total daily money flow of liquid assets
    money_vol_share = money_vol / total_money_vol  ### Daily money flow share per liquid asset
    mvs_mov = qnta.sma(money_vol_share, mfsp)  ### weights allocation by average money flow share in period of 135 days
    return mvs_mov * weights
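To make the money-flow weighting concrete, here is a one-day worked example in NumPy. All numbers are invented, the filter result is assumed rather than computed, and the 135-day averaging step of the strategy is left out for brevity.

```python
import numpy as np

# Hypothetical one-day snapshot for three assets (values invented for illustration).
vol = np.array([1_000_000.0, 500_000.0, 2_000_000.0])  # shares traded
close = np.array([50.0, 200.0, 10.0])                  # closing prices
liq = np.array([1.0, 1.0, 0.0])                        # 1.0 = liquid, 0.0 = not liquid

money_vol = vol * liq * close        # money flow per liquid asset: [5e7, 1e8, 0]
share = money_vol / money_vol.sum()  # share of the total money flow: [1/3, 2/3, 0]

# Assumed outcome of the volatility filter: the second asset is too volatile.
passes_filter = np.array([1, 0, 1])
raw_weights = share * passes_filter  # only the first asset keeps a non-zero weight
print(raw_weights)
```

The third asset receives nothing because it is not liquid, and the second receives nothing because it fails the filter, so the raw weights no longer sum to one. In the notebook, qnout.clean handles the final normalization step.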
Weights - Single / Multi pass approach
In Single pass backtesting, which is significantly faster, weights are calculated using the entire dataset in a single run.
Multi pass backtesting evaluates weights on a day-by-day basis by slicing the dataset for each individual day.
If applicable, we recommend using the single pass approach for efficiency, while verifying strategy statistics with the multi pass backtester. If the statistics from single pass and multi pass match exactly, it indicates that forward-looking bias has likely not been introduced, and the strategy can be confidently submitted as single pass.
In rare cases, forward-looking bias can still be present even when single pass and multi pass results are identical (e.g., when global data variables are incorporated into the logic). Such issues are generally mitigated in production, but they can cause the statistics observed in development to differ from those observed in production.
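The idea behind comparing the two approaches can be illustrated with a toy check in NumPy. This is a sketch under simplifying assumptions, not how qnbt.backtest is implemented: the signal functions, the helper name, and the price series are all hypothetical.

```python
import numpy as np

def causal_signal(prices):
    """Weight 1 when the price is above its trailing 3-bar mean (current and past bars only)."""
    out = np.zeros(len(prices))
    for t in range(2, len(prices)):
        out[t] = float(prices[t] > prices[t - 2:t + 1].mean())
    return out

def leaky_signal(prices):
    """Weight 1 when the price is above the mean of the WHOLE series, which peeks into the future."""
    return (prices > prices.mean()).astype(float)

def matches_day_by_day(signal, prices):
    """Compare single-pass weights with weights recomputed on each truncated history."""
    single_pass = signal(prices)
    for t in range(len(prices)):
        last_weight = signal(prices[:t + 1])[-1]  # what a day-by-day run would see on day t
        if last_weight != single_pass[t]:
            return False
    return True

prices = np.array([10.0, 11.0, 12.0, 11.5, 13.0, 14.0, 13.5, 15.0])
print(matches_day_by_day(causal_signal, prices))  # the causal signal agrees on every day
print(matches_day_by_day(leaky_signal, prices))   # the leaky signal does not
```

A signal that depends only on data available up to each day gives the same weights either way, while a signal that uses a statistic of the full series changes as the history is truncated.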
### SINGLE PASS ###
## The min_date parameter is set to '2005-06-01' to ensure that the dataset includes at least 135 bars (approximately 190 calendar days) of data,
## which is necessary for creating indicators and determining weights for the strategy.
snp_stocks_data = qndata.stocks_load_spx_data(min_date='2005-06-01')
weights = strategy(snp_stocks_data)
weights = qnout.clean(weights, snp_stocks_data) # fix liquidity
Data loaded 32s
Output cleaning...
fix uniq
ffill if the current price is None...
Check liquidity...
WARNING! Strategy trades non-liquid assets.
Fix liquidity...
Ok.
Check missed dates...
Ok.
Normalization...
Output cleaning is complete.
When using the backtester for weight calculation, there is no need to manually load data or define a "load_data" function as input. It is only required to specify the competition_type (in this case, "stocks_s&p500"), and the corresponding data will be loaded automatically by default.
### MULTI PASS ###
w = qnbt.backtest(
    competition_type="stocks_s&p500",
    lookback_period=250,
    start_date="2006-01-01",
    strategy=strategy,
    analyze=True,
)
Run last pass...
Load data...
fetched chunk 1/1 27s
Data loaded 27s
Run strategy...
Can't load state. [Errno 2] No such file or directory: '/root/state.in.pickle.gz'
Load data for cleanup...
fetched chunk 1/1 9s
Data loaded 9s
Output cleaning...
fix uniq
ffill if the current price is None...
Check liquidity...
Ok.
Check missed dates...
Ok.
Normalization...
Output cleaning is complete.
Write result...
Write output: /root/fractions.nc.gz
Statistics
While the multi pass approach automatically displays the strategy’s statistics through the backtester, single pass mode only calculates the weights, so visualizing the stats must be done manually. You can use the helper print_stats function for this purpose:
def print_stats(stat):
    display(stat.sel(time=slice('2006-01-01', None)).to_pandas().tail(10))
    performance = stat.to_pandas()["equity"]
    qngraph.make_plot_filled(performance.index, performance, name="PnL (Equity)", type="log")

def get_sharpe(stat):
    return stat.isel(time=-1).sel(field="sharpe_ratio").item()

stat = qnstats.calc_stat(snp_stocks_data, weights.sel(time=slice('2006-01-01', None)))
print_stats(stat)
time | equity | relative_return | volatility | underwater | max_drawdown | sharpe_ratio | mean_return | bias | instruments | avg_turnover | avg_holding_time |
---|---|---|---|---|---|---|---|---|---|---|---|
2025-05-09 | 2.121924 | -0.000003 | 0.046777 | -0.016068 | -0.072093 | 0.848812 | 0.039705 | 1.0 | 687.0 | 0.028887 | 27.695910 |
2025-05-12 | 2.121226 | -0.000329 | 0.046773 | -0.016392 | -0.072093 | 0.848341 | 0.039679 | 1.0 | 687.0 | 0.028883 | 27.695549 |
2025-05-13 | 2.120985 | -0.000114 | 0.046768 | -0.016503 | -0.072093 | 0.848119 | 0.039665 | 1.0 | 687.0 | 0.028879 | 27.691447 |
2025-05-14 | 2.120948 | -0.000017 | 0.046763 | -0.016520 | -0.072093 | 0.848008 | 0.039656 | 1.0 | 687.0 | 0.028875 | 27.691148 |
2025-05-15 | 2.121592 | 0.000304 | 0.046758 | -0.016222 | -0.072093 | 0.848266 | 0.039664 | 1.0 | 687.0 | 0.028869 | 27.690980 |
2025-05-16 | 2.121813 | 0.000104 | 0.046754 | -0.016120 | -0.072093 | 0.848295 | 0.039661 | 1.0 | 687.0 | 0.028864 | 27.690579 |
2025-05-19 | 2.122375 | 0.000265 | 0.046749 | -0.015859 | -0.072093 | 0.848509 | 0.039667 | 1.0 | 687.0 | 0.028862 | 27.690573 |
2025-05-20 | 2.122213 | -0.000076 | 0.046744 | -0.015934 | -0.072093 | 0.848331 | 0.039654 | 1.0 | 687.0 | 0.028860 | 27.690550 |
2025-05-21 | 2.120421 | -0.000844 | 0.046740 | -0.016765 | -0.072093 | 0.847259 | 0.039601 | 1.0 | 687.0 | 0.028862 | 27.690264 |
2025-05-22 | 2.119759 | -0.000312 | 0.046735 | -0.017072 | -0.072093 | 0.846808 | 0.039576 | 1.0 | 687.0 | 0.028858 | 27.658773 |
get_sharpe(stat)
0.8468079182993304
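As a quick sanity check, the printed columns appear to be consistent with the Sharpe ratio being the mean return divided by the volatility. The arithmetic below is only a reading of the numbers shown in the statistics table, not a description of how qnstats computes them.

```python
# Values copied from the last row (2025-05-22) of the statistics table.
mean_return = 0.039576
volatility = 0.046735
reported_sharpe = 0.846808

# Ratio implied by the two printed columns (both are rounded to six decimals).
implied_sharpe = mean_return / volatility
print(round(implied_sharpe, 4), round(reported_sharpe, 4))
```

The small remaining difference is within what rounding of the printed columns would explain. Either way, the value is above the 0.70 in-sample threshold listed in the considerations.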
Submit strategy to the competition
Use the Submit button on the My Strategies page.
Make sure that qnout.write(weights) has been added to a cell and that the weights have been written. This step is not required when using the multi pass backtester.
Note: After submitting the strategy to the contest, any weight exceeding 0.1 will be capped at that limit. You can apply weight normalization functions before submission, which may be more suitable, such as those that maintain the ratio between allocated weights. Details on these functions can be found here.
### Run this cell to get all weights below 0.1. This "hard cut" way is used on production if any weight exceeds 0.1.
### Any other normalization method can be used.
import qnt.exposure as qnexp
weights_capped = qnexp.cut_big_positions(weights=weights, max_weight=0.1)
Compare the weights before and after normalization, and check the statistics again: they can change, and not necessarily for the worse.
qnout.write(weights_capped)
Write output: /root/fractions.nc.gz
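The difference between a hard cut and a ratio-preserving alternative can be seen on a small NumPy example. Both helpers are hypothetical illustrations written for this page; they are not functions from qnt.exposure, and the weights are invented.

```python
import numpy as np

def hard_cap(weights, max_weight=0.1):
    """Clip each weight into [-max_weight, max_weight]; ratios between weights may change."""
    return np.clip(weights, -max_weight, max_weight)

def scale_to_cap(weights, max_weight=0.1):
    """Shrink all weights by one common factor so the largest absolute weight equals the cap."""
    largest = np.abs(weights).max()
    if largest <= max_weight:
        return weights.copy()
    return weights * (max_weight / largest)

w = np.array([0.40, 0.20, 0.05, -0.10])  # hypothetical raw weights
capped = hard_cap(w)                     # [0.1, 0.1, 0.05, -0.1]
scaled = scale_to_cap(w)                 # [0.1, 0.05, 0.0125, -0.025]
print(capped, scaled)
```

The hard cut makes the two largest positions equal, whereas scaling keeps the first position twice as large as the second at the cost of a lower total exposure. Which behaviour is preferable depends on the strategy, so it is worth rerunning the statistics for each variant.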
Common Reasons for Submission Rejection and Their Solutions
Here are some of the frequent reasons causing submission rejection in algorithmic trading competitions, and their corresponding remedies.
1) Missed call to write_output
Save the algorithm weights by running:
qnout.write(weights)
2) Not eligible send to contest. In-Sample Sharpe must be larger than 0.7
Improve your algorithm. For example, you can use the following sections to build an algorithm that passes the filter:
- Example Trading System Optimization
- Example of a strategy using technical analysis indicators
Need help? Check the Documentation and find solutions/report problems in the Forum section.
3) Not enough bid information.
Run the following code:
min_time = weights.time[abs(weights).fillna(0).sum('asset') > 0].min()
min_time
min_time must be less than or equal to January 1, 2006.
If min_time is later than the starting date, we recommend filling the starting values of the time series with non-vanishing values, for example with a simple buy-and-hold strategy.
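def get_enough_bid_for(data_, weights_):
    # Dates on which the strategy holds at least one non-zero position
    time_traded = weights_.time[abs(weights_).fillna(0).sum('asset') > 0]
    is_strategy_traded = len(time_traded)
    if is_strategy_traded:
        # Before the first traded date, hold every liquid asset (buy-and-hold placeholder)
        return xr.where(weights_.time < time_traded.min(), data_.sel(field="is_liquid"), weights_)
    return weights_

weights_new = get_enough_bid_for(snp_stocks_data, weights)  # dataset loaded in the single pass cell
weights_new = weights_new.sel(time=slice("2006-01-01", None))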
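The same backfilling logic can be followed step by step on a tiny NumPy array. This is a simplified sketch with invented values, where rows are days and columns are assets; it mirrors the idea of get_enough_bid_for without using xarray.

```python
import numpy as np

# Hypothetical 5-day x 3-asset example (rows are days, columns are assets).
weights = np.array([
    [0.0, 0.0,    0.0],
    [0.0, np.nan, 0.0],
    [0.5, 0.0,    0.5],
    [0.0, 1.0,    0.0],
    [0.3, 0.3,    0.4],
])
is_liquid = np.array([
    [1.0, 1.0, 0.0],
    [1.0, 1.0, 0.0],
    [1.0, 1.0, 1.0],
    [1.0, 1.0, 1.0],
    [1.0, 1.0, 1.0],
])

# Total absolute exposure per day, treating missing weights as zero.
exposure = np.abs(np.nan_to_num(weights)).sum(axis=1)
traded = np.flatnonzero(exposure > 0)
first_traded = int(traded[0]) if traded.size else None  # first day with a position

# Before the first traded day, hold every liquid asset; keep the strategy weights afterwards.
filled = weights.copy()
if first_traded is not None:
    filled[:first_traded] = is_liquid[:first_traded]
print(first_traded)
print(filled)
```

Here the strategy first trades on the third day, so the first two rows are replaced by the liquidity mask and the remaining rows are left unchanged.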