Trading is the hardest way to make easy money...
> Use Monte Carlo simulation to select the most reliable backtest parameters.
I am familiar with the Monte Carlo Permutation Method (MCP) as described in Aronson's book "Evidence-Based Technical Analysis" (short online extract here). He uses that method to test against the null hypothesis: what is the likelihood that the test results are pure luck?
However, in the example in post #1, I backtest 1250 different parameter combinations. Do you propose to run MC's over each of these 1250 backtests? That will give 1250 profit profiles similar to the graphs in post #21.
Suppose half of them have a likelihood of over p=0.995 that they are profitable. What would then be the next step to select the best one?
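For concreteness, an Aronson-style permutation test can be sketched in a few lines: given a list of position signals (+1/-1) and the matching bar returns, shuffle the signals many times and count how often a shuffled alignment earns at least the real profit. This is a rough sketch of the idea, not Aronson's exact procedure; function and variable names are my own.

```python
import random

def mcp_p_value(positions, returns, n_perm=2000, seed=42):
    """Monte Carlo permutation test: how often does a randomly
    shuffled signal do at least as well as the real one?"""
    rng = random.Random(seed)
    real = sum(p * r for p, r in zip(positions, returns))
    count = 0
    shuffled = list(positions)
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if sum(p * r for p, r in zip(shuffled, returns)) >= real:
            count += 1
    # Add-one smoothing so the p-value is never exactly zero.
    return (count + 1) / (n_perm + 1)
```

A small p-value means the alignment of signal and returns is unlikely to be luck; a signal that never varies yields a p-value near 1, since shuffling cannot change its profit.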
- Joined Nov 2020 | Status: Member | 1,897 Posts
> Probably off-topic, but I couldn't resist. I really like @PercyJames description of the market as a giant balancing machine... {image} Check out the image above -- the indicator is based on DSS-Bressert. It's a simple algorithm, just a few lines -- based on double-smoothed Stochastics. Do you see the rhythmic pattern of peaks and valleys at repeating intervals? Really quite amazing. The price curve above doesn't display anything that would reflect that type of pattern. This repeating pattern of peaks and valleys occurs with...
yoriz,
You're asking good questions - ones that I asked a long time ago. There is an essential but long-forgotten thread called Systematic Trading, and I strongly suggest you read at least the first 20 pages or so. That thread should answer most of the questions that have been asked and fill in a lot of detail that I can't provide here. However, I think I should point out a couple of things to make it more obvious where to focus.
Your edge or systematic advantage is not your parameters. Yet this is what is typically done: run a series of backtests, select the best parameters based on some metric, trade them forward; it ends in tears.
Think about the story with the monkeys throwing darts at the financial papers. The monkeys were better than fund managers at picking stocks, not because they threw the dart one time but because they threw it many times and thereby gained an average of the stock returns of a random portfolio. A random portfolio has a positive advantage because the stock market is long term upward trending. This was the monkey's edge along with diversification.
When you select a single system to trade you are betting that those parameters will remain highly profitable into the future. However the system results are much more likely to mean revert back to the average of the advantage rather than remain as an outlier. You are the fund manager, chasing past returns and underperforming a monkey.
This is why it seems that the market changes and the system falls apart. The market really doesn't change; in fact the market's nature is constant change, so it stays the same. You are, however, vastly overestimating the edge that your system should provide. And you aren't making use of the law of large numbers and are thus subject to huge random variation.
So what is your advantage or edge if not one particular system with its parameters? Your edge is the average result from your set of rules that you have parametrized. In other words you should trade all the systems in your parameter space. However probably half or more of these systems will be marginal or losers and your average profit will likely be at or near zero if you do so.
So you split your in-sample data and apply a fitness metric or function to weed out the chaff. A fitness function can be any sort of rule or performance metric (like average profit per trade, total profit, profit / max drawdown [Sterling ratio], Sharpe ratio, Kelly total growth, profit factor, etc.). Discard the systems in your system space that don't meet your minimum fitness metric and run the survivors forward on the second part of your data. You don't really care about the performance of any one system, because you aren't trading any one of them but a combination of multiple systems. So your average or combined performance is what matters.
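As a sketch of this filtering step (the names and the profit-factor threshold below are illustrative, not from the thread):

```python
def profit_factor(trades):
    """Gross profit divided by gross loss."""
    gains = sum(t for t in trades if t > 0)
    losses = -sum(t for t in trades if t < 0)
    return gains / losses if losses > 0 else float("inf")

def select_systems(results, min_pf=1.3):
    """results maps a parameter tuple to (in_sample_trades,
    out_of_sample_trades). Keep systems whose in-sample profit factor
    clears the bar, then report the average out-of-sample profit of
    the survivors -- the combined performance is what matters."""
    chosen = {p: oos for p, (ins, oos) in results.items()
              if profit_factor(ins) >= min_pf}
    avg_oos = (sum(sum(t) for t in chosen.values()) / len(chosen)
               if chosen else 0.0)
    return chosen, avg_oos
```

Any of the other metrics mentioned above (Sharpe, Sterling, total profit) could replace `profit_factor` without changing the structure.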
There are effectively two ways to combine multiple systems into a single equity curve: trade a portfolio of all the chosen systems with certain weights - such as modern portfolio theory, or trade a combination of the chosen systems in an ensemble (machine learning). You should be conversant in both but you will get much more performance from ensembles though stability is more challenging.
In terms of robustness, the most robust set of weights is equal weights. You may be tempted to optimize the weights, but you are very likely to waste a lot of time, overfit the data, and end up with something that fails out of sample.
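An equal-weight combination is then just a point-wise average of the chosen systems' equity curves, e.g.:

```python
def combine_equal_weight(equity_curves):
    """Average several equity curves point-by-point with equal
    weights -- the most robust choice, since optimized weights
    tend to overfit the in-sample data."""
    n = len(equity_curves)
    return [sum(vals) / n for vals in zip(*equity_curves)]
```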
Your goal in all this is not to create a single system, but a system-creation framework that generally produces out-of-sample results that match at least 70% of your in-sample performance and thus has good consistency. Because you are using a fitness metric to select, you will periodically need to reoptimize or create a new system that includes the latest data, since some of your systems will drop below the minimum threshold while others will rise above it.
That's about as much as I want to share on this topic. Read the thread linked in this post and you should be on your way. It's not a short term project and has taken me many years to get to this point with some failures and success along the way. Good luck!
- Joined Nov 2020 | Status: Member | 1,897 Posts
One more thing is to concentrate on similarities that aren't similar (framework). Hidden in the obvious.
- Joined Aug 2014 | Status: Trading | 2,142 Posts
> Can you please elaborate on that? I am familiar with the Monte Carlo Permutation Method (MCP) as described in Aronson's book "Evidence-Based Technical Analysis" (short online extract here). He uses that method to compare against the Null Hypothesis. What is the likelihood the test results are pure luck? However, in the example in post #1, I backtest 1250 different parameter combinations. Do you propose to run MC's over each of these 1250 backtests? That will give 1250 profit...
https://towardsdatascience.com/impro...s-abacde033adf
- Joined Nov 2020 | Status: Member | 1,897 Posts
> Hahaha, now you sound like a fairy tale wizard speaking in riddles ;-) Not sure what to do with this advice...
https://en.wikipedia.org/wiki/Congruence_(geometry)
--- This fairy tale might help
In Euclidean geometry, two objects are similar if they have the same shape, or one has the same shape as the mirror image of the other. More precisely, one can be obtained from the other by uniformly scaling (enlarging or reducing), possibly with additional translation, rotation and reflection. This means that either object can be rescaled, repositioned, and reflected, so as to coincide precisely with the other object. If two objects are similar, each is congruent to the result of a particular uniform scaling of the other.
> There is an essential but long forgotten thread called Systematic Trading and I strongly suggest you read at least the first 20 pages or so.

> You don't really care about the performance of any one system, because you aren't trading any one of them but a combination of multiple systems. So your average or combined performance is what matters.
A few quick tests averaging the forward test profits of the top-x strategies in the spreadsheet from post #2 already showed nice results. This is definitely something I am going to study more. Thank you for this valuable insight!
> There are effectively two ways to combine multiple systems into a single equity curve: trade a portfolio of all the chosen systems with certain weights - such as modern portfolio theory, or trade a combination of the chosen systems in an ensemble (machine learning).
- Joined Nov 2020 | Status: Member | 1,897 Posts
> Hahaha, now you sound like a fairy tale wizard speaking in riddles ;-) Not sure what to do with this advice...
I'm trying to spot this baby snake
> the following link is a good start to be familiar with MC application in backtesting.
How can we apply MC for system selection? We can run MC simulations on each of the hundreds or thousands of backtests we run when varying the parameters. Using MC simulation will surely improve the accuracy of estimates like Return/DD and similar backtest metrics. As usual, we will likely see that some parameters are clearly not profitable, while others appear to be profitable.
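One common way to run such an MC over a single backtest is to bootstrap its trade list: resample the trades with replacement many times and look at the resulting distribution of Return/MaxDD rather than the single observed value. A minimal sketch (my own naming, not from the linked article):

```python
import random

def max_drawdown(equity):
    """Largest peak-to-trough decline along an equity curve."""
    peak, dd = equity[0], 0.0
    for x in equity:
        peak = max(peak, x)
        dd = max(dd, peak - x)
    return dd

def mc_return_dd(trades, n_sims=1000, seed=0):
    """Resample the trade list with replacement and collect the
    Return / MaxDD ratio of each simulated equity path."""
    rng = random.Random(seed)
    ratios = []
    for _ in range(n_sims):
        sample = [rng.choice(trades) for _ in trades]
        equity, total = [0.0], 0.0
        for t in sample:
            total += t
            equity.append(total)
        dd = max_drawdown(equity)
        ratios.append(total / dd if dd > 0 else float("inf"))
    return sorted(ratios)
```

A conservative estimate would then read off a low percentile (say the 5th) of the sorted ratios instead of the backtest's single headline number.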
But still the question remains how to select the most robust, stable parameters (or how to select a subset of systems to create a portfolio of promising systems, as FXEZ pointed out). I assume that selecting the most profitable strategies from the MC'd backtest results is not wise, because these are probably just lucky, curve-fitted instances (i.e. the 90+ percentile in the profit distribution in post #1).
Any suggestions?
- Joined Aug 2011 | Status: Member | 1,305 Posts
I see your graph as a noisy image in parameter space. Running a low-pass filter (blur) on it was my first idea. I took your data and applied a 5x5 uniform kernel to it (a 2D SMA). You can see that the lucky peak gets averaged out. Then I spotted (in red) the three best local maxima. The rationale is that you seek parameters which give similar (and good!) results when slightly changed (a flat high plateau), so that they can be assumed robust. You can take the top N best into your portfolio.
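For reference, the 5x5 uniform kernel (box blur) over the parameter grid can be sketched as below. Edge cells simply average over the part of the window that stays inside the grid, which is one of several possible border conventions and may differ from what was actually used:

```python
def smooth_grid(grid, k=5):
    """Apply a k x k uniform (box) filter to a 2-D list of backtest
    profits: each cell is averaged with its neighbours, so an
    isolated lucky spike is flattened while a broad plateau of
    similar results survives."""
    rows, cols, h = len(grid), len(grid[0]), k // 2
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            # Clip the window at the grid borders.
            vals = [grid[a][b]
                    for a in range(max(0, i - h), min(rows, i + h + 1))
                    for b in range(max(0, j - h), min(cols, j + h + 1))]
            out[i][j] = sum(vals) / len(vals)
    return out
```

Picking the local maxima of the smoothed grid then favors flat high plateaus over sharp peaks.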
There are also these surprisingly good results in the forward test. Two hypotheses come to mind:
1/ this set of parameters has no predictive power and result is random
2/ It highly depends on the market condition which has changed between the two datasets.
You said you used E/U from 2020-01-01 to 2020-07-01. Right after this period is a smoother uptrend. Can this be an explanation?
No greed. No fear. Just maths.
> I took your data and applied a 5x5 uniform kernel on it (a 2D SMA). You can see that the lucky peak gets averaged out. Then I spotted (in red) the three best local maximas.
> There is also this surprisingly good results in the forward test. Two hypothesis pop to my mind
> 1/ this set of parameters has no predictive power and result is random
I intend to capture the lessons learned from this thread in code to have the EA re-optimize itself automatically. Then I can do some serious WFA testing, because the current manual approach is very time consuming. Indeed, I might then discover that this simple strategy has no predictive power.
> 2/ It highly depends on the market condition which has changed between the two datasets. You said you used E/U from 2020-01-01 to 2020-07-01. Right after this period is a smoother uptrend. Can this be an explanation?
- Joined Aug 2014 | Status: Trading | 2,142 Posts
> Thanks again for the link. I have read the article. What it describes is a pragmatic introduction to MC, what Aronson describes in more detail in his book. However, both the article and Aronson only consider one single system at a time. How can we apply MC for system selection? We can run MC simulations on each of the hundreds or thousands of backtests we run when varying the parameters. Using MC simulation will surely improve the accuracy of estimates like Return/DD and similar backtest metrics. As usual, we will likely see that some parameters...
Hope this helps you with your portfolio!
/Matt
- Joined Dec 2017 | Status: Member | 4,378 Posts
The following comments won't be well-received, but a little bit of controversy never hurts...
Obviously y'all are smart fellows -- but you aren't traders. Your satisfaction comes from thinking about and developing theoretical systems. And there is nothing wrong with that. My comments that follow are primarily intended for those who might consider following in your footsteps...
Your data mining, statistical analysis, Monte Carlo analysis are great for optimizing a system with known constraints -- e.g. traffic control, where there are known constraints and the goal is to optimize a few parameters to achieve the greatest traffic throughput. But in FX everything is in motion -- there are no known constraints that don't change with timeframe or pair or market conditions.
If your over-fitted systems really worked you wouldn't need a thread to discuss how to make them work -- instead, you'd spend your time actually trading, right?
But let's say one of your systems did work for some interval of time. Y'all already acknowledge that constant recalibration is required, right? Well -- a real trader knows that by the time you acknowledge your current system isn't good enough and requires recalibration, you've already lost more than what you may have previously profited.
I'm a strong believer in algorithmic trading -- so, my intent is not to discourage you. Rather, to encourage you to favor strategies that don't rely on settings that require calibration. That's a bit difficult because 95% of the indicators out there employ some flavor of moving average that requires a "period" parameter whose optimal value changes with time frame, pair, market conditions, etc. If your strategy employs a moving average then you will always be chasing your tail...
For those readers at a crossroads as to what approach to take, first learn how to trade discretionarily. The gap between a successful algo trader and a failed one is much smaller than you think -- until you appreciate that, you'll be doomed to failure. Study the actual price data at the ground level. Statistical analysis is the view from 30,000 feet. Once you become a good discretionary trader, then you are in a position to design and develop a system -- if that is what you enjoy doing. And most important of all -- KISS...
Building upon the excellent suggestion from @FXEZ, I tried to build portfolios of many different parameter sets to average out the performance of the individual settings. In the plots below, I sorted on the various backtest metrics as provided by the MetaTrader strategy tester and created portfolios of the top-1, top-2, top-3, etc. strategies (truncated at max. 500 strategies for scaling):
Recovery Factor and Equity DD % look like the most robust metrics, as all portfolios between N=6..332 (Recovery Factor) and N=2..400 (Equity DD %) are profitable, albeit with varying profits obviously. Expected Payoff is a useless metric for this strategy. In fact, by inverting the sort order (sorting ascending, so building a portfolio of the worst Expected Payoffs), I got excellent results, but it does not feel robust to use something that is counter-intuitive:

What worries me is that the Backtest Profit is also a very bad predictor of Forward Test Profit. Is that a sign that the strategy has no predictive power?
Several curves start with a deep dip and climb back up again. Does that confirm my hypothesis that we should filter out the top scoring strategies because these are just lucky curve fits?
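The top-N construction behind these plots amounts to: sort by one in-sample metric, take the N best, and average their forward-test profits. A sketch, where the dictionary keys are illustrative names rather than MetaTrader's exact report labels:

```python
def top_n_portfolio(strategies, metric, n):
    """strategies: list of dicts holding backtest metrics plus a
    'forward_profit' entry. Rank by the chosen in-sample metric,
    take the top n, and average their forward-test profits."""
    ranked = sorted(strategies, key=lambda s: s[metric], reverse=True)
    chosen = ranked[:n]
    return sum(s["forward_profit"] for s in chosen) / len(chosen)
```

Sweeping n from 1 up to the full parameter space reproduces the curves described above, one curve per sort metric.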
> The trick is to consider each BASKET BUY and BASKET SELL as the outcomes of the synthetic symbol (like someone is trading a single asset).
The only thing that worries me about this approach is how to make it practical. Brute-forcing all combinations is obviously impossible. Should I follow an additive approach by first trying to pair only 2 strategies (that is "only" 1.5 million combinations to try), and then see which 3rd strategy matches best with the already-found pair? Etc. Still a huge amount of CPU cycles, but doable in days or weeks.
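A cheaper alternative to the additive pairing described above is plain greedy forward selection: start with the single best equity curve, then repeatedly add whichever remaining curve most improves the combined score, costing on the order of size x N portfolio evaluations instead of a combinatorial search. A sketch; the final-profit / max-drawdown score is an illustrative choice, not the only sensible one:

```python
def greedy_portfolio(curves, size, score=None):
    """Greedily build an equal-weight portfolio of equity curves:
    at each step add the curve that most improves the score of the
    blended (averaged) portfolio curve."""
    def max_dd(eq):
        peak, dd = eq[0], 1e-9  # tiny floor avoids division by zero
        for x in eq:
            peak = max(peak, x)
            dd = max(dd, peak - x)
        return dd
    score = score or (lambda eq: eq[-1] / max_dd(eq))
    def blend(members):
        return [sum(v) / len(members) for v in zip(*members)]
    chosen, remaining = [], list(curves)
    while remaining and len(chosen) < size:
        best = max(remaining, key=lambda c: score(blend(chosen + [c])))
        chosen.append(best)
        remaining.remove(best)
    return chosen
```

Greedy selection can miss combinations that only work as a group, but it keeps the search tractable for thousands of candidate strategies.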
> The following comments won't be well-recieved, but a little bit of controversy never hurts...

> If your over-fitted systems really worked you wouldn't need a thread to discuss how to make them work
Also highly recommended is the excellent thread "Machine Learning with AlgoTraderJo". He describes how he mines (or optimizes, if you like) a strategy on real symbol data, but also on 200 randomized synthetic data series (think of it like shuffling the returns of the bars in a chart to remove any causality between subsequent bars while preserving the overall ranges and dynamics of the symbol). To check data-mining bias, he allows only some x% of the strategies mined from random data to be better than those mined from the real data. That is a very expensive but thorough way to ensure you are not curve fitting!
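The shuffled-returns construction can be sketched in a few lines: the synthetic series keeps the start point, end point, and the multiset of bar-to-bar moves, but destroys their ordering. This is an additive-returns variant for illustration; AlgoTraderJo's exact construction may differ (e.g. shuffling log returns):

```python
import random

def shuffled_series(prices, seed=None):
    """Build a synthetic price series by shuffling the bar-to-bar
    differences: overall range and volatility are preserved, but
    any causal structure between consecutive bars is destroyed."""
    rng = random.Random(seed)
    rets = [b - a for a, b in zip(prices, prices[1:])]
    rng.shuffle(rets)
    out = [prices[0]]
    for r in rets:
        out.append(out[-1] + r)
    return out
```

Mining the same strategy on a few hundred such series gives a baseline for how profitable pure data-mining luck looks on structureless data.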
> encourage you to favor strategies that don't rely on settings that require calibration.
Unfortunately, these are fairly limiting in what rules are possible. And beware that often there is an implicit parameter hidden in the rules of these strategies. For example: "if price crosses the resistance twice ...". Why twice and not three times? Yes, you hardcoded "2", but only after you experimented or backtested a bit with "1", "3" and "4". So you don't have data-mining bias, but you have selection bias (or survivorship bias) instead.
> Once you become a good discretionary trader then you are in the position to design and develop a system

> Not true at all. That statement simply reflects your limited experience in trading and bias in favor of complexity.

> unfortunately skills don't necessarily transfer due to being an acquaintance.

> another reason I know you are not a trader: with all your statistical analysis you don't mention anything about money management.

> If you couldn't code a successful strategy with the aid of your discretionary-trading friend, then how does statistical analysis overcome that deficiency?

> For OOS you'll typically use the next chunk of data adjacent to the chunk you optimized for. The probability is very high the adjacent chunk represents similar market conditions as the chunk used for optimizing your settings. [...] your optimized settings that seemed to work on the adjacent OOS data fail miserably on the non-adjacent OOS data. It's a common misconception -- the problem is that your testing software probably forces you to use adjacent data for OOS.
Thank you for bringing a fresh, different perspective, but I feel we are drifting more and more off-topic. How do you select the best strategies and parameter sets in your trading?
- Platform Tech / Finding The Most Robust Parameters Using Optimization