How to improve the effectiveness of tree learning - General

Forester 2023.03.20 18:00 #29751

Aleksey Nikolayev #:

As far as I understand, "quantisation" (histograms) is used in bousting to speed up, so that there are fewer variants for splits. If so, the solution is good for its universality, but may be bad in a particular case - the real boundary may be lost.

Yes, that's correct. It speeds up and can be attributed to regularisation. But it also loses the exact split.

Aleksey Vyazmikin 2023.03.20 18:53 #29752

Forester #:

Maybe, but I don't see any fish there. I don't use quantisation at all. I prefer to explore float data.

I'm sorry you don't believe me.

I can demonstrate the effectiveness on your sample, compare the learning curve.

Aleksey Vyazmikin 2023.03.20 18:58 #29753

Forester #:

Yes, that's right. It speeds up and can be attributed to regularisation. But it also loses the exact split.

Accurate split on history. If the nature of predictor values distribution is known, quantisation can pick out exactly the range with characteristic stable behaviour. For trading it is just relevant.

Forester 2023.03.20 20:12 #29754

Aleksey Vyazmikin #:

Accurate split on history. If the nature of the distribution of predictor values is known, quantisation can pick out exactly the range with characteristic stable behaviour. For trading it is just relevant.

The task of searching for ranges/splits is solved by tree learning. There is at least some meaningful formula separating the rows with respect to the target.
In quantisation it is just a counter + skipping of doubles. Quantisation occurs without any check on the target function.
.

If a tree with training on the target does not give stability (or gives very weak stability), how will a counter that has no relation to the target give it? Only random and sometimes randomly good segments, which in time will cease to be such.

Interesting topic for many: Is there a pattern Machine learning for robots

Aleksey Vyazmikin 2023.03.20 20:31 #29755

Forester #:
The task of searching for ranges/splits is solved by a tree during training. At least there is some meaningful formula separating the rows with respect to the target.
In quantisation it is just a counter + skipping of doubles. Quantisation occurs without any check on the target function.
.

If a tree with training on the target does not give stability (or gives very weak stability), how will a counter that has no relation to the target give it? Only random and sometimes randomly good segments, which in time will cease to be such.

Quant tables have to be selected for each predictor. Assuming a lucky random is hit, that's what I want to identify. Random or not. Not with 100% reliability, but even by eliminating 30% of randomness you can improve the quality of the trained model.

I am developing my split estimation function (algorithm), which should reduce the disadvantage of trees - greed.

It is strange of course, I have been working on this topic for years, I have done a lot of experiments with different samples, I have statistics on the effectiveness of the approach, I say that the method works, but I meet with distrust.

Is there a pattern Phoenix 2007 (new thread) Developing an EA that

Forester 2023.03.20 20:43 #29756

Aleksey Vyazmikin #:

Quantum tables should be selected for each predictor. Let's say a lucky randoms hit - that's what I want to detect. Random or not.

How can it be NOT random with respect to the target if the target is not involved in choosing the quantisation point? Only random.

Aleksey Vyazmikin 2023.03.21 02:53 #29757

Forester #:

How can it be NOT random with respect to the target, if the target is not involved in choosing the quantisation point? Only random.

It is random, but the pattern is not random. I.e. it will persist in the future. The estimation takes into account the same target.

On the other hand, no one prevents to more accurately immediately split the predictor into quantum segments taking into account the target.

From theory to practice EURUSD - Trends, Forecasts How can I tell

Forester 2023.03.21 05:52 #29758

Aleksey Vyazmikin #:
On the other hand, nobody prevents to split the predictor more accurately at once into quantum segments taking into account the target.

It is the task of the tree to find the best splitting point, so that the purity of the classes of the target right and left sections is maximised.

Do you want to estimate purity during quantisation? Essentially you want to do the same thing the tree will do later. Turn off quantisation and you get what you want. The tree will pick the best split point given the target.

I divide traders into What is the best Errors, bugs, questions

Aleksey Vyazmikin 2023.03.21 12:05 #29759

Forester #:

It is the task of the tree to find the best split point, so that the purity of the target right and left classes is maximised.

You want to estimate purity during quantisation? Essentially you want to do the same thing that the tree will do later. Turn quantisation off and you get what you want. The tree will pick the best split point given the target.

Tired of explaining that "best" is often not the best choice.

Instead of questions - statements - as if we are doing religion....

Maxim Dmitrievsky 2023.03.21 12:31 #29760

Aleksey Vyazmikin #:

Tired of explaining that "best" is often not the best choice.

Instead of questions - statements - as if we are dealing with religion....

You have a specific approach with specific terminology, people are not ready to sacrifice space on their hard drive for such information, without understanding the result. I see the way out is either to make a detailed article to find enthusiasts who will (most likely) start pushing this in the marketplace right away 😀, or pay someone a little bit as a journeyman. Or a student for a stick of sausage.

Here's an example of my last article about meta models even. How many people wrote with suggestions and improvements, ideas? Zero :) and there's a ready-made TC. How many have taken it and are using it? At least several, without feedback.

It's a very difficult question in general, when you do something and then you're afraid to put it in the public domain, so that you don't turn out to be a sucker. But at the same time you want help. Here there are 1000 kites for one Mother Teresa, they will just eat you up. This is such a specific field of activity. Then you're on your own.)

Why are the "Dear who ! who ! Average advisor cost

Machine learning in trading: theory, models, practice and algo-trading - page 2976