Machine learning in trading: theory, models, practice and algo-trading - page 295
You are missing trading opportunities:
- Free trading apps
- Over 8,000 signals for copying
- Economic news for exploring financial markets
Registration
Log in
You agree to website policy and terms of use
If you do not have an account, please register
The question of the correlation of variables has been discussed many times.
Of course, correlation is the most doubtful, very quickly begins to see a correlation with the rings of Saturn, coffee grounds...
For some reason, this has been forgotten:
The Granger causalitytest is a procedure for testing causality ("Granger causality") betweentime series. The idea of the test is that the values (changes) of time series{\displaystyle x_{t}}, which are the cause of changes of time series{\displaystyle y_{t}}, should precede the changes of this time series, and moreover should make a significant contribution to the forecast of its values. If each variable contributes meaningfully to the prediction of the other, then perhaps there is some other variable that affects both.
TheGrangertestconsistently tests the two null hypotheses: "x is not the cause of y by Granger" and "y is not the cause of x by Granger." To test these hypotheses two regressions are constructed: in each regression the dependent variable is one of the variables tested for causality, and the regressors are the lags of both variables (in fact, it is avector autoregression).
And here is the code for this case from here
Of course, correlation is the most dubious, very quickly begins to see a correlation with the rings of Saturn, coffee grounds...
Somehow this has been forgotten:
Nobody forgot anything....
That's when google correlate will ask, and what method esteemed user you want to measure the relationship? your post will be relevant then, but now Google is not asking and will not ask, the service is six years old, if they wanted to do it, they would have done it already
And another thing ...
Google has billions of BPs in the database, there will always be a hundred other BPs that were close by chance, just because the database is huge and it does not matter how they measure proximity, a simple correlation or something arcane and complicated.
The question is how to sift out the random from the non-random.
The question is how to sift the random from the not random
We can
1) divide the Eura series into two parts "1" and "2"
2) throw the row "1" in Google and it will find all of the close rows.
3) memorize names of all close rows.
4) throw row "2" into google way will find all close rows
5) memorize names of all close rows
6) compare the names of rows from items 3) and 5) and look for such a row which is present in 3) and in 5)
Thus we find series which do not accidentally correlate with euros - this is something like crossvalidation in its most primitive form.
But how to get these names I don't know, you probably need to parse the page
Nobody forgot anything....
That's when google correlate will ask, and what method dear user do you want to measure the connection? then your post will be relevant, but now Google is not asking, and will not ask, the service is 6 years old, if they wanted to do it, they would have done it already
And another thing ...
Google has billions of BPs in the database, there will always be a hundred other BPs that were close by chance, just because the database is huge and it does not matter how they measure proximity, a simple correlation or something arcane and complicated.
The question is how to sift out the random from the non-random
So sift out by test from all the garbage that google has collected - that's what I meant.
The difference between the two types of correlations is that in this situation the correlations are different.
I'm not going to do it, I'm not going to do it alone :)
It seems to me that the fossil fuel is the most cheaper in the world, but in this world the fossil fuel is the most cheaper.
No matter how sophisticated a test you apply, they all show a great connection.
I have my doubts about that. A high correlation of trends only means that they are generally growing and falling in approximately the same way. To start with it would be good to look for correlation of not trends but growths, for example you can save those similar trends in csv that google will show, then find lags by yourself and recalculate the correlation, it will be much more objective.
And correlation doesn't guarantee at all that one variable can predict the other. Generally, selecting predictors for prediction by the principle of high correlation is unfortunate. What SanSanych suggested I haven't tried before, but it looks more reliable.
We can
1) divide the eura row into two parts "1" and "2
2) throw row "1" in google way will find all close rows
3) memorize names of all close rows.
4) throw row "2" into google way will find all close rows
5) memorize names of all close rows
6) compare names of rows from points 3) and 5) and look for such a series which is present in both 3) and 5)
Thus we find series which do not accidentally correlate with euro; this is something like crossvalidation in its most primitive form.
But how to get these names I don't know, you probably need to parse the page
It's called the CHOU test.
Actually checks for sample heterogeneity in the context of a regression model.
I doubt that. High correlation of trends means only that they generally grow and fall about the same. To begin with it would be good to look for correlation of not trends, but increases, for example, you can save in csv those similar trends that google gives out, then find lags yourself and recalculate the correlation, it will already be much more objective.
Yes, I agree, but google doesn't show us all its base, but only what correlates "by trends", to take what correlates by trends and make increments from it and measure corr. is also not objective, probably ... :) you have to look at the whole database
It's called a PSE test.
Actually tests sample heterogeneity in the context of a regression model.
I read this pamphlet http://www.mirkin.ru/_docs/dissert065.pdfand wanted to use NeuroShell Day Trader Pro