Time-series Transformer Generative Adversarial Networks - A new way to generate tabular data - General

Andrey Dik 2022.12.11 10:37 #28451

Aleksey Nikolayev #:

Judging from the description, first a subset of the best passes is selected by one criterion, then from that subset the best passes by the second criterion are selected, and so on.

"It allows you to select the best passes step by step: first by the number of trades, then from this sample by the mat. expectation of profitability, then by the recovery factor and so on."

The criterion is calculated immediately, at each optimisation pass, not at the end of the optimisation with the results of all passes taken into account. That is why the actual behaviour does not match the description in the help.
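
The difference between the two behaviours can be sketched roughly as follows. This is only an illustration, not how the tester actually works: the field names, the `keep_fraction` parameter and the product formula in `per_pass_score` are all assumptions made up for the example. The point is that stepwise selection needs the results of all passes, while a per-pass score sees only one pass.

```python
from typing import Dict, List

Pass = Dict[str, float]  # one optimisation pass; field names are hypothetical


def select_stepwise(passes: List[Pass], criteria: List[str],
                    keep_fraction: float = 0.5) -> List[Pass]:
    """Filter the best passes criterion by criterion (needs ALL passes)."""
    selected = list(passes)
    for key in criteria:
        # Rank the current subset by this criterion, best first.
        selected.sort(key=lambda p: p[key], reverse=True)
        # Keep the top share, but never fewer than one pass.
        keep = max(1, int(len(selected) * keep_fraction))
        selected = selected[:keep]
    return selected


def per_pass_score(p: Pass, criteria: List[str]) -> float:
    """A score computed from a single pass only (assumed formula: product)."""
    score = 1.0
    for key in criteria:
        score *= p[key]
    return score


criteria = ["trades", "expected_payoff", "recovery_factor"]
passes = [
    {"id": 1, "trades": 120, "expected_payoff": 1.5, "recovery_factor": 2.0},
    {"id": 2, "trades": 300, "expected_payoff": 0.4, "recovery_factor": 1.1},
    {"id": 3, "trades": 250, "expected_payoff": 2.1, "recovery_factor": 0.9},
    {"id": 4, "trades": 40, "expected_payoff": 5.0, "recovery_factor": 4.0},
    {"id": 5, "trades": 200, "expected_payoff": 1.9, "recovery_factor": 3.2},
    {"id": 6, "trades": 90, "expected_payoff": 0.8, "recovery_factor": 1.5},
    {"id": 7, "trades": 310, "expected_payoff": 1.2, "recovery_factor": 2.5},
    {"id": 8, "trades": 60, "expected_payoff": 3.0, "recovery_factor": 2.8},
]

stepwise_ids = [p["id"] for p in select_stepwise(passes, criteria)]
scores = {p["id"]: per_pass_score(p, criteria) for p in passes}
print(stepwise_ids)
print(scores)
```

The stepwise version cannot be evaluated until every pass has finished, which is exactly what a value calculated at each pass cannot do.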

Evgeni Gavrilovi 2022.12.11 14:38 #28452

Maxim Dmitrievsky #:

I didn't immediately see any difference or advantage

A new way to generate tabular data. How much better is it? Or is GMM still unrivalled?

https://github.com/kathrinse/be_great

Maxim Dmitrievsky 2022.12.11 15:32 #28453

Evgeni Gavrilovi #:

A new way to generate tabular data. How much better is it? Or is GMM still unrivalled?

https://github.com/kathrinse/be_great

I don't know, I don't analyse tabular data

Not good for time series

Some T-gan would probably be better

⚙️ Time-series Transformer Generative Adversarial Networks

Github: https://github.com/jsyoon0823/TimeGAN

Paper: https://arxiv.org/abs/2205.11164v1

Stock data: https://finance.yahoo.com/quote/GOOG/history

Energy data: http://archive.ics.uci.edu/ml/datasets/Appliances+energy+prediction

@ai_machinelearning_big_data

Evgeni Gavrilovi 2022.12.12 18:15 #28454

Maxim Dmitrievsky #:
Some T-gan would probably be better

And how do you check the plausibility? Compare the distributions of real and synthetic data separately for each series?

Maxim Dmitrievsky 2022.12.13 09:40 #28455

Evgeni Gavrilovi #:

And how do you check the plausibility? Compare the distributions of real and synthetic data separately for each series?

I've seen a visual comparison via PCA somewhere, but I can't remember where offhand. Maybe I'll find it later.
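
As for comparing the distributions separately for each series, a minimal sketch could look like this. It uses the two-sample Kolmogorov-Smirnov statistic, which is my choice for the example and not something named in the thread; the series names and the data are made up.

```python
from bisect import bisect_right
from typing import Dict, Sequence


def ks_statistic(real: Sequence[float], synth: Sequence[float]) -> float:
    """Two-sample KS statistic: largest gap between the empirical CDFs."""
    a, b = sorted(real), sorted(synth)
    d = 0.0
    # The gap between two step functions can only change at sample points.
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d


def compare_series(real: Dict[str, Sequence[float]],
                   synth: Dict[str, Sequence[float]]) -> Dict[str, float]:
    """KS statistic for each named series (0 = identical, 1 = disjoint)."""
    return {name: ks_statistic(real[name], synth[name]) for name in real}


# Hypothetical data: "close" is reproduced exactly, "volume" is far off.
real_data = {"close": [1.0, 2.0, 3.0, 4.0], "volume": [10.0, 20.0, 30.0, 40.0]}
synth_data = {"close": [1.0, 2.0, 3.0, 4.0], "volume": [50.0, 60.0, 70.0, 80.0]}

ks_report = compare_series(real_data, synth_data)
print(ks_report)
```

Keep in mind that this only compares the marginal distribution of each series. It says nothing about the time dependence within a series or the relationships between series, so it is at best a first sanity check.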

Maxim Dmitrievsky 2022.12.16 10:08 #28456

Evgeni Gavrilovi #:

And how do you check the plausibility? Compare the distributions of real and synthetic data separately for each series?

https://hackernoon.com/a-gan-approach-to-synthetic-time-series-data-pe2r33fd

A GAN approach To Synthetic Time-Series Data | HackerNoon


Aleksey Vyazmikin 2022.12.20 20:08 #28457

What predictors can we come up with for histograms?

I have attached them as files because images won't insert, probably another bug.

Files:

Poisk_List_0.png 108 kb

Poisk_List_1.png 38 kb

Poisk_List_64.png 44 kb

mytarmailS 2022.12.21 16:14 #28458

Aleksey Vyazmikin #:

What predictors can we come up with for histograms?

)))))))

I'm embarrassed to ask, but what is the difference between a histogram and points, other than visualisation?

Aleksey Vyazmikin 2022.12.21 17:55 #28459

mytarmailS #:
)))))))

I'm embarrassed to ask, but what is the difference between a histogram and points, other than visualisation?

You can visualise any shape with dots. Visualisation is needed to stimulate abstract thinking, which stimulates the generation of ideas.

Indeed, in the histogram of a binary predictor over the sample, the red bars mean that the signal is absent (zero), and their height shows how long there was no signal "1" in the sample.

I assume that the shape of the frequency distribution of signal occurrences in the sample can be used to decide how this predictor should be used in training. Accordingly, the predictor can be excluded or recommended only for building the upper splits near the root.

That is why predictors describing the histograms are needed. Predictors can also be built for the TP+FP balance; ideas for describing it, other than the well-known ones, are also welcome.
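
To make the question concrete, here is a rough sketch of how the bar heights could be turned into numbers. The feature set (number of gaps, mean and maximum gap, coefficient of variation, share of zeros) is only an assumption for illustration, and all function and key names are invented for the example.

```python
from itertools import groupby
from statistics import mean, pstdev
from typing import Dict, List, Sequence


def zero_run_lengths(signal: Sequence[int]) -> List[int]:
    """Lengths of consecutive runs of zeros, i.e. the bar heights."""
    return [sum(1 for _ in group)
            for value, group in groupby(signal) if value == 0]


def gap_features(signal: Sequence[int]) -> Dict[str, float]:
    """Summary statistics of the gaps between "1" signals (assumed set)."""
    runs = zero_run_lengths(signal)
    if not runs:
        # The signal never drops to zero: no gaps to describe.
        return {"n_gaps": 0, "mean_gap": 0.0, "max_gap": 0,
                "cv_gap": 0.0, "zero_share": 0.0}
    avg = mean(runs)
    return {
        "n_gaps": len(runs),              # how many times the signal vanished
        "mean_gap": avg,                  # typical length of a gap
        "max_gap": max(runs),             # longest stretch without a signal
        "cv_gap": pstdev(runs) / avg,     # how uneven the gaps are
        "zero_share": sum(runs) / len(signal),  # share of the sample at zero
    }


# Hypothetical binary predictor over 12 sample rows.
example_signal = [1, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 1]
example_runs = zero_run_lengths(example_signal)
example_features = gap_features(example_signal)
print(example_runs)
print(example_features)
```

A predictor with a few very long gaps and one with many short gaps can have the same share of zeros, so `max_gap` and `cv_gap` are the features that would tell them apart.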


Aleksey Nikolayev 2022.12.21 18:10 #28460

Aleksey Vyazmikin #:

You can visualise any shape with dots. Visualisation is needed to stimulate abstract thinking, which stimulates the generation of ideas.

Indeed, in the histogram of a binary predictor over the sample, the red bars mean that the signal is absent (zero), and their height shows how long there was no signal "1" in the sample.

I assume that the shape of the frequency distribution of signal occurrences in the sample can be used to decide how this predictor should be used in training. Accordingly, the predictor can be excluded or recommended only for building the upper splits near the root.

That is why predictors describing the histograms are needed. Predictors can also be built for the TP+FP balance; ideas for describing it, other than the well-known ones, are also welcome.

This is not a histogram, or at least not a histogram in the conventional sense, as Pearson introduced it.

Machine learning in trading: theory, models, practice and algo-trading - page 2846