Quantcast
Channel: Statalist
Viewing all articles
Browse latest Browse all 65623

Using dummy variables/interactions in a regression and possible problem of overfitting

$
0
0
I have three type of dummy variables (hours, weekdays, months). Using those dummy variables as interactions in a form
Code:
months#weekdays#hours
creates around 2000 variables (although p-values for most of them are significant).

I am worried regarding overfitting and what other approach I could use? If I use
Code:
months#weekdays weekdays#hours months#hours
I get less variables, but also less Adjusted R2 and also RSME.

Viewing all articles
Browse latest Browse all 65623

Latest Images

Trending Articles



Latest Images

<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>