Linear Regression Diagnostics











up vote
1
down vote

favorite












I am trying to determine if there is a relationship between a dependent variable y and independent variable x by fitting a least squares regression model.



Scatterplot of data:



Diagnostic plots:



The residuals seem to have constant variance, and there isn't any clear pattern in the residual vs fitted plot. However, the R-squared and the significance of the model fit's coefficients are very low. In this case, are there any nonlinearity issues that needs to be remediated with a transformation or can I conclude that my model is adequate with the correct functional form ?



Here is the summary of the model:



lm(formula = y ~ x, data = data)

Residuals:
Min 1Q Median 3Q Max
-331911 -235678 -145867 30576 1749376

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 2.440e+05 7.037e+04 3.468 0.00135 **
x 1.796e-04 6.206e-04 0.289 0.77385
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 390100 on 37 degrees of freedom
Multiple R-squared: 0.002259, Adjusted R-squared: -0.02471
F-statistic: 0.08378 on 1 and 37 DF, p-value: 0.7739









share|cite|improve this question









New contributor




joe is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.




















  • The scatter plot shows the relation is not clearly linear, and accordingly the p-value for $x$ would lead to remove the variable from the model. I'd say the variability is not well explained by $x$ and you should look for other regressors. Also, the $R^2$ is far too low, which means the model explains only a small fration of the variance of $y$.
    – Jean-Claude Arbaut
    Nov 23 at 9:02












  • I completely agree with Jean-Claude Arbaut.
    – Adrian Keister
    yesterday















up vote
1
down vote

favorite












I am trying to determine if there is a relationship between a dependent variable y and independent variable x by fitting a least squares regression model.



Scatterplot of data:



Diagnostic plots:



The residuals seem to have constant variance, and there isn't any clear pattern in the residual vs fitted plot. However, the R-squared and the significance of the model fit's coefficients are very low. In this case, are there any nonlinearity issues that needs to be remediated with a transformation or can I conclude that my model is adequate with the correct functional form ?



Here is the summary of the model:



lm(formula = y ~ x, data = data)

Residuals:
Min 1Q Median 3Q Max
-331911 -235678 -145867 30576 1749376

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 2.440e+05 7.037e+04 3.468 0.00135 **
x 1.796e-04 6.206e-04 0.289 0.77385
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 390100 on 37 degrees of freedom
Multiple R-squared: 0.002259, Adjusted R-squared: -0.02471
F-statistic: 0.08378 on 1 and 37 DF, p-value: 0.7739









share|cite|improve this question









New contributor




joe is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.




















  • The scatter plot shows the relation is not clearly linear, and accordingly the p-value for $x$ would lead to remove the variable from the model. I'd say the variability is not well explained by $x$ and you should look for other regressors. Also, the $R^2$ is far too low, which means the model explains only a small fration of the variance of $y$.
    – Jean-Claude Arbaut
    Nov 23 at 9:02












  • I completely agree with Jean-Claude Arbaut.
    – Adrian Keister
    yesterday













up vote
1
down vote

favorite









up vote
1
down vote

favorite











I am trying to determine if there is a relationship between a dependent variable y and independent variable x by fitting a least squares regression model.



Scatterplot of data:



Diagnostic plots:



The residuals seem to have constant variance, and there isn't any clear pattern in the residual vs fitted plot. However, the R-squared and the significance of the model fit's coefficients are very low. In this case, are there any nonlinearity issues that needs to be remediated with a transformation or can I conclude that my model is adequate with the correct functional form ?



Here is the summary of the model:



lm(formula = y ~ x, data = data)

Residuals:
Min 1Q Median 3Q Max
-331911 -235678 -145867 30576 1749376

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 2.440e+05 7.037e+04 3.468 0.00135 **
x 1.796e-04 6.206e-04 0.289 0.77385
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 390100 on 37 degrees of freedom
Multiple R-squared: 0.002259, Adjusted R-squared: -0.02471
F-statistic: 0.08378 on 1 and 37 DF, p-value: 0.7739









share|cite|improve this question









New contributor




joe is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











I am trying to determine if there is a relationship between a dependent variable y and independent variable x by fitting a least squares regression model.



Scatterplot of data:



Diagnostic plots:



The residuals seem to have constant variance, and there isn't any clear pattern in the residual vs fitted plot. However, the R-squared and the significance of the model fit's coefficients are very low. In this case, are there any nonlinearity issues that needs to be remediated with a transformation or can I conclude that my model is adequate with the correct functional form ?



Here is the summary of the model:



lm(formula = y ~ x, data = data)

Residuals:
Min 1Q Median 3Q Max
-331911 -235678 -145867 30576 1749376

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 2.440e+05 7.037e+04 3.468 0.00135 **
x 1.796e-04 6.206e-04 0.289 0.77385
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 390100 on 37 degrees of freedom
Multiple R-squared: 0.002259, Adjusted R-squared: -0.02471
F-statistic: 0.08378 on 1 and 37 DF, p-value: 0.7739






statistics regression data-analysis linear-regression regression-analysis






share|cite|improve this question









New contributor




joe is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|cite|improve this question









New contributor




joe is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|cite|improve this question




share|cite|improve this question








edited Nov 23 at 9:10









Jean-Claude Arbaut

14.9k63362




14.9k63362






New contributor




joe is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked Nov 23 at 8:44









joe

61




61




New contributor




joe is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





joe is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






joe is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.












  • The scatter plot shows the relation is not clearly linear, and accordingly the p-value for $x$ would lead to remove the variable from the model. I'd say the variability is not well explained by $x$ and you should look for other regressors. Also, the $R^2$ is far too low, which means the model explains only a small fration of the variance of $y$.
    – Jean-Claude Arbaut
    Nov 23 at 9:02












  • I completely agree with Jean-Claude Arbaut.
    – Adrian Keister
    yesterday


















  • The scatter plot shows the relation is not clearly linear, and accordingly the p-value for $x$ would lead to remove the variable from the model. I'd say the variability is not well explained by $x$ and you should look for other regressors. Also, the $R^2$ is far too low, which means the model explains only a small fration of the variance of $y$.
    – Jean-Claude Arbaut
    Nov 23 at 9:02












  • I completely agree with Jean-Claude Arbaut.
    – Adrian Keister
    yesterday
















The scatter plot shows the relation is not clearly linear, and accordingly the p-value for $x$ would lead to remove the variable from the model. I'd say the variability is not well explained by $x$ and you should look for other regressors. Also, the $R^2$ is far too low, which means the model explains only a small fration of the variance of $y$.
– Jean-Claude Arbaut
Nov 23 at 9:02






The scatter plot shows the relation is not clearly linear, and accordingly the p-value for $x$ would lead to remove the variable from the model. I'd say the variability is not well explained by $x$ and you should look for other regressors. Also, the $R^2$ is far too low, which means the model explains only a small fration of the variance of $y$.
– Jean-Claude Arbaut
Nov 23 at 9:02














I completely agree with Jean-Claude Arbaut.
– Adrian Keister
yesterday




I completely agree with Jean-Claude Arbaut.
– Adrian Keister
yesterday















active

oldest

votes











Your Answer





StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "69"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});






joe is a new contributor. Be nice, and check out our Code of Conduct.










draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3010131%2flinear-regression-diagnostics%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown






























active

oldest

votes













active

oldest

votes









active

oldest

votes






active

oldest

votes








joe is a new contributor. Be nice, and check out our Code of Conduct.










draft saved

draft discarded


















joe is a new contributor. Be nice, and check out our Code of Conduct.













joe is a new contributor. Be nice, and check out our Code of Conduct.












joe is a new contributor. Be nice, and check out our Code of Conduct.
















Thanks for contributing an answer to Mathematics Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.





Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


Please pay close attention to the following guidance:


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3010131%2flinear-regression-diagnostics%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Berounka

Sphinx de Gizeh

Different font size/position of beamer's navigation symbols template's content depending on regular/plain...